gofilipa
/

mistral-7b-congress-117-118

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

gofilipa commited on May 28

Commit

39e6979

•

1 Parent(s): 3ada7d9

Update README.md

Files changed (1) hide show

README.md +10 -17

README.md CHANGED Viewed

@@ -17,21 +17,11 @@ tags: []
 This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.
-- **Developed by:** [More Information Needed]
-- **Funded by [optional]:** [More Information Needed]
-- **Shared by [optional]:** [More Information Needed]
-- **Model type:** [More Information Needed]
-- **Language(s) (NLP):** [More Information Needed]
-- **License:** [More Information Needed]
-- **Finetuned from model [optional]:** [More Information Needed]
-### Model Sources [optional]
-<!-- Provide the basic links for the model. -->
-- **Repository:** [More Information Needed]
-- **Paper [optional]:** [More Information Needed]
-- **Demo [optional]:** [More Information Needed]
 ## Uses
@@ -75,11 +65,15 @@ Use the code below to get started with the model.
 ## Training Details
 ### Training Data
 <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
-[More Information Needed]
 ### Training Procedure
@@ -89,7 +83,6 @@ Use the code below to get started with the model.
 [More Information Needed]
 #### Training Hyperparameters
 - **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->

 This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.
+- **Developed by:** Filipa Calado
+- **Model type:** Text Generation
+- **Language(s) (NLP):** English
+- **License:** MIT
+- **Finetuned from model:** Mistral 7-b
 ## Uses
 ## Training Details
+num_train_epochs=5
+learning_rate=2e-4
+weight_decay=0.001
 ### Training Data
 <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
+[gender_congress_117-118 ](https://huggingface.co/datasets/gofilipa/gender_congress_117-118)
 ### Training Procedure
 [More Information Needed]
 #### Training Hyperparameters
 - **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->