liuylhf
/

mistral-lora

Generated from Trainer

Model card Files Files and versions Community

liuylhf commited on Feb 26

Commit

38f6b3e

•

1 Parent(s): 20e8ee9

Model save

Files changed (1) hide show

README.md +3 -17

README.md CHANGED Viewed

@@ -2,7 +2,6 @@
 license: apache-2.0
 library_name: peft
 tags:
-- axolotl
 - generated_from_trainer
 base_model: mistralai/Mistral-7B-Instruct-v0.2
 model-index:
@@ -60,7 +59,7 @@ wandb_log_model: end
 gradient_accumulation_steps: 4
 micro_batch_size: 2
-num_epochs: 0.5
 optimizer: paged_adamw_8bit
 lr_scheduler: cosine
 learning_rate: 0.001
@@ -103,9 +102,7 @@ fsdp_config:
 # mistral-lora
-This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) on the None dataset.
-It achieves the following results on the evaluation set:
-- Loss: 0.1480
 ## Model description
@@ -136,18 +133,7 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.95) and epsilon=1e-05
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 10
-- num_epochs: 0.5
-### Training results
-| Training Loss | Epoch | Step | Validation Loss |
-|:-------------:|:-----:|:----:|:---------------:|
-| 1.3787        | 0.0   | 1    | 1.4156          |
-| 0.0868        | 0.1   | 31   | 0.1745          |
-| 0.149         | 0.21  | 62   | 0.1603          |
-| 0.1328        | 0.31  | 93   | 0.1532          |
-| 0.1635        | 0.41  | 124  | 0.1480          |
 ### Framework versions

 license: apache-2.0
 library_name: peft
 tags:
 - generated_from_trainer
 base_model: mistralai/Mistral-7B-Instruct-v0.2
 model-index:
 gradient_accumulation_steps: 4
 micro_batch_size: 2
+num_epochs: 2
 optimizer: paged_adamw_8bit
 lr_scheduler: cosine
 learning_rate: 0.001
 # mistral-lora
+This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) on an unknown dataset.
 ## Model description
 - optimizer: Adam with betas=(0.9,0.95) and epsilon=1e-05
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 10
+- num_epochs: 2
 ### Framework versions