liuylhf/special-token-qkvo

Generated from Trainer

4-bit precision

liuylhf committed on Apr 8

Commit c9174f2 • 1 Parent(s): 7542d6d

End of training

Files changed (2)

README.md +15 -1
adapter_model.bin +3 -0

README.md CHANGED

@@ -2,6 +2,7 @@
 license: apache-2.0
 library_name: peft
 tags:
+- axolotl
 - generated_from_trainer
 base_model: mistralai/Mixtral-8x7B-Instruct-v0.1
 model-index:
@@ -92,7 +93,9 @@ weight_decay: 0.0
 # special-token-qkvo
-This model is a fine-tuned version of [mistralai/Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1) on an unknown dataset.
+This model is a fine-tuned version of [mistralai/Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1) on the None dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.0788
 ## Model description
@@ -125,6 +128,17 @@ The following hyperparameters were used during training:
 - lr_scheduler_warmup_steps: 10
 - num_epochs: 4
+### Training results
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| 2.1829        | 0.01  | 1    | 2.1038          |
+| 0.0969        | 0.8   | 151  | 0.0879          |
+| 0.0772        | 1.58  | 302  | 0.0828          |
+| 0.0742        | 2.36  | 453  | 0.0804          |
+| 0.0748        | 3.14  | 604  | 0.0788          |
 ### Framework versions
 - PEFT 0.9.0
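
The training results added in this commit can be sanity-checked with a little arithmetic. The sketch below is only an illustration: the rows are copied from the table in the diff, while the helper names (`validation_deltas`, `steps_per_epoch`) are hypothetical and not part of the repository. The epoch column is rounded to two decimals in the README, so the steps-per-epoch figure is an estimate.

```python
# Rows copied from the "Training results" table in the README diff:
# (training loss, epoch, step, validation loss)
ROWS = [
    (2.1829, 0.01, 1, 2.1038),
    (0.0969, 0.80, 151, 0.0879),
    (0.0772, 1.58, 302, 0.0828),
    (0.0742, 2.36, 453, 0.0804),
    (0.0748, 3.14, 604, 0.0788),
]


def validation_deltas(rows):
    """Return (step, drop in validation loss since the previous evaluation)."""
    return [
        (cur[2], round(prev[3] - cur[3], 4))
        for prev, cur in zip(rows, rows[1:])
    ]


def steps_per_epoch(rows):
    """Estimate optimizer steps per epoch from the last logged row."""
    _, epoch, step, _ = rows[-1]
    return step / epoch


deltas = validation_deltas(ROWS)
final_val_loss = ROWS[-1][3]  # matches the "Loss: 0.0788" line in the README

for step, drop in deltas:
    print(f"step {step}: validation loss fell by {drop}")
print(f"estimated steps per epoch: {steps_per_epoch(ROWS):.1f}")
```

Nearly all of the improvement happens before the first full evaluation at step 151; each later evaluation lowers the validation loss by a smaller amount than the one before it.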

adapter_model.bin ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:3004a557c07fc1c36d80ab46328cc3d7479d7cc65a4f7f7c1879a7d60253ea9b
+size 109144714
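
The three added lines are a Git LFS pointer, not the adapter weights themselves: the repository stores only the SHA-256 digest and byte size (about 109 MB) of the real file. As a minimal sketch, a downloaded copy could be checked against the pointer as shown below. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the local path in the usage note is an assumption.

```python
import hashlib
from pathlib import Path

# Pointer contents copied from the diff above.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:3004a557c07fc1c36d80ab46328cc3d7479d7cc65a4f7f7c1879a7d60253ea9b
size 109144714
"""


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = dict(
        line.split(" ", 1) for line in text.splitlines() if line.strip()
    )
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Check a local file's size and hash against a parsed pointer."""
    path = Path(path)
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as fh:
        # Read in chunks so a large weights file is never fully in memory.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(POINTER_TEXT)
```

Assuming the weights were saved locally as `adapter_model.bin`, calling `matches_pointer("adapter_model.bin", pointer)` returns `True` only when both the size and the digest agree with the pointer.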