liuylhf committed on
Commit c9174f2
1 Parent(s): 7542d6d

End of training

Files changed (2)
  1. README.md +15 -1
  2. adapter_model.bin +3 -0
README.md CHANGED
@@ -2,6 +2,7 @@
 license: apache-2.0
 library_name: peft
 tags:
+- axolotl
 - generated_from_trainer
 base_model: mistralai/Mixtral-8x7B-Instruct-v0.1
 model-index:
@@ -92,7 +93,9 @@ weight_decay: 0.0
 
 # special-token-qkvo
 
-This model is a fine-tuned version of [mistralai/Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1) on an unknown dataset.
+This model is a fine-tuned version of [mistralai/Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1) on the None dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.0788
 
 ## Model description
 
@@ -125,6 +128,17 @@ The following hyperparameters were used during training:
 - lr_scheduler_warmup_steps: 10
 - num_epochs: 4
 
+### Training results
+
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| 2.1829 | 0.01 | 1 | 2.1038 |
+| 0.0969 | 0.8 | 151 | 0.0879 |
+| 0.0772 | 1.58 | 302 | 0.0828 |
+| 0.0742 | 2.36 | 453 | 0.0804 |
+| 0.0748 | 3.14 | 604 | 0.0788 |
+
+
 ### Framework versions
 
 - PEFT 0.9.0
adapter_model.bin ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:3004a557c07fc1c36d80ab46328cc3d7479d7cc65a4f7f7c1879a7d60253ea9b
+size 109144714
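
The three lines added for adapter_model.bin are a Git LFS pointer, not the adapter weights themselves: the repository stores only the spec version, a SHA-256 object ID, and the size in bytes. As a rough illustration of reading such a pointer, here is a minimal Python sketch. The function name `parse_lfs_pointer` and the returned dictionary keys are hypothetical choices for this example and are not part of Git LFS or of this commit.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file (illustrative helper)."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # Each pointer line is a key, a single space, then the value.
        key, _, value = line.partition(" ")
        fields[key] = value

    # The oid has the form "<hash-algorithm>:<hex-digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size_bytes": int(fields["size"]),
    }


# Pointer contents copied from the diff above.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:3004a557c07fc1c36d80ab46328cc3d7479d7cc65a4f7f7c1879a7d60253ea9b
size 109144714
"""

info = parse_lfs_pointer(POINTER)
size_mib = info["size_bytes"] / 2**20  # convert bytes to MiB
print(f"{info['hash_algo']} digest, {info['size_bytes']} bytes (~{size_mib:.1f} MiB)")
```

Under these assumptions, the parsed size puts the adapter file at a little over 100 MiB, which is plausible for a PEFT adapter rather than full Mixtral weights.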