deboramachadoandrade commited on
Commit
581d994
1 Parent(s): 28ba385

<deboramachadoandrade>/mistral-7binstruct-summary-100s

Browse files
README.md CHANGED
@@ -20,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
20
 
21
  This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) on the generator dataset.
22
  It achieves the following results on the evaluation set:
23
- - Loss: 1.5231
24
 
25
  ## Model description
26
 
@@ -52,14 +52,14 @@ The following hyperparameters were used during training:
52
 
53
  | Training Loss | Epoch | Step | Validation Loss |
54
  |:-------------:|:-----:|:----:|:---------------:|
55
- | 1.6414 | 0.03 | 25 | 1.6233 |
56
- | 1.5128 | 0.05 | 50 | 1.5516 |
57
- | 1.5799 | 0.08 | 75 | 1.5417 |
58
- | 1.5475 | 0.11 | 100 | 1.5351 |
59
- | 1.4229 | 0.13 | 125 | 1.5304 |
60
- | 1.6014 | 0.16 | 150 | 1.5308 |
61
- | 1.5296 | 0.19 | 175 | 1.5230 |
62
- | 1.577 | 0.22 | 200 | 1.5231 |
63
 
64
 
65
  ### Framework versions
 
20
 
21
  This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) on the generator dataset.
22
  It achieves the following results on the evaluation set:
23
+ - Loss: 1.5225
24
 
25
  ## Model description
26
 
 
52
 
53
  | Training Loss | Epoch | Step | Validation Loss |
54
  |:-------------:|:-----:|:----:|:---------------:|
55
+ | 1.6742 | 0.03 | 25 | 1.6357 |
56
+ | 1.5587 | 0.05 | 50 | 1.5519 |
57
+ | 1.5001 | 0.08 | 75 | 1.5390 |
58
+ | 1.5723 | 0.11 | 100 | 1.5331 |
59
+ | 1.4801 | 0.13 | 125 | 1.5288 |
60
+ | 1.5756 | 0.16 | 150 | 1.5269 |
61
+ | 1.4888 | 0.19 | 175 | 1.5251 |
62
+ | 1.5356 | 0.22 | 200 | 1.5225 |
63
 
64
 
65
  ### Framework versions
adapter_config.json CHANGED
@@ -19,8 +19,8 @@
19
  "rank_pattern": {},
20
  "revision": null,
21
  "target_modules": [
22
- "q_proj",
23
- "v_proj"
24
  ],
25
  "task_type": "CAUSAL_LM",
26
  "use_dora": false,
 
19
  "rank_pattern": {},
20
  "revision": null,
21
  "target_modules": [
22
+ "v_proj",
23
+ "q_proj"
24
  ],
25
  "task_type": "CAUSAL_LM",
26
  "use_dora": false,
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e44ce263e6fd885f50d82ca515b9325375b43ee36ededb75acf161ce88bc2e41
3
- size 48
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b415eaade98187c4cf064805d28d258c7def485508e13d5cec4c31dcb66ab8bf
3
+ size 27280152
runs/Mar04_23-17-12_b2f1c2a0054d/events.out.tfevents.1709594235.b2f1c2a0054d.1093.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ceb476bdfde91146945cac929675545bdc28c5801868491d91ecb54b498cc332
3
+ size 11718
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:441d7ec3d3fb537aafc3b9e7acf12e80be2f6e0a6a99efdb3644ed6efb3298b0
3
  size 4920
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d4321f9c9cd609e474e245efd4f3aac2f1bfdd1c2e00f2fe723ae3f0a287fa71
3
  size 4920