BaoTNguyen commited on
Commit
930dce0
1 Parent(s): 8f80a4c

baotnguyen/ncis

Browse files
README.md CHANGED
@@ -14,16 +14,13 @@ model-index:
14
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
15
  should probably proofread and complete it, then remove this comment. -->
16
 
17
- [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/phdatdt/Fine%20tuning%20mistral%207B/runs/3qvcjkr4)
18
- [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/phdatdt/Fine%20tuning%20mistral%207B/runs/gvsqg7wo)
19
- [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/phdatdt/Fine%20tuning%20mistral%207B/runs/7xoqej9s)
20
- [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/phdatdt/Fine%20tuning%20mistral%207B/runs/crnfiy2g)
21
  # ncis
22
 
23
  This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the None dataset.
24
  It achieves the following results on the evaluation set:
25
- - Loss: 0.1286
26
- - Accuracy: 0.9545
27
 
28
  ## Model description
29
 
@@ -54,14 +51,13 @@ The following hyperparameters were used during training:
54
 
55
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
56
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
57
- | 0.3789 | 1.0 | 37 | 0.1643 | 0.9444 |
58
- | 0.3206 | 2.0 | 74 | 0.1628 | 0.9192 |
59
- | 0.2918 | 3.0 | 111 | 0.2010 | 0.9040 |
60
- | 0.2759 | 4.0 | 148 | 0.3873 | 0.9242 |
61
- | 0.1263 | 5.0 | 185 | 0.1286 | 0.9545 |
62
- | 0.1482 | 6.0 | 222 | 0.1546 | 0.9495 |
63
- | 0.0104 | 7.0 | 259 | 0.1359 | 0.9646 |
64
- | 0.004 | 8.0 | 296 | 0.1463 | 0.9697 |
65
 
66
 
67
  ### Framework versions
 
14
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
15
  should probably proofread and complete it, then remove this comment. -->
16
 
17
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/phdatdt/Fine%20tuning%20mistral%207B/runs/92tgfr4g)
 
 
 
18
  # ncis
19
 
20
  This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the None dataset.
21
  It achieves the following results on the evaluation set:
22
+ - Loss: 0.1145
23
+ - Accuracy: 0.9697
24
 
25
  ## Model description
26
 
 
51
 
52
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
53
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
54
+ | 0.8268 | 1.0 | 37 | 0.4646 | 0.8030 |
55
+ | 0.8255 | 2.0 | 74 | 0.2184 | 0.9040 |
56
+ | 0.3172 | 3.0 | 111 | 0.2934 | 0.9192 |
57
+ | 0.0518 | 4.0 | 148 | 0.1145 | 0.9697 |
58
+ | 0.0066 | 5.0 | 185 | 0.1204 | 0.9697 |
59
+ | 0.026 | 6.0 | 222 | 0.1884 | 0.9495 |
60
+ | 0.0012 | 7.0 | 259 | 0.2145 | 0.9545 |
 
61
 
62
 
63
  ### Framework versions
adapter_config.json CHANGED
@@ -23,10 +23,10 @@
23
  "rank_pattern": {},
24
  "revision": null,
25
  "target_modules": [
26
- "q_proj",
27
- "k_proj",
28
  "o_proj",
29
- "v_proj"
 
30
  ],
31
  "task_type": "SEQ_CLS",
32
  "use_dora": false,
 
23
  "rank_pattern": {},
24
  "revision": null,
25
  "target_modules": [
26
+ "v_proj",
 
27
  "o_proj",
28
+ "q_proj",
29
+ "k_proj"
30
  ],
31
  "task_type": "SEQ_CLS",
32
  "use_dora": false,
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:37e008a8ad6faf6db2be04f965de61e6f8443c6c51b8e463e236b7686e2b2e35
3
  size 54593240
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1be9b8ad1462b8d33e7d8a7170ac4eb2ef33db144cf09467cf3fe025f8fbce92
3
  size 54593240
runs/Jul26_15-32-43_9fa5a67d0f63/events.out.tfevents.1722007964.9fa5a67d0f63.34.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4feabf185d00e35ad38928ac21cac59fef30012eaec66c4d3b55b7e5893b583e
3
+ size 18661
runs/Jul26_15-32-43_9fa5a67d0f63/events.out.tfevents.1722009505.9fa5a67d0f63.34.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bf054e565faafc500fb58caf27729aee1de6165a44e09a2a9be5ec8126b6ec8e
3
+ size 411
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:51923c4ede97e0320b66d0b8b34b53f056ec58ecd068e25507b03d108cd40696
3
  size 5176
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:75cfa01ba8e272e07b160a97927ee4253900b314ede7ae7a139cf60917fa564d
3
  size 5176