baotnguyen/ncis

Files changed (6)

README.md +10 -14
adapter_config.json +3 -3
adapter_model.safetensors +1 -1
runs/Jul26_15-32-43_9fa5a67d0f63/events.out.tfevents.1722007964.9fa5a67d0f63.34.0 +3 -0
runs/Jul26_15-32-43_9fa5a67d0f63/events.out.tfevents.1722009505.9fa5a67d0f63.34.1 +3 -0
training_args.bin +1 -1

README.md CHANGED

@@ -14,16 +14,13 @@ model-index:
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/phdatdt/Fine%20tuning%20mistral%207B/runs/3qvcjkr4)
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/phdatdt/Fine%20tuning%20mistral%207B/runs/gvsqg7wo)
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/phdatdt/Fine%20tuning%20mistral%207B/runs/7xoqej9s)
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/phdatdt/Fine%20tuning%20mistral%207B/runs/crnfiy2g)
 # ncis
 This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1286
-- Accuracy: 0.9545
 ## Model description
@@ -54,14 +51,13 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
-| 0.3789        | 1.0   | 37   | 0.1643          | 0.9444   |
-| 0.3206        | 2.0   | 74   | 0.1628          | 0.9192   |
-| 0.2918        | 3.0   | 111  | 0.2010          | 0.9040   |
-| 0.2759        | 4.0   | 148  | 0.3873          | 0.9242   |
-| 0.1263        | 5.0   | 185  | 0.1286          | 0.9545   |
-| 0.1482        | 6.0   | 222  | 0.1546          | 0.9495   |
-| 0.0104        | 7.0   | 259  | 0.1359          | 0.9646   |
-| 0.004         | 8.0   | 296  | 0.1463          | 0.9697   |
 ### Framework versions

 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/phdatdt/Fine%20tuning%20mistral%207B/runs/92tgfr4g)
 # ncis
 This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1145
+- Accuracy: 0.9697
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
+| 0.8268        | 1.0   | 37   | 0.4646          | 0.8030   |
+| 0.8255        | 2.0   | 74   | 0.2184          | 0.9040   |
+| 0.3172        | 3.0   | 111  | 0.2934          | 0.9192   |
+| 0.0518        | 4.0   | 148  | 0.1145          | 0.9697   |
+| 0.0066        | 5.0   | 185  | 0.1204          | 0.9697   |
+| 0.026         | 6.0   | 222  | 0.1884          | 0.9495   |
+| 0.0012        | 7.0   | 259  | 0.2145          | 0.9545   |
 ### Framework versions
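
The evaluation figures on the added side of the card (Loss 0.1145, Accuracy 0.9697) match the epoch 4 row of the new table, which is also the row with the lowest validation loss. A minimal sketch of that check follows. It assumes, without confirmation from the card, that the reported figures come from the checkpoint with the lowest validation loss; `rows` and `best_row` are illustrative names, not part of the repository.

```python
# Rows copied from the added (+) side of the README.md results table:
# (training_loss, epoch, step, validation_loss, accuracy)
rows = [
    (0.8268, 1.0, 37, 0.4646, 0.8030),
    (0.8255, 2.0, 74, 0.2184, 0.9040),
    (0.3172, 3.0, 111, 0.2934, 0.9192),
    (0.0518, 4.0, 148, 0.1145, 0.9697),
    (0.0066, 5.0, 185, 0.1204, 0.9697),
    (0.026, 6.0, 222, 0.1884, 0.9495),
    (0.0012, 7.0, 259, 0.2145, 0.9545),
]

# Assumption: "best" means lowest validation loss (index 3 of each row).
best_row = min(rows, key=lambda r: r[3])

print(f"epoch={best_row[1]} val_loss={best_row[3]} accuracy={best_row[4]}")
```

Validation loss rises after epoch 4 while training loss keeps falling, so the later rows look like overfitting rather than further improvement.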

adapter_config.json CHANGED

@@ -23,10 +23,10 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "q_proj",
-    "k_proj",
     "o_proj",
-    "v_proj"
   ],
   "task_type": "SEQ_CLS",
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "v_proj",
     "o_proj",
+    "q_proj",
+    "k_proj"
   ],
   "task_type": "SEQ_CLS",
   "use_dora": false,
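
Both sides of this hunk list the same four attention projections; only their order differs. The sketch below confirms that, assuming the order of `target_modules` carries no meaning (as it would not if the names are treated as a set; the diff itself does not say). The variable names are illustrative.

```python
# target_modules as listed on the removed (-) and added (+) sides of the diff.
old_target_modules = ["q_proj", "k_proj", "o_proj", "v_proj"]
new_target_modules = ["v_proj", "o_proj", "q_proj", "k_proj"]

# Compare as sets to ignore ordering, and as lists to detect the reorder.
same_modules = set(old_target_modules) == set(new_target_modules)
order_changed = old_target_modules != new_target_modules

print(f"same modules: {same_modules}, order changed: {order_changed}")
```

Under that assumption, the `adapter_config.json` change is cosmetic and the adapter targets the same layers before and after the commit.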

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:37e008a8ad6faf6db2be04f965de61e6f8443c6c51b8e463e236b7686e2b2e35
 size 54593240

 version https://git-lfs.github.com/spec/v1
+oid sha256:1be9b8ad1462b8d33e7d8a7170ac4eb2ef33db144cf09467cf3fe025f8fbce92
 size 54593240
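
What changes here is a Git LFS pointer, not the weights file itself: each pointer is a few `key value` lines. The sketch below parses the two pointers shown in the diff and compares them; `parse_lfs_pointer` is a hypothetical helper written for this illustration, not part of any Git LFS tooling.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Split each 'key value' line of a Git LFS pointer into a dict entry."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields


# Pointer contents copied from the removed (-) and added (+) sides of the diff.
old_pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:37e008a8ad6faf6db2be04f965de61e6f8443c6c51b8e463e236b7686e2b2e35
size 54593240
"""
new_pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:1be9b8ad1462b8d33e7d8a7170ac4eb2ef33db144cf09467cf3fe025f8fbce92
size 54593240
"""

old = parse_lfs_pointer(old_pointer)
new = parse_lfs_pointer(new_pointer)

size_unchanged = old["size"] == new["size"]
content_changed = old["oid"] != new["oid"]
print(f"size unchanged: {size_unchanged}, content changed: {content_changed}")
```

An identical byte size with a different SHA-256 is what one would expect from retrained adapter weights with the same tensor shapes, although the pointer alone cannot prove that.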

runs/Jul26_15-32-43_9fa5a67d0f63/events.out.tfevents.1722007964.9fa5a67d0f63.34.0 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:4feabf185d00e35ad38928ac21cac59fef30012eaec66c4d3b55b7e5893b583e
+size 18661

runs/Jul26_15-32-43_9fa5a67d0f63/events.out.tfevents.1722009505.9fa5a67d0f63.34.1 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:bf054e565faafc500fb58caf27729aee1de6165a44e09a2a9be5ec8126b6ec8e
+size 411
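
The two event file names appear to embed a Unix timestamp as their fourth dot-separated field. The sketch below decodes it, assuming the usual TensorBoard naming scheme of timestamp, hostname, process id, and a counter; the diff does not document this layout, and `event_file_time` is a hypothetical helper.

```python
from datetime import datetime, timezone


def event_file_time(filename: str) -> datetime:
    """Decode the assumed Unix timestamp field of a tfevents file name."""
    # Assumption: fields are events.out.tfevents.<timestamp>.<host>.<pid>.<n>
    stamp = int(filename.split(".")[3])
    return datetime.fromtimestamp(stamp, tz=timezone.utc)


first = event_file_time("events.out.tfevents.1722007964.9fa5a67d0f63.34.0")
second = event_file_time("events.out.tfevents.1722009505.9fa5a67d0f63.34.1")
gap_seconds = (second - first).total_seconds()

print(first.isoformat(), second.isoformat(), gap_seconds)
```

If the assumption holds, the first file was created within a second of the time in the run directory name (`Jul26_15-32-43`), and the second, much smaller file was written about 26 minutes later, possibly by a final evaluation pass.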

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:51923c4ede97e0320b66d0b8b34b53f056ec58ecd068e25507b03d108cd40696
 size 5176

 version https://git-lfs.github.com/spec/v1
+oid sha256:75cfa01ba8e272e07b160a97927ee4253900b314ede7ae7a139cf60917fa564d
 size 5176