baotnguyen/ncis

Files changed (6) hide show

README.md CHANGED Viewed

@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1148
-- Accuracy: 0.9545
 ## Model description
@@ -50,12 +50,10 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
-| 0.5913        | 1.0   | 37   | 0.6625          | 0.7778   |
-| 0.3835        | 2.0   | 74   | 0.2032          | 0.9040   |
-| 0.2724        | 3.0   | 111  | 0.1148          | 0.9545   |
-| 0.1354        | 4.0   | 148  | 0.2613          | 0.9646   |
-| 0.034         | 5.0   | 185  | 0.2067          | 0.9646   |
-| 0.062         | 6.0   | 222  | 0.2459          | 0.9747   |
 ### Framework versions
@@ -63,5 +61,5 @@ The following hyperparameters were used during training:
 - PEFT 0.12.0
 - Transformers 4.44.0
 - Pytorch 2.1.2
-- Datasets 2.20.0
 - Tokenizers 0.19.1

 This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1850
+- Accuracy: 0.9192
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
+| 0.4941        | 1.0   | 37   | 0.1850          | 0.9192   |
+| 1.0952        | 2.0   | 74   | 0.4716          | 0.8737   |
+| 0.221         | 3.0   | 111  | 0.3881          | 0.8636   |
+| 0.3177        | 4.0   | 148  | 0.1900          | 0.9596   |
 ### Framework versions
 - PEFT 0.12.0
 - Transformers 4.44.0
 - Pytorch 2.1.2
+- Datasets 2.21.0
 - Tokenizers 0.19.1

adapter_config.json CHANGED Viewed

@@ -23,10 +23,10 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "k_proj",
-    "v_proj",
     "o_proj",
-    "q_proj"
   ],
   "task_type": "SEQ_CLS",
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
     "o_proj",
+    "k_proj",
+    "q_proj",
+    "v_proj"
   ],
   "task_type": "SEQ_CLS",
   "use_dora": false,

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5de477e27cb0cff9296879a93d0b91e4dd1d00cc0170463899017218b0d2a9e4
 size 54593240

 version https://git-lfs.github.com/spec/v1
+oid sha256:8c9cb8364228b9f77e8f99ec3e9385e6395fdd2f5ea94a0a0a93c6c5ea59c75e
 size 54593240

runs/Aug14_22-30-42_2b24443ee4ab/events.out.tfevents.1723674643.2b24443ee4ab.34.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:6b91386bfcc77e2b3656c10ae7dca29c1e89269abe302ba1434a4a5ef0160c46
+size 13050

runs/Aug14_22-30-42_2b24443ee4ab/events.out.tfevents.1723675568.2b24443ee4ab.34.1 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:768330bdacf02cc40e7caf0ae6d9e80051c603464a0d4f19b92fb491691804c0
+size 411

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d30a2bf07e4f43cb53ebee8cb01f77367fc8fe9c92b241d3cd93ccec092c1a9b
 size 5176

 version https://git-lfs.github.com/spec/v1
+oid sha256:22c453b1c3d4196ee54f74e3487e2f372819a699e3c9a1374ca28f81e8590b97
 size 5176