dmariko committed
Commit f21cf38
1 Parent(s): a1ef4fc

SmolLM-1.7B-Instruct_fsdp_qlora_nf4_adapter

README.md CHANGED
@@ -9,18 +9,18 @@ base_model: HuggingFaceTB/SmolLM-1.7B-Instruct
 datasets:
 - generator
 model-index:
-- name: SmolLM_1_7B_Instruct_qlora_nf4_merged
+- name: SmolLM_1_7B_Instruct_qlora_nf4
   results: []
 ---
 
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
 
-# SmolLM_1_7B_Instruct_qlora_nf4_merged
+# SmolLM_1_7B_Instruct_qlora_nf4
 
 This model is a fine-tuned version of [HuggingFaceTB/SmolLM-1.7B-Instruct](https://huggingface.co/HuggingFaceTB/SmolLM-1.7B-Instruct) on the generator dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.6129
+- Loss: 1.6111
 
 ## Model description
 
@@ -57,26 +57,26 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch   | Step | Validation Loss |
 |:-------------:|:-------:|:----:|:---------------:|
-| 2.088         | 0.9524  | 10   | 1.9222          |
-| 1.8671        | 2.0     | 21   | 1.7931          |
-| 1.7735        | 2.9524  | 31   | 1.7340          |
-| 1.7236        | 4.0     | 42   | 1.6932          |
-| 1.6739        | 4.9524  | 52   | 1.6680          |
-| 1.652         | 6.0     | 63   | 1.6494          |
-| 1.6354        | 6.9524  | 73   | 1.6379          |
-| 1.6139        | 8.0     | 84   | 1.6288          |
-| 1.5938        | 8.9524  | 94   | 1.6233          |
-| 1.5828        | 10.0    | 105  | 1.6189          |
-| 1.5722        | 10.9524 | 115  | 1.6164          |
-| 1.5588        | 12.0    | 126  | 1.6149          |
-| 1.5539        | 12.9524 | 136  | 1.6141          |
-| 1.5506        | 14.0    | 147  | 1.6134          |
-| 1.5437        | 14.9524 | 157  | 1.6132          |
-| 1.5427        | 16.0    | 168  | 1.6130          |
-| 1.5407        | 16.9524 | 178  | 1.6130          |
-| 1.5386        | 18.0    | 189  | 1.6130          |
-| 1.5373        | 18.9524 | 199  | 1.6130          |
-| 1.5397        | 19.0476 | 200  | 1.6129          |
+| 2.0769        | 0.9524  | 10   | 1.9176          |
+| 1.8602        | 2.0     | 21   | 1.7910          |
+| 1.7729        | 2.9524  | 31   | 1.7320          |
+| 1.7147        | 4.0     | 42   | 1.6913          |
+| 1.6753        | 4.9524  | 52   | 1.6662          |
+| 1.6518        | 6.0     | 63   | 1.6477          |
+| 1.6228        | 6.9524  | 73   | 1.6361          |
+| 1.6118        | 8.0     | 84   | 1.6274          |
+| 1.5843        | 8.9524  | 94   | 1.6214          |
+| 1.5805        | 10.0    | 105  | 1.6173          |
+| 1.5712        | 10.9524 | 115  | 1.6151          |
+| 1.5524        | 12.0    | 126  | 1.6133          |
+| 1.5491        | 12.9524 | 136  | 1.6121          |
+| 1.5445        | 14.0    | 147  | 1.6113          |
+| 1.5397        | 14.9524 | 157  | 1.6113          |
+| 1.5392        | 16.0    | 168  | 1.6114          |
+| 1.5337        | 16.9524 | 178  | 1.6111          |
+| 1.5347        | 18.0    | 189  | 1.6111          |
+| 1.5337        | 18.9524 | 199  | 1.6111          |
+| 1.5351        | 19.0476 | 200  | 1.6111          |
 
 
 ### Framework versions
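
The updated card describes a QLoRA (4-bit NF4) fine-tune of SmolLM-1.7B-Instruct, and this commit stores only the LoRA adapter. A minimal sketch of how such an adapter is typically loaded follows; the adapter repo id (taken from the commit title) and the bfloat16 compute dtype are assumptions, since neither is stated in this diff.

```python
# Hedged sketch: load the base model in 4-bit NF4 and attach this LoRA adapter.
# The adapter repo id below is an assumption based on the commit title;
# substitute the actual Hub repo or a local path.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import PeftModel

base_id = "HuggingFaceTB/SmolLM-1.7B-Instruct"
adapter_id = "dmariko/SmolLM-1.7B-Instruct_fsdp_qlora_nf4_adapter"  # assumed repo id

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",              # NF4 quantization, as in the card title
    bnb_4bit_compute_dtype=torch.bfloat16,  # assumed compute dtype
)

tokenizer = AutoTokenizer.from_pretrained(base_id)
model = AutoModelForCausalLM.from_pretrained(base_id, quantization_config=bnb_config)
model = PeftModel.from_pretrained(model, adapter_id)

inputs = tokenizer("Hello, how are you?", return_tensors="pt").to(model.device)
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=32)[0]))
```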
adapter_config.json CHANGED
@@ -20,13 +20,13 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "q_proj",
-    "down_proj",
-    "up_proj",
-    "o_proj",
-    "v_proj",
     "k_proj",
-    "gate_proj"
+    "v_proj",
+    "gate_proj",
+    "o_proj",
+    "up_proj",
+    "q_proj",
+    "down_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e1d77b8cef142fc3f4e62dcf57318c7ebf3809772ac44c6388d5222ab7b734c8
+oid sha256:c3a4c40f6a10d6fc3272a401bd4d0f52278f41ae12d179f9cadc4402d40e0279
 size 36220744
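
Both revisions of adapter_model.safetensors are Git LFS pointers of the same size (36,220,744 bytes); only the content hash changed. To confirm that a local download corresponds to the new revision, a check like the following works; the local filename is an assumption about where the file was saved.

```python
# Hedged sketch: verify a downloaded adapter_model.safetensors against the
# sha256 oid recorded in the Git LFS pointer above.
import hashlib

expected_oid = "c3a4c40f6a10d6fc3272a401bd4d0f52278f41ae12d179f9cadc4402d40e0279"

sha = hashlib.sha256()
with open("adapter_model.safetensors", "rb") as f:          # assumed local path
    for chunk in iter(lambda: f.read(1 << 20), b""):
        sha.update(chunk)

assert sha.hexdigest() == expected_oid, "checksum mismatch"
print("adapter_model.safetensors matches the LFS pointer oid")
```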
runs/Sep04_14-46-19_algo-2/events.out.tfevents.1725461208.algo-2.69.0 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:116ce818c01ba78a5ff0c099c7b051a4b6418b19f3bf8d60d9717135513417dc
+size 15367
training_args.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:67bd4f3f39a9abe7f016e7e8083bbc1700854b6b03abdfbb614abf6c15d3b93b
+oid sha256:3cdd7102f1042f158c2abca7f8429922f42b01b3ea7573c0aea6cd2d44ac80de
 size 5240
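
training_args.bin is the pickled TrainingArguments object the Trainer saves alongside checkpoints. A minimal way to inspect the hyperparameters behind the results table, assuming the file has been downloaded locally from a trusted source, is sketched below.

```python
# Hedged sketch: inspect the hyperparameters stored in training_args.bin.
# The file is a pickle, so load it with weights_only=False and only if trusted.
import torch

args = torch.load("training_args.bin", weights_only=False)
print(args.learning_rate, args.num_train_epochs, args.per_device_train_batch_size)
```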