baotnguyen/ncis

Browse files

Files changed (5) hide show

README.md +10 -10
adapter_model.safetensors +1 -1
runs/Jul22_18-46-19_90cc069fc707/events.out.tfevents.1721673980.90cc069fc707.34.12 +3 -0
runs/Jul22_18-46-19_90cc069fc707/events.out.tfevents.1721674573.90cc069fc707.34.13 +3 -0
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -20,12 +20,13 @@ should probably proofread and complete it, then remove this comment. -->
 [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/phdatdt/Fine%20tuning%20mistral%207B/runs/ujkstyig)
 [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/phdatdt/Fine%20tuning%20mistral%207B/runs/4xqrob1d)
 [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/phdatdt/Fine%20tuning%20mistral%207B/runs/errke2dy)
 # ncis
 This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3363
-- Accuracy: 0.9231
 ## Model description
@@ -44,24 +45,23 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 7.5e-05
 - train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 8
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
-| 0.5027        | 1.0   | 19   | 1.7724          | 0.6496   |
-| 0.0243        | 2.0   | 38   | 1.3924          | 0.7607   |
-| 0.0           | 3.0   | 57   | 0.3363          | 0.9231   |
-| 0.0           | 4.0   | 76   | 3.2341          | 0.6325   |
-| 0.0           | 5.0   | 95   | 3.7350          | 0.5726   |
-| 0.0           | 6.0   | 114  | 3.7031          | 0.5726   |
 ### Framework versions

 [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/phdatdt/Fine%20tuning%20mistral%207B/runs/ujkstyig)
 [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/phdatdt/Fine%20tuning%20mistral%207B/runs/4xqrob1d)
 [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/phdatdt/Fine%20tuning%20mistral%207B/runs/errke2dy)
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/phdatdt/Fine%20tuning%20mistral%207B/runs/ljcim1zv)
 # ncis
 This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.8589
+- Accuracy: 0.8205
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 0.0001
 - train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 6
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
+| 0.5641        | 1.0   | 19   | 1.7375          | 0.6496   |
+| 0.0034        | 2.0   | 38   | 0.8589          | 0.8205   |
+| 0.0           | 3.0   | 57   | 1.2763          | 0.7692   |
+| 0.0           | 4.0   | 76   | 1.3331          | 0.7692   |
+| 0.0           | 5.0   | 95   | 1.3380          | 0.7692   |
 ### Framework versions

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:46137ea6e37505cac166e695031a21408b9713fa7a73783a938b5a9364438d4f
 size 54593240

 version https://git-lfs.github.com/spec/v1
+oid sha256:b7585b86fd9ab388962672fa7edfe491525ec813c7ebf0b8f26882dc9335187f
 size 54593240

runs/Jul22_18-46-19_90cc069fc707/events.out.tfevents.1721673980.90cc069fc707.34.12 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:e1d635a0dac4ac533f37e93695db9bdb3af4a57b81f70349116aba3d07d7bfcb
+size 8490

runs/Jul22_18-46-19_90cc069fc707/events.out.tfevents.1721674573.90cc069fc707.34.13 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:01330b564ed4e52dc2b86cc9faca99eccd3f07a303dcfe48cb657cab6befa976
+size 405

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7040d3a277e60cb9c8009355eca52383eb2249b75284662fc63e610091a0f887
 size 5112

 version https://git-lfs.github.com/spec/v1
+oid sha256:ffcb371fc1749a568224e775b91905372fedbc0aca04c489537e2de8f4a173cb
 size 5112