bunbohue commited on
Commit
7bb801a
1 Parent(s): 7a70456

Model save

Browse files
Files changed (1) hide show
  1. README.md +12 -6
README.md CHANGED
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  This model is a fine-tuned version of [meta-llama/Llama-2-7b-hf](https://huggingface.co/meta-llama/Llama-2-7b-hf) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: 1.7089
20
 
21
  ## Model description
22
 
@@ -35,7 +35,7 @@ More information needed
35
  ### Training hyperparameters
36
 
37
  The following hyperparameters were used during training:
38
- - learning_rate: 1e-05
39
  - train_batch_size: 1
40
  - eval_batch_size: 1
41
  - seed: 42
@@ -44,16 +44,22 @@ The following hyperparameters were used during training:
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: linear
46
  - lr_scheduler_warmup_ratio: 0.05
47
- - num_epochs: 3
48
  - mixed_precision_training: Native AMP
49
 
50
  ### Training results
51
 
52
  | Training Loss | Epoch | Step | Validation Loss |
53
  |:-------------:|:-----:|:----:|:---------------:|
54
- | 1.5026 | 1.0 | 1457 | 1.7267 |
55
- | 1.4926 | 2.0 | 2915 | 1.7124 |
56
- | 1.4237 | 3.0 | 4371 | 1.7089 |
 
 
 
 
 
 
57
 
58
 
59
  ### Framework versions
 
16
 
17
  This model is a fine-tuned version of [meta-llama/Llama-2-7b-hf](https://huggingface.co/meta-llama/Llama-2-7b-hf) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 1.6552
20
 
21
  ## Model description
22
 
 
35
  ### Training hyperparameters
36
 
37
  The following hyperparameters were used during training:
38
+ - learning_rate: 0.0001
39
  - train_batch_size: 1
40
  - eval_batch_size: 1
41
  - seed: 42
 
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: linear
46
  - lr_scheduler_warmup_ratio: 0.05
47
+ - num_epochs: 2
48
  - mixed_precision_training: Native AMP
49
 
50
  ### Training results
51
 
52
  | Training Loss | Epoch | Step | Validation Loss |
53
  |:-------------:|:-----:|:----:|:---------------:|
54
+ | 1.548 | 0.21 | 300 | 1.7227 |
55
+ | 1.3576 | 0.41 | 600 | 1.7006 |
56
+ | 1.3032 | 0.62 | 900 | 1.6854 |
57
+ | 1.4151 | 0.82 | 1200 | 1.6771 |
58
+ | 1.8081 | 1.03 | 1500 | 1.6698 |
59
+ | 1.6592 | 1.23 | 1800 | 1.6649 |
60
+ | 1.6466 | 1.44 | 2100 | 1.6600 |
61
+ | 1.7293 | 1.65 | 2400 | 1.6574 |
62
+ | 1.6455 | 1.85 | 2700 | 1.6552 |
63
 
64
 
65
  ### Framework versions