leonardlin
commited on
Commit
•
d8391e8
1
Parent(s):
853e9cb
Update README.md
Browse files
README.md
CHANGED
@@ -101,7 +101,7 @@ micro_batch_size: 2
|
|
101 |
num_epochs: 3
|
102 |
optimizer: paged_adamw_8bit
|
103 |
lr_scheduler: linear
|
104 |
-
learning_rate:
|
105 |
|
106 |
train_on_inputs: false
|
107 |
group_by_length: false
|
@@ -157,7 +157,7 @@ More information needed
|
|
157 |
### Training hyperparameters
|
158 |
|
159 |
The following hyperparameters were used during training:
|
160 |
-
- learning_rate:
|
161 |
- train_batch_size: 2
|
162 |
- eval_batch_size: 2
|
163 |
- seed: 42
|
|
|
101 |
num_epochs: 3
|
102 |
optimizer: paged_adamw_8bit
|
103 |
lr_scheduler: linear
|
104 |
+
learning_rate: 8e-6
|
105 |
|
106 |
train_on_inputs: false
|
107 |
group_by_length: false
|
|
|
157 |
### Training hyperparameters
|
158 |
|
159 |
The following hyperparameters were used during training:
|
160 |
+
- learning_rate: 8e-6
|
161 |
- train_batch_size: 2
|
162 |
- eval_batch_size: 2
|
163 |
- seed: 42
|