arshiakarimian1 committed on
Commit b2af41d
1 Parent(s): d3e1d33

End of training

Files changed (2)
  1. README.md +2 -2
  2. generation_config.json +1 -0
README.md CHANGED
@@ -33,14 +33,14 @@ More information needed
  ### Training hyperparameters
  
  The following hyperparameters were used during training:
- - learning_rate: 5e-05
+ - learning_rate: 2e-05
  - train_batch_size: 8
  - eval_batch_size: 8
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
  - lr_scheduler_warmup_steps: 500
- - num_epochs: 1
+ - num_epochs: 4
  - mixed_precision_training: Native AMP
  
  ### Training results
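
For readers checking what the new values imply, here is a minimal sketch of a linear schedule with warmup, using the `learning_rate: 2e-05` and `lr_scheduler_warmup_steps: 500` from the updated README. The function name `linear_schedule_lr` and the `total_steps` default of 2000 are hypothetical: the diff does not state the dataset size, so the real number of optimizer steps across 4 epochs is unknown. The shape (ramp up linearly from zero, then decay linearly to zero) is the usual meaning of a linear scheduler, not something this commit confirms in detail.

```python
def linear_schedule_lr(step: int,
                       base_lr: float = 2e-05,    # learning_rate from the README
                       warmup_steps: int = 500,   # lr_scheduler_warmup_steps from the README
                       total_steps: int = 2000    # hypothetical; not given in the diff
                       ) -> float:
    """Learning rate at `step`: linear warmup, then linear decay to zero."""
    if step < warmup_steps:
        # Warmup phase: scale from 0 up to base_lr.
        return base_lr * step / max(1, warmup_steps)
    # Decay phase: scale from base_lr down to 0 at total_steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    for s in (0, 250, 500, 1250, 2000):
        print(s, linear_schedule_lr(s))
```

With only 500 warmup steps, a short run spends a large share of training below the peak rate, which is one reason to look at the step count alongside `num_epochs`.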
generation_config.json CHANGED
@@ -6,5 +6,6 @@
  128008,
  128009
  ],
+ "pad_token_id": 128009,
  "transformers_version": "4.44.2"
  }
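
The added `pad_token_id` tells generation code which ID to use when sequences in a batch have different lengths. A rough sketch of that idea in plain Python follows. The helper `left_pad` is hypothetical and is not part of any library, and left padding is an assumption here (it is a common choice for batched generation); the commit itself only sets the ID. Note that 128009 also appears in the list just above the new line in the hunk; what that list holds is not visible in the diff.

```python
PAD_TOKEN_ID = 128009  # value added to generation_config.json in this commit


def left_pad(batch, pad_id=PAD_TOKEN_ID):
    """Pad every sequence on the left to the longest length in the batch.

    Returns (input_ids, attention_mask); mask entries are 0 over padding
    and 1 over real tokens.
    """
    width = max(len(seq) for seq in batch)
    input_ids = [[pad_id] * (width - len(seq)) + list(seq) for seq in batch]
    attention_mask = [[0] * (width - len(seq)) + [1] * len(seq) for seq in batch]
    return input_ids, attention_mask


if __name__ == "__main__":
    ids, mask = left_pad([[1, 2, 3], [4]])
    print(ids)
    print(mask)
```

The attention mask matters because the pad ID reuses a value that can also occur as a real token, so downstream code cannot identify padding by ID alone.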