File size: 658 Bytes
82ea31e
 
 
 
 
 
 
 
 
 
c709228
82ea31e
 
 
 
6107752
82ea31e
829d31a
9d8303f
3b181f5
c3bdcb9
ca4d67b
dc53680
6847bc6
155db6f
127ce88
80eca61
4e74d2e
129f58f
555afbb
6b83cbd
96c9e40
e0769b2
43a9860
7f15b6c
10e0f2c
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
---
language: en
license: mit
library_name: pytorch
---
# Plainly Optimized Network
Dataset: BIGBENCH

Trainer Hyperparameters:
- `lr` = 5e-05
- `per_device_batch_size` = 1
- `gradient_accumulation_steps` = 4
- `weight_decay` = 1e-09
- `seed` = 42

|eval_loss|eval_mse|epoch|
|--|--|--|
|58.741|0.055|1.0|
|60.624|0.058|2.0|
|60.765|0.057|3.0|
|55.858|0.051|4.0|
|57.271|0.053|5.0|
|56.004|0.051|6.0|
|60.246|0.056|7.0|
|55.218|0.049|8.0|
|55.261|0.049|9.0|
|54.730|0.049|10.0|
|58.137|0.052|11.0|
|53.927|0.048|12.0|
|56.143|0.051|13.0|
|54.604|0.049|14.0|
|53.596|0.048|15.0|
|54.241|0.049|16.0|
|55.500|0.050|17.0|
|53.256|0.047|18.0|
|53.139|0.047|19.0|