kennethge123's picture
Upload README.md with huggingface_hub
e74c857 verified
|
raw
history blame
498 Bytes
---
language: en
license: mit
library_name: pytorch
---
# Plainly Optimized Network
Dataset: BIGBENCH
Trainer Hyperparameters:
- `lr` = 5e-05
- `per_device_batch_size` = 8
- `gradient_accumulation_steps` = 2
- `weight_decay` = 0.0
- `seed` = 42
|eval_loss|eval_accuracy|epoch|
|--|--|--|
|10.410|0.571|1.0|
|10.191|0.571|2.0|
|9.468|0.643|3.0|
|10.414|0.571|4.0|
|10.468|0.571|5.0|
|10.335|0.571|6.0|
|10.296|0.571|7.0|
|9.998|0.571|8.0|
|10.080|0.571|9.0|
|10.186|0.571|10.0|
|9.862|0.571|11.0|