kennethge123's picture
Upload README.md with huggingface_hub
5d6e85c verified
|
raw
history blame
330 Bytes
---
language: en
license: mit
library_name: pytorch
---
# Plainly Optimized Network
Dataset: BIGBENCH
Trainer Hyperparameters:
- `lr` = 5e-05
- `per_device_batch_size` = 1
- `gradient_accumulation_steps` = 4
- `weight_decay` = 1e-09
- `seed` = 42
|eval_loss|eval_accuracy|epoch|
|--|--|--|
|66.323|0.063|1.0|
|59.935|0.055|2.0|