metadata
language: en
license: mit
library_name: pytorch

Plainly Optimized Network

Dataset: BIGBENCH

Trainer Hyperparameters:

  • lr = 5e-05
  • per_device_batch_size = 8
  • gradient_accumulation_steps = 2
  • weight_decay = 0.0
  • seed = 42
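
The batch-size settings above interact: with gradient accumulation, one optimizer step sees more examples than a single per-device batch. The following is a minimal sketch of that arithmetic, not the training script used for this model. The `hparams` dict and the `effective_batch_size` helper are hypothetical names introduced for illustration, and the number of devices is an assumption because the card does not state it.

```python
# Hyperparameters copied from the list above.
hparams = {
    "lr": 5e-05,
    "per_device_batch_size": 8,
    "gradient_accumulation_steps": 2,
    "weight_decay": 0.0,
    "seed": 42,
}


def effective_batch_size(hparams, num_devices=1):
    """Number of examples contributing to one optimizer step.

    num_devices is an assumption: the model card does not say how many
    devices were used for training.
    """
    return (
        hparams["per_device_batch_size"]
        * hparams["gradient_accumulation_steps"]
        * num_devices
    )


# Assuming a single device: 8 * 2 * 1
print(effective_batch_size(hparams))
```

Under the single-device assumption, each optimizer step averages gradients over 16 examples; the figure scales linearly with the device count.
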
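
| eval_loss | eval_accuracy | epoch |
|-----------|---------------|-------|
| 10.410 | 0.571 | 1.0 |
| 10.191 | 0.571 | 2.0 |
| 9.468 | 0.643 | 3.0 |
| 10.414 | 0.571 | 4.0 |
| 10.468 | 0.571 | 5.0 |
| 10.335 | 0.571 | 6.0 |
| 10.296 | 0.571 | 7.0 |
| 9.998 | 0.571 | 8.0 |
| 10.080 | 0.571 | 9.0 |
| 10.186 | 0.571 | 10.0 |
| 9.862 | 0.571 | 11.0 |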
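
To read the evaluation log programmatically, the rows can be scanned for the best epoch. This is a small sketch using only the values reported above; the `results` list and the variable names are hypothetical, and the card does not say which checkpoint was actually published.

```python
# (epoch, eval_loss, eval_accuracy) rows copied from the evaluation log above.
results = [
    (1.0, 10.410, 0.571),
    (2.0, 10.191, 0.571),
    (3.0, 9.468, 0.643),
    (4.0, 10.414, 0.571),
    (5.0, 10.468, 0.571),
    (6.0, 10.335, 0.571),
    (7.0, 10.296, 0.571),
    (8.0, 9.998, 0.571),
    (9.0, 10.080, 0.571),
    (10.0, 10.186, 0.571),
    (11.0, 9.862, 0.571),
]

# Epoch with the lowest evaluation loss.
best_by_loss = min(results, key=lambda row: row[1])

# Epoch with the highest evaluation accuracy (first one wins on ties).
best_by_accuracy = max(results, key=lambda row: row[2])

print("lowest eval_loss:", best_by_loss)
print("highest eval_accuracy:", best_by_accuracy)
```

In this log both criteria select epoch 3; every other epoch reports the same accuracy of 0.571, so loss is the only signal that separates them.
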