File size: 1,038 Bytes
7908b97 c67fb50 7908b97 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 |
Wandb Runs: https://wandb.ai/eleutherai/pythia-rlhf/runs/644tyaq0?workspace=user-yongzx
Model Evals:
| Task |Version|Filter| Metric |Value | |Stderr|
|-------------|-------|------|--------|-----:|---|-----:|
|arc_challenge|Yaml |none |acc |0.2287|± |0.0123|
| | |none |acc_norm|0.2619|± |0.0128|
|arc_easy |Yaml |none |acc |0.5248|± |0.0102|
| | |none |acc_norm|0.4533|± |0.0102|
|logiqa |Yaml |none |acc |0.2089|± |0.0159|
| | |none |acc_norm|0.2765|± |0.0175|
|piqa |Yaml |none |acc |0.6855|± |0.0108|
| | |none |acc_norm|0.6823|± |0.0109|
|sciq |Yaml |none |acc |0.8050|± |0.0125|
| | |none |acc_norm|0.7080|± |0.0144|
|winogrande |Yaml |none |acc |0.5335|± |0.0140|
|wsc |Yaml |none |acc |0.3654|± |0.0474|
|lambada_openai|Yaml |none |perplexity|9.8265|± |0.3139|
| | |none |acc |0.5135|± |0.0070| |