eleuther-pythia2.8b-hh-sft / sft-2.8b-eval-files / sft-pythia-2.8b-0shot-shelloutput.txt
bootstrapping for stddev: perplexity
hf (pretrained=lomahony/eleuther-pythia2.8b-hh-sft), limit: None, num_fewshot: 0, batch_size: 16
| Task |Version|Filter| Metric | Value | |Stderr|
|--------------|-------|------|---------------|------:|---|-----:|
|arc_challenge |Yaml |none |acc | 0.3003|± |0.0134|
| | |none |acc_norm | 0.3268|± |0.0137|
|arc_easy |Yaml |none |acc | 0.6486|± |0.0098|
| | |none |acc_norm | 0.5657|± |0.0102|
|boolq |Yaml |none |acc | 0.6468|± |0.0084|
|hellaswag |Yaml |none |acc | 0.4516|± |0.0050|
| | |none |acc_norm | 0.5870|± |0.0049|
|lambada_openai|Yaml |none |perplexity | 4.9120|± |0.1230|
| | |none |acc | 0.6344|± |0.0067|
|openbookqa |Yaml |none |acc | 0.2540|± |0.0195|
| | |none |acc_norm | 0.3700|± |0.0216|
|piqa |Yaml |none |acc | 0.7448|± |0.0102|
| | |none |acc_norm | 0.7405|± |0.0102|
|sciq |Yaml |none |acc | 0.8720|± |0.0106|
| | |none |acc_norm | 0.8010|± |0.0126|
|wikitext |Yaml |none |word_perplexity|20.7220| | |
| | |none |byte_perplexity| 1.6482| | |
| | |none |bits_per_byte | 0.7209| | |
|winogrande |Yaml |none |acc | 0.5888|± |0.0138|
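
Two quick sanity checks on the numbers above can be sketched in Python. The first recomputes `bits_per_byte` from `byte_perplexity` as a base-2 logarithm, which is consistent with the two wikitext values reported in the table. The second turns a reported `Stderr` into an approximate 95% interval; the normal approximation (z = 1.96) and the helper name `normal_ci` are assumptions made for illustration and are not part of the harness output.

```python
import math

# Values copied from the 0-shot table above.
byte_perplexity = 1.6482
reported_bits_per_byte = 0.7209

# bits_per_byte is the base-2 logarithm of byte_perplexity.
bits_per_byte = math.log2(byte_perplexity)


def normal_ci(value, stderr, z=1.96):
    """Approximate 95% interval under a normal approximation (assumed here)."""
    return (value - z * stderr, value + z * stderr)


# (accuracy, stderr) pairs copied from the table.
arc_easy_ci = normal_ci(0.6486, 0.0098)
arc_challenge_ci = normal_ci(0.3003, 0.0134)

print(f"bits_per_byte recomputed: {bits_per_byte:.4f} (reported {reported_bits_per_byte})")
print(f"arc_easy acc 95% CI:      {arc_easy_ci[0]:.4f} to {arc_easy_ci[1]:.4f}")
print(f"arc_challenge acc 95% CI: {arc_challenge_ci[0]:.4f} to {arc_challenge_ci[1]:.4f}")
```

The wikitext rows carry no stderr, so only the accuracy metrics can be given an interval this way.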