|
bootstrapping for stddev: perplexity |
|
hf (pretrained=lomahony/eleuther-pythia2.8b-hh-sft), limit: None, num_fewshot: 0, batch_size: 16 |
|
| Task |Version|Filter| Metric | Value | |Stderr| |
|
|--------------|-------|------|---------------|------:|---|-----:| |
|
|arc_challenge |Yaml |none |acc | 0.3003|± |0.0134| |
|
| | |none |acc_norm | 0.3268|± |0.0137| |
|
|arc_easy |Yaml |none |acc | 0.6486|± |0.0098| |
|
| | |none |acc_norm | 0.5657|± |0.0102| |
|
|boolq |Yaml |none |acc | 0.6468|± |0.0084| |
|
|hellaswag |Yaml |none |acc | 0.4516|± |0.0050| |
|
| | |none |acc_norm | 0.5870|± |0.0049| |
|
|lambada_openai|Yaml |none |perplexity | 4.9120|± |0.1230| |
|
| | |none |acc | 0.6344|± |0.0067| |
|
|openbookqa |Yaml |none |acc | 0.2540|± |0.0195| |
|
| | |none |acc_norm | 0.3700|± |0.0216| |
|
|piqa |Yaml |none |acc | 0.7448|± |0.0102| |
|
| | |none |acc_norm | 0.7405|± |0.0102| |
|
|sciq |Yaml |none |acc | 0.8720|± |0.0106| |
|
| | |none |acc_norm | 0.8010|± |0.0126| |
|
|wikitext |Yaml |none |word_perplexity|20.7220| | | |
|
| | |none |byte_perplexity| 1.6482| | | |
|
| | |none |bits_per_byte | 0.7209| | | |
|
|winogrande |Yaml |none |acc | 0.5888|± |0.0138| |
|
|
|
|