bootstrapping for stddev: perplexity hf (pretrained=lomahony/eleuther-pythia2.8b-hh-sft), limit: None, num_fewshot: 0, batch_size: 16 | Task |Version|Filter| Metric | Value | |Stderr| |--------------|-------|------|---------------|------:|---|-----:| |arc_challenge |Yaml |none |acc | 0.3003|± |0.0134| | | |none |acc_norm | 0.3268|± |0.0137| |arc_easy |Yaml |none |acc | 0.6486|± |0.0098| | | |none |acc_norm | 0.5657|± |0.0102| |boolq |Yaml |none |acc | 0.6468|± |0.0084| |hellaswag |Yaml |none |acc | 0.4516|± |0.0050| | | |none |acc_norm | 0.5870|± |0.0049| |lambada_openai|Yaml |none |perplexity | 4.9120|± |0.1230| | | |none |acc | 0.6344|± |0.0067| |openbookqa |Yaml |none |acc | 0.2540|± |0.0195| | | |none |acc_norm | 0.3700|± |0.0216| |piqa |Yaml |none |acc | 0.7448|± |0.0102| | | |none |acc_norm | 0.7405|± |0.0102| |sciq |Yaml |none |acc | 0.8720|± |0.0106| | | |none |acc_norm | 0.8010|± |0.0126| |wikitext |Yaml |none |word_perplexity|20.7220| | | | | |none |byte_perplexity| 1.6482| | | | | |none |bits_per_byte | 0.7209| | | |winogrande |Yaml |none |acc | 0.5888|± |0.0138|