yongzx usvsnsp commited on
Commit
7908b97
1 Parent(s): f32b9a8

Add Model Evals (#1)

Browse files

- Add Model Evals (0eeb6f4c637ccdc4a08ea9843c22011911147950)
- Add Lambada OpenAI to evals (48fe0daccff2135f4c0c7b12195ad93032b1b774)


Co-authored-by: USVSN Sai Prashanth <[email protected]>

Files changed (1) hide show
  1. README.md +18 -1
README.md CHANGED
@@ -1 +1,18 @@
1
- Wandb Runs: https://wandb.ai/eleutherai/pythia-rlhf/runs/644tyaq0?workspace=user-yongzx
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ Wandb Runs: https://wandb.ai/eleutherai/pythia-rlhf/runs/644tyaq0?workspace=user-yongzx
2
+
3
+ Model Evals:
4
+ | Task |Version|Filter| Metric |Value | |Stderr|
5
+ |-------------|-------|------|--------|-----:|---|-----:|
6
+ |arc_challenge|Yaml |none |acc |0.2287|± |0.0123|
7
+ | | |none |acc_norm|0.2619|± |0.0128|
8
+ |arc_easy |Yaml |none |acc |0.5248|± |0.0102|
9
+ | | |none |acc_norm|0.4533|± |0.0102|
10
+ |logiqa |Yaml |none |acc |0.2089|± |0.0159|
11
+ | | |none |acc_norm|0.2765|± |0.0175|
12
+ |piqa |Yaml |none |acc |0.6855|± |0.0108|
13
+ | | |none |acc_norm|0.6823|± |0.0109|
14
+ |sciq |Yaml |none |acc |0.8050|± |0.0125|
15
+ | | |none |acc_norm|0.7080|± |0.0144|
16
+ |winogrande |Yaml |none |acc |0.5335|± |0.0140|
17
+ |lambada_openai|Yaml |none |perplexity|9.8265|± |0.3139|
18
+ | | |none |acc |0.5135|± |0.0070|