eval
README.md
CHANGED
@@ -157,7 +157,7 @@ litgpt evaluate --tasks 'hellaswag,gsm8k,truthfulqa_mc2,mmlu,winogrande,arc_chal
 litgpt evaluate --tasks 'leaderboard' --out_dir 'evaluate-leaderboard/' --batch_size 4 --dtype 'bfloat16' out/pretrain/final/
 ```
 
-Tasks |Version|Filter|n-shot| Metric | |Value | |Stderr|
+| Tasks |Version|Filter|n-shot| Metric | |Value | |Stderr|
 |-----------------------------------------------------------|-------|------|-----:|-----------------------|---|-----:|---|------|
 |leaderboard | N/A| | | | | | | |
 | - leaderboard_bbh | N/A| | | | | | | |