Update README.md
README.md
CHANGED
@@ -81,6 +81,9 @@ We use [Language Model Evaluation Harness](https://github.com/EleutherAI/lm-eval
 | MMLU | 67.9 |
 | **Avg.** | **53.3** |
 
+This places DiscoLM 120b firmly ahead of gpt-3.5-turbo-0613, as seen in the screenshot of the current (sadly no longer maintained) FastEval CoT leaderboard:
+![FastEval Leaderboard](imgs/cot_leaderboard.png)
+
 ### MTBench
 
 ```json
@@ -100,6 +103,8 @@ We use [Language Model Evaluation Harness](https://github.com/EleutherAI/lm-eval
     "average": 7.95
 }
 ```
+Screenshot of the current FastEval MT Bench leaderboard:
+![FastEval Leaderboard](imgs/mtbench_leaderboard.png)
 
 ## Prompt Format
 
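For readers who want to check numbers like the MMLU score above, here is a minimal sketch using the Python API of EleutherAI's Language Model Evaluation Harness (available in lm-eval 0.4+), which the README says was used for these benchmarks. The model id `DiscoResearch/DiscoLM-120b` and the dtype setting are illustrative assumptions, not taken from this diff:

```python
# Hedged sketch: scoring MMLU with EleutherAI's lm-evaluation-harness
# (Python API in lm-eval >= 0.4). The model id and dtype below are
# illustrative assumptions, not values from this README.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",  # Hugging Face transformers backend
    model_args="pretrained=DiscoResearch/DiscoLM-120b,dtype=bfloat16",
    tasks=["mmlu"],
    num_fewshot=5,  # MMLU is conventionally reported 5-shot
)

# Aggregate accuracy for the MMLU task group
print(results["results"]["mmlu"])
```

The same call with a different `tasks` list covers the other harness benchmarks in the table; MTBench scores come from FastEval rather than the harness, so they are not reproducible this way.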