abacusai
/

Fewshot-Metamath-OrcaVicuna-Mistral

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

ArkaAbacus commited on Jan 17

Commit

f557d97

•

1 Parent(s): 10f189d

Update README.md

Files changed (1) hide show

README.md +21 -2

README.md CHANGED Viewed

@@ -17,10 +17,29 @@ Instruction tuned with the following parameters:
 - Micro Batch Size 32 over 4xH100, gradient accumulation steps = 1
 - AdamW with learning rate 5e-5
-### Evaluation Results
 | Average | ARC | HellaSwag | MMLU | TruthfulQA | Winogrande | GSM8K |
 | --- | --- | --- | --- | --- | --- | --- |
 | 67.33    | 59.64 | 81.82 | 61.69 | 53.23 | 78.45 | 69.14 |
-For comparison the GSM8K score for the original `metamath/MetaMath-Mistral-7B` was 68.84 and average score was 65.78.

 - Micro Batch Size 32 over 4xH100, gradient accumulation steps = 1
 - AdamW with learning rate 5e-5
+## Evaluation Results
+### HuggingFace Leaderboard
 | Average | ARC | HellaSwag | MMLU | TruthfulQA | Winogrande | GSM8K |
 | --- | --- | --- | --- | --- | --- | --- |
 | 67.33    | 59.64 | 81.82 | 61.69 | 53.23 | 78.45 | 69.14 |
+For comparison the GSM8K score for the original `metamath/MetaMath-Mistral-7B` was 68.84 and average score was 65.78.
+### MT-Bench
+########## First turn ##########
+                                          score
+model                               turn
+fewshot_metamath_orcavicuna_mistral 1       6.9
+########## Second turn ##########
+                                            score
+model                               turn
+fewshot_metamath_orcavicuna_mistral 2     6.51875
+########## Average ##########
+                                        score
+model
+fewshot_metamath_orcavicuna_mistral  6.709375