siddartha-abacus committed
Commit 4648012 (parent: 1dbbf47): Update README.md
Instruction tuned with the following parameters (a config sketch follows the list):

- LoRA, Rank 8, Alpha 16, Dropout 0.05, all modules (QKV and MLP)
- 3 epochs
- Micro Batch Size 32 over 4xH100, gradient accumulation steps = 1
- AdamW with learning rate 5e-5
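The commit does not include the training script; the following is a minimal sketch of how these hyperparameters might map onto the `peft` and `transformers` APIs. The module names assume a Mistral-style architecture, and `output_dir`, `bf16`, and the inclusion of `o_proj` are assumptions, not details from the README.

```python
# Hypothetical mapping of the README's hyperparameters onto peft/transformers;
# a sketch under the assumptions above, not the author's actual training script.
from peft import LoraConfig
from transformers import TrainingArguments

lora_config = LoraConfig(
    r=8,                # Rank 8
    lora_alpha=16,      # Alpha 16
    lora_dropout=0.05,  # Dropout 0.05
    # "All modules (QKV and MLP)"; names assume a Mistral-style model,
    # and including o_proj is an assumption.
    target_modules=[
        "q_proj", "k_proj", "v_proj", "o_proj",
        "gate_proj", "up_proj", "down_proj",
    ],
    task_type="CAUSAL_LM",
)

training_args = TrainingArguments(
    output_dir="checkpoints",        # placeholder path
    num_train_epochs=3,
    per_device_train_batch_size=32,  # micro batch size per GPU
    gradient_accumulation_steps=1,
    learning_rate=5e-5,
    optim="adamw_torch",             # AdamW
    bf16=True,                       # typical on H100s (assumption)
)
# Effective global batch: 32 micro x 4 GPUs x 1 accumulation step = 128.
```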

### Evaluation Results

| Average | ARC | HellaSwag | MMLU | TruthfulQA | Winogrande | GSM8K |
| --- | --- | --- | --- | --- | --- | --- |
| 67.33 | 59.64 | 81.82 | 61.69 | 53.23 | 78.45 | 69.14 |
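The Average column is consistent with the arithmetic mean of the six benchmark scores, which a quick check confirms:

```python
# Verify that the reported Average matches the mean of the six benchmarks.
scores = {
    "ARC": 59.64, "HellaSwag": 81.82, "MMLU": 61.69,
    "TruthfulQA": 53.23, "Winogrande": 78.45, "GSM8K": 69.14,
}
print(round(sum(scores.values()) / len(scores), 2))  # 67.33
```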
For comparison, the GSM8K score for the original `metamath/MetaMath-Mistral-7B` was 68.84 and the average score was 65.78, so this tune gains +0.30 on GSM8K and +1.55 on the average.