Text Generation · Transformers · Safetensors · mistral · conversational · text-generation-inference · Inference Endpoints
ArkaAbacus committed
Commit 1dbbf47
1 Parent(s): 42ac13a

Update README.md

Files changed (1)
  1. README.md +10 -4
README.md CHANGED
@@ -1,10 +1,16 @@
 ---
 license: apache-2.0
+datasets:
+- abacusai/MetaMathFewshot
+- shahules786/orca-chat
+- anon8231489123/ShareGPT_Vicuna_unfiltered
 ---

-Trained on the Metamath Multishot dataset from base Mistral, as well as the Vicuna dataset and the OrcaChat dataset. Dataset will be updated shortly.
+Trained on the MetamathFewshot (https://huggingface.co/datasets/abacusai/MetaMathFewshot) dataset from base Mistral, as well as the Vicuna (https://huggingface.co/datasets/anon8231489123/ShareGPT_Vicuna_unfiltered) dataset and the OrcaChat (https://huggingface.co/datasets/shahules786/orca-chat) dataset.

-Instruction tuned from base Mistral 7B for 3 epochs on the above dataset.
+Instruction tuned with the following parameters:

-Model Description
-This model is trained on a multi-shot version of the popular Metamath dataset. This aligns the model more closely to the few-shot setting of the LLM leaderboard evaluation, as well as robustifies the model for few-shot use in natural usage.
+- LORA, Rank 8, Alpha 16, Dropout 0.05, all modules (QKV and MLP)
+- 3 epochs
+- Micro Batch Size 32 over 4xH100, gradient accumulation steps = 1
+- AdamW with learning rate 5e-5
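
For context on the data side of this change: the three datasets added to the front matter are all public Hub datasets. A minimal sketch of pulling them with the `datasets` library follows; the split names and default configurations are assumptions, and the commit does not specify how the datasets were mixed or formatted for training.

```python
# Illustrative sketch only -- not part of this commit.
# Loads the three datasets added to the model card's front matter.
# Split names and default configs are assumptions; the mixing and
# formatting actually used for training are not specified here.
from datasets import load_dataset

metamath = load_dataset("abacusai/MetaMathFewshot", split="train")
orca_chat = load_dataset("shahules786/orca-chat", split="train")
vicuna = load_dataset("anon8231489123/ShareGPT_Vicuna_unfiltered", split="train")

print(len(metamath), len(orca_chat), len(vicuna))
```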
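
The hyperparameter list in the new README maps directly onto a standard PEFT LoRA setup. The sketch below shows one way those values could be expressed with the `peft` and `transformers` libraries; the actual training script is not part of this commit, and the Mistral `target_modules` names and the bf16 choice are assumptions.

```python
# Sketch of a LoRA configuration matching the hyperparameters above.
# Assumes Hugging Face peft/transformers; the real training script
# is not included in this commit.
from peft import LoraConfig
from transformers import TrainingArguments

lora_config = LoraConfig(
    r=8,                # Rank 8
    lora_alpha=16,      # Alpha 16
    lora_dropout=0.05,  # Dropout 0.05
    # "All modules (QKV and MLP)" -- interpreted here as the standard
    # Mistral attention QKV and MLP projection names (an assumption):
    target_modules=["q_proj", "k_proj", "v_proj",
                    "gate_proj", "up_proj", "down_proj"],
    task_type="CAUSAL_LM",
)

training_args = TrainingArguments(
    output_dir="out",
    num_train_epochs=3,              # 3 epochs
    per_device_train_batch_size=32,  # micro batch size 32 per GPU (4xH100)
    gradient_accumulation_steps=1,   # gradient accumulation steps = 1
    learning_rate=5e-5,              # AdamW with learning rate 5e-5
    optim="adamw_torch",
    bf16=True,                       # assumption: bf16 on H100 hardware
)
```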