Locutusque
/

TinyMistral-248M-v2

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Locutusque commited on Jan 7

Commit

07a01ca

•

1 Parent(s): 937ed7a

Update README.md

Files changed (1) hide show

README.md +3 -1

README.md CHANGED Viewed

@@ -37,4 +37,6 @@ This model was trained on two datasets, shown in this model page.
 Training took approximately 500 GPU hours on a single Titan V.
 # Metrics
 You can look at the training metrics here:
-https://wandb.ai/locutusque/TinyMistral-V2/runs/g0rvw6wc

 Training took approximately 500 GPU hours on a single Titan V.
 # Metrics
 You can look at the training metrics here:
+https://wandb.ai/locutusque/TinyMistral-V2/runs/g0rvw6wc
+* This model performed excellently on TruthfulQA, outperforming models more than 720x its size. These models include: mistralai/Mixtral-8x7B-v0.1, tiiuae/falcon-180B, berkeley-nest/Starling-LM-7B-alpha, upstage/SOLAR-10.7B-v1.0, and more.