Locutusque
commited on
Commit
•
07a01ca
1
Parent(s):
937ed7a
Update README.md
Browse files
README.md
CHANGED
@@ -37,4 +37,6 @@ This model was trained on two datasets, shown in this model page.
|
|
37 |
Training took approximately 500 GPU hours on a single Titan V.
|
38 |
# Metrics
|
39 |
You can look at the training metrics here:
|
40 |
-
https://wandb.ai/locutusque/TinyMistral-V2/runs/g0rvw6wc
|
|
|
|
|
|
37 |
Training took approximately 500 GPU hours on a single Titan V.
|
38 |
# Metrics
|
39 |
You can look at the training metrics here:
|
40 |
+
https://wandb.ai/locutusque/TinyMistral-V2/runs/g0rvw6wc
|
41 |
+
|
42 |
+
* This model performed excellently on TruthfulQA, outperforming models more than 720x its size. These models include: mistralai/Mixtral-8x7B-v0.1, tiiuae/falcon-180B, berkeley-nest/Starling-LM-7B-alpha, upstage/SOLAR-10.7B-v1.0, and more.
|