Update README.md
Browse files
README.md
CHANGED
@@ -3,4 +3,8 @@ datasets:
|
|
3 |
- open-web-math/open-web-math
|
4 |
---
|
5 |
|
6 |
-
Mistral-7b with continued pretraining using Quiet-STaR (https://arxiv.org/abs/2403.09629) for generating 8 thought tokens before each output token.
|
|
|
|
|
|
|
|
|
|
3 |
- open-web-math/open-web-math
|
4 |
---
|
5 |
|
6 |
+
Mistral-7b with continued pretraining using Quiet-STaR (https://arxiv.org/abs/2403.09629) for generating 8 thought tokens before each output token.
|
7 |
+
|
8 |
+
|
9 |
+
|
10 |
+
Forked from Crystalcareai/Quiet-Star-Custom
|