pharaouk
/

Quiet-Star-Custom

Text Generation

Model card Files Files and versions Community

pharaouk commited on Apr 6

Commit

790f2f1

•

1 Parent(s): b0e4fff

Update README.md

Files changed (1) hide show

README.md +5 -1

README.md CHANGED Viewed

@@ -3,4 +3,8 @@ datasets:
 - open-web-math/open-web-math
 ---
-Mistral-7b with continued pretraining using Quiet-STaR (https://arxiv.org/abs/2403.09629) for generating 8 thought tokens before each output token.

 - open-web-math/open-web-math
 ---
+Mistral-7b with continued pretraining using Quiet-STaR (https://arxiv.org/abs/2403.09629) for generating 8 thought tokens before each output token.
+Forked from Crystalcareai/Quiet-Star-Custom