Locutusque committed
Commit de5fb0a
1 Parent(s): 879e2ee
Update README.md
README.md
CHANGED
@@ -4,18 +4,6 @@ language:
 - en
 pipeline_tag: text-generation
 ---
-Work in progress...
-
-Like version 1, this model will be trained on a single GPU, with hopes of getting better performance.
-# Roadmap
-
-- Train on 1,000,000 examples of Skylion007/openwebtext at a learning rate of 3e-4 and batch size of 32
-- Once perplexity reaches an average of ~100, a cosine scheduler will be applied and the batch size will be increased to 4096
-- Once perplexity reaches an average of 50, the model will be trained on graelo/wikipedia and mattymchen/refinedweb-3m, and the batch size will be increased to 12,288.
-
-- I'm open to any suggestions to modify this roadmap if you feel it isn't sufficient!
-# Disclaimer
-This model may be cancelled if no performance improvement is seen over its predecessor. The roadmap may also be changed during training.
 # Release date
 This model is set to be released by January 7, 2024. This date may be extended.
 Watch the training live here:
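For reference, the roadmap removed in this commit amounts to a perplexity-gated training schedule. A minimal sketch of that logic is below; only the thresholds, batch sizes, learning rate, and dataset names come from the README text, while the function name, return fields, and exact stage boundaries are illustrative assumptions.

```python
def training_stage(avg_perplexity: float) -> dict:
    """Map a running-average perplexity to the roadmap's training stage.

    Values (LR 3e-4, batch sizes 32 / 4096 / 12288, the ~100 and 50
    perplexity gates, and the dataset names) are taken from the removed
    roadmap; the structure of this helper is purely illustrative.
    """
    if avg_perplexity > 100:
        # Stage 1: 1M examples of openwebtext, batch size 32, LR 3e-4
        return {"batch_size": 32, "lr": 3e-4, "scheduler": "constant",
                "datasets": ["Skylion007/openwebtext"]}
    if avg_perplexity > 50:
        # Stage 2: cosine LR schedule applied, batch size raised to 4096
        return {"batch_size": 4096, "lr": 3e-4, "scheduler": "cosine",
                "datasets": ["Skylion007/openwebtext"]}
    # Stage 3: add wikipedia and refinedweb-3m, batch size raised to 12,288
    return {"batch_size": 12288, "lr": 3e-4, "scheduler": "cosine",
            "datasets": ["graelo/wikipedia", "mattymchen/refinedweb-3m"]}
```

A training loop would re-evaluate `training_stage` on the running average perplexity between epochs and rebuild its dataloader and scheduler whenever the stage changes.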