pszemraj/GPT-Neo-33M-simplewiki-2048-scratch

Text Generation

Generated from Trainer

pszemraj committed on Sep 15, 2023

Commit 4724ac1 (1 parent: 08af85c)

Update README.md

Files changed (1)

README.md +6 -2

@@ -32,14 +32,18 @@ widget:
 pipeline_tag: text-generation
 datasets:
 - pszemraj/simple_wikipedia_LM
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# BL-TinyStories-33M-simple_wikipedia_LM-2048-scratch
-This model is a fine-tuned version of [roneneldan/TinyStories-33M](https://huggingface.co/roneneldan/TinyStories-33M) on the None dataset.
 It achieves the following results on the evaluation set:
 - Loss: 3.9511
 - Accuracy: 0.3843

 pipeline_tag: text-generation
 datasets:
 - pszemraj/simple_wikipedia_LM
+license: apache-2.0
+language:
+- en
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# GPT-Neo-33M-simplewiki-2048-scratch
+Initialized from random weights based on config from [roneneldan/TinyStories-33M](https://huggingface.co/roneneldan/TinyStories-33M), 3 epochs bf16.
 It achieves the following results on the evaluation set:
 - Loss: 3.9511
 - Accuracy: 0.3843
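
The evaluation loss in the card can be read as a perplexity, which is often easier to compare across runs. The snippet below is a minimal sketch, assuming the reported loss is the mean per-token cross-entropy in nats (the usual convention for causal language modeling with the Trainer; the card itself does not state the unit). The names `eval_loss` and `perplexity` are illustrative and do not come from the repository.

```python
import math

# Evaluation loss copied from the model card.
# Assumption: mean per-token cross-entropy measured in nats.
eval_loss = 3.9511


def perplexity(loss_nats: float) -> float:
    """Convert a mean cross-entropy loss (in nats) to perplexity."""
    return math.exp(loss_nats)


ppl = perplexity(eval_loss)
print(f"eval loss {eval_loss} -> perplexity {ppl:.2f}")
```

Under that assumption the perplexity comes out at roughly 52. If the loss were reported in bits instead, the conversion would be `2 ** loss` rather than `math.exp(loss)`.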