crumb/apricot-wildflower-20


crumb committed on Dec 26, 2023

Commit 58067e9 • 1 Parent(s): 27610b5

Update README.md

Files changed (1)

README.md +2 -0

@@ -6,6 +6,8 @@ license: apache-2.0
 This model is the Mistral-7b model finetuned for 1k steps with a combined lm loss and distillation loss on Openwebtext2 with a >=20 reddit score filter with training logits from Mixtral. I'm not going to pretend it was a big project I did it in a dream and woke up and replicated the code without any actual reason, idk how well it fares in benchmarks.
+(update: not very good)
 ### use
 ```python
 from transformers import AutoModelForCausalLM, AutoTokenizer