Update README.md
README.md CHANGED
````diff
@@ -40,7 +40,7 @@ We are providing 2 ways to run the model:
 To load the pretrained model for further finetuning or evaluation:
 ```bash
 from transformers import AutoModelForCausalLM, AutoTokenizer
-tokenizer = AutoTokenizer.from_pretrained("facebook/MobileLLM-600M")
+tokenizer = AutoTokenizer.from_pretrained("facebook/MobileLLM-600M", use_fast_tokenizer=False)
 model = AutoModelForCausalLM.from_pretrained("facebook/MobileLLM-600M", trust_remote_code=True)
 ```
 Note that the default tokenizer does not contain special tokens. For example you can use:
@@ -64,7 +64,7 @@ We provide the pretraining code in https://github.com/facebookresearch/MobileLLM
 # run pretraining
 > bash pretrain.sh
 ```
-We also provide evaluation script for calculating
+We also provide an evaluation script for calculating the ppl of the wikitext-2 test split:
 ```bash
 > bash eval.sh
 ```
````
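Since the README notes that the default tokenizer does not contain special tokens, the usual Hugging Face pattern is to register them on the tokenizer and then resize the model's embedding table to match the enlarged vocabulary. A minimal sketch of that pattern follows; the helper name and the token strings are illustrative assumptions, not taken from the MobileLLM repo:

```python
def add_missing_special_tokens(tokenizer, model, bos="<s>", eos="</s>"):
    """Register bos/eos special tokens on the tokenizer and, if any were
    actually added, grow the model's embedding table to the new vocab size.
    The token strings here are placeholders; pick whatever your training
    setup expects."""
    num_added = tokenizer.add_special_tokens({"bos_token": bos, "eos_token": eos})
    if num_added > 0:
        # Standard Hugging Face API for syncing embeddings with the vocab.
        model.resize_token_embeddings(len(tokenizer))
    return num_added

# Usage, mirroring the README (requires network access and trust_remote_code):
# tokenizer = AutoTokenizer.from_pretrained("facebook/MobileLLM-600M", use_fast_tokenizer=False)
# model = AutoModelForCausalLM.from_pretrained("facebook/MobileLLM-600M", trust_remote_code=True)
# add_missing_special_tokens(tokenizer, model)
```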