Update `generation_config.json`
I noticed when using the instruct model with chat templating that the chat template ends each turn with `<|eot_id|>` rather than the EOS token `<|end_of_text|>`. So when the assistant responds to messages, it likes to use `<|eot_id|>` as well. Unfortunately, the generation config doesn't say to stop generating on `<|eot_id|>`, so the model keeps writing past the end of its answer.
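You can see the mismatch directly from the tokenizer (a minimal sketch; the exact model id is an assumption here, but any Llama 3 instruct checkpoint shows the same thing):

```python
from transformers import AutoTokenizer

# Model id assumed for illustration
tokenizer = AutoTokenizer.from_pretrained("meta-llama/Meta-Llama-3-70B-Instruct")

# The registered EOS token is <|end_of_text|>...
print(tokenizer.eos_token)      # <|end_of_text|>
print(tokenizer.eos_token_id)   # 128001

# ...but the chat template closes each turn with <|eot_id|> instead
messages = [{"role": "user", "content": "Hello"}]
print(tokenizer.apply_chat_template(messages, tokenize=False))  # ends with <|eot_id|>
print(tokenizer.convert_tokens_to_ids("<|eot_id|>"))            # 128009
```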
In the Model Card, I see that there is a workaround: manually update `eos_token_id` in any `generate` call or `pipeline`:
```python
import transformers

# Setup as in the Model Card (model id assumed here)
pipeline = transformers.pipeline(
    "text-generation", model="meta-llama/Meta-Llama-3-70B-Instruct"
)
tokenizer = pipeline.tokenizer
messages = [{"role": "user", "content": "Who are you?"}]
prompt = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)

# Stop on both the EOS token and the end-of-turn token
terminators = [
    tokenizer.eos_token_id,
    tokenizer.convert_tokens_to_ids("<|eot_id|>")
]

outputs = pipeline(
    prompt,
    max_new_tokens=256,
    eos_token_id=terminators,
    do_sample=True,
    temperature=0.6,
    top_p=0.9,
)
```
But I think there is a simpler way to fix this! If you just update the `generation_config.json` to stop on both `<|end_of_text|>` as well as `<|eot_id|>`, then it should work automatically and you won't need to build the `terminators` list.
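Concretely, that would mean making `eos_token_id` a list of both token ids (a sketch of only the relevant fields, keeping everything else in the file as-is; 128001 is `<|end_of_text|>` and 128009 is `<|eot_id|>`, and `transformers` accepts either an `int` or a `List[int]` here):

```json
{
  "bos_token_id": 128000,
  "eos_token_id": [128001, 128009]
}
```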
See the related PR here for Llama-3-8B-Instruct: https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct/discussions/4