nvidia
/

Llama-3_1-Nemotron-51B-Instruct

Text Generation

Model card Files Files and versions Community

Llama-3_1-Nemotron-51B-Instruct / modeling_decilm.py

Commit History

Patching hf bug that creates wrong cache length if only inputs_embeds are passed to the model

775f652
verified

tomer-nv commited on Oct 13

DeciLMForCausalLM(DeciLMPreTrainedModel, GenerationMixin) for v4.50 (#16)

3209eec
verified

itlevy commited on Sep 30

transformers>=4.44.2, backward compat

b5dfaf4
verified

itlevy commited on Sep 24