Llama-3_1-Nemotron-51B-Instruct / modeling_decilm.py

Commit History

Patching hf bug that creates wrong cache length if only inputs_embeds are passed to the model
775f652

tomer-nv committed

DeciLMForCausalLM(DeciLMPreTrainedModel, GenerationMixin) for v4.50 (#16)
3209eec

itlevy committed

transformers>=4.44.2, backward compat
b5dfaf4

itlevy committed