Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
nvidia
/
Llama-3_1-Nemotron-51B-Instruct
like
194
Follow
NVIDIA
4,281
Text Generation
Transformers
Safetensors
PyTorch
English
nemotron-nas
nvidia
llama-3
conversational
custom_code
arxiv:
4 papers
License:
nvidia-open-model-license
Model card
Files
Files and versions
Community
20
Train
Use this model
Patching hf bug that creates wrong cache length if only inputs_embeds are passed to the model
#19
by
tomer-nv
- opened
Oct 13
base:
refs/heads/main
←
from:
refs/pr/19
Discussion
Files changed
+45
-1
tomer-nv
NVIDIA org
Oct 13
No description provided.
Patching hf bug that creates wrong cache length if only inputs_embeds are passed to the model
775f6527
itlevy
changed pull request status to
merged
Oct 13
Edit
Preview
Upload images, audio, and videos by dragging in the text input, pasting, or
clicking here
.
Tap or paste here to upload images
Comment
·
Sign up
or
log in
to comment