Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
nvidia
/
Llama-3_1-Nemotron-51B-Instruct
like
209
Follow
NVIDIA
45.1k
Text Generation
Transformers
Safetensors
PyTorch
English
nemotron-nas
nvidia
llama-3
conversational
custom_code
arxiv:
4 papers
License:
nvidia-open-model-license
Model card
Files
Files and versions
xet
Community
25
Deploy
Use this model
Patching hf bug that creates wrong cache length if only inputs_embeds are passed to the model
#18
by
tomer-nv
- opened
Oct 13, 2024
base:
refs/heads/main
←
from:
refs/pr/18
Discussion
Files changed
+19
-0
tomer-nv
NVIDIA org
Oct 13, 2024
No description provided.
Patching hf bug that creates wrong cache length if only inputs_embeds are passed to the model
d71a214b
tomer-nv
changed pull request status to
closed
Oct 13, 2024
Edit
Preview
Upload images, audio, and videos by dragging in the text input, pasting, or
clicking here
.
Tap or paste here to upload images
Comment
·
Sign up
or
log in
to comment