nvidia/Llama-3_1-Nemotron-51B-Instruct
License: nvidia-open-model-license
Patching hf bug that creates wrong cache length if only inputs_embeds are passed to the model #19
by tomer-nv - opened Oct 13, 2024
base: refs/heads/main ← from: refs/pr/19
Files changed: +45 -1
tomer-nv (NVIDIA org), Oct 13, 2024:
No description provided.
Commit 775f6527: Patching hf bug that creates wrong cache length if only inputs_embeds are passed to the model
itlevy changed pull request status to merged, Oct 13, 2024