neuralmagic/Meta-Llama-3.1-70B-Instruct-quantized.w4a16
Tags: Text Generation, Transformers, Safetensors, 8 languages, llama, int4, vllm, conversational, text-generation-inference, Inference Endpoints, 4-bit precision, gptq, arxiv:2210.17323
License: llama3.1
Commit History
8c670bc (verified): Update README.md, alexmarques committed on Aug 13, 2024
14dfb3c (verified): Update README.md, abhinavnmagic committed on Aug 8, 2024
dfb3652 (verified): Update README.md, abhinavnmagic committed on Aug 1, 2024
aae93a4 (verified): Create README.md, abhinavnmagic committed on Jul 31, 2024
b74ae47 (verified): Upload folder using huggingface_hub, abhinavnmagic committed on Jul 31, 2024
d710f68 (verified): initial commit, abhinavnmagic committed on Jul 31, 2024