Model Details

4bit GPTQ quantized variant of nvidia/Llama3-ChatQA-1.5-8B.

License

The use of this model is governed by the META LLAMA 3 COMMUNITY LICENSE AGREEMENT

Downloads last month
77
Safetensors
Model size
1.99B params
Tensor type
FP16
·
I32
·
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for Rakuto/Llama3-ChatQA-1.5-8B-GPTQ-4bit

Quantized
(21)
this model