Rakuto/Llama3-ChatQA-1.5-8B-GPTQ-4bit

Hugging Face

Model Details

4bit GPTQ quantized variant of nvidia/Llama3-ChatQA-1.5-8B.

License

The use of this model is governed by the META LLAMA 3 COMMUNITY LICENSE AGREEMENT

Downloads last month: 77

Safetensors

Model size

1.99B params

Tensor type

FP16

I32

Inference Providers NEW

Text Generation

This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for Rakuto/Llama3-ChatQA-1.5-8B-GPTQ-4bit

Base model

nvidia/Llama3-ChatQA-1.5-8B

Quantized

(21)

this model