gemma-2-9b GGUF

The Hugging Face checkpoint was converted to GGUF with llama.cpp release b3259.
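A minimal sketch of how such a conversion command might be assembled, not the exact invocation used for these files. It assumes a llama.cpp checkout whose converter script is named `convert-hf-to-gguf.py` (newer checkouts name it `convert_hf_to_gguf.py`) and that the script accepts `--outfile` and `--outtype`. The local model directory and the output file names are hypothetical.

```python
import sys
from pathlib import Path

# Precisions published in this repository.
SUPPORTED_OUTTYPES = ("f16", "q8_0")


def build_convert_command(model_dir, outtype, script="convert-hf-to-gguf.py"):
    """Return the argument list for a llama.cpp HF-to-GGUF conversion.

    `script` is an assumed path to the converter inside a llama.cpp checkout;
    adjust it to match the checkout you are using.
    """
    if outtype not in SUPPORTED_OUTTYPES:
        raise ValueError(f"unsupported outtype: {outtype!r}")
    # Hypothetical output file name, chosen only for illustration.
    outfile = Path(f"gemma-2-9b-{outtype}.gguf")
    return [
        sys.executable, script, str(model_dir),
        "--outfile", str(outfile),
        "--outtype", outtype,
    ]


if __name__ == "__main__":
    # Print the commands instead of running them; pass a list to
    # subprocess.run(cmd, check=True) to execute one.
    for outtype in SUPPORTED_OUTTYPES:
        print(" ".join(build_convert_command("gemma-2-9b", outtype)))
```

The function only builds the command, so it can be inspected before anything is executed.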

Original model: https://huggingface.co/google/gemma-2-9b

Available Precisions:

  • f16
  • q8_0
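To help choose between the two precisions, the sketch below gives a rough estimate of the weight storage each one needs. It assumes the 9.24B parameter count reported for this model, 2 bytes per weight for f16, and the q8_0 block layout of 32 int8 weights plus one 2-byte f16 scale (34 bytes per 32 weights). It ignores GGUF metadata and any tensors kept at a different precision, so actual file sizes will differ somewhat.

```python
# Rough weight-storage estimate for the published precisions.
PARAMS = 9.24e9  # parameter count reported for this model

Q8_0_BLOCK_WEIGHTS = 32   # weights per q8_0 block
Q8_0_BLOCK_BYTES = 32 + 2  # 32 int8 values + one f16 scale


def bytes_per_weight(precision):
    """Average storage per weight, in bytes, for a given precision."""
    if precision == "f16":
        return 2.0
    if precision == "q8_0":
        return Q8_0_BLOCK_BYTES / Q8_0_BLOCK_WEIGHTS
    raise ValueError(f"unknown precision: {precision!r}")


def estimate_gb(precision, params=PARAMS):
    """Estimated weight storage in gigabytes (10**9 bytes)."""
    return params * bytes_per_weight(precision) / 1e9


if __name__ == "__main__":
    for precision in ("f16", "q8_0"):
        print(f"{precision}: ~{estimate_gb(precision):.1f} GB")
```

Under these assumptions the estimate is about 18.5 GB for f16 and about 9.8 GB for q8_0, which is the main trade-off between the two files.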

License

The Gemma Terms of Use apply, as they do to the original model.

Model size: 9.24B params

Architecture: gemma2
