Edit Models filters

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

23,488

Full-text search

Active filters: llama-cpp

herrkobold/Llama3-DiscoLeo-Instruct-8B-32k-v0.1-Q5_K_M-GGUF

8B • Updated Jul 13, 2024

julioc-p/Gemma2BFullPrecision-Q4_K_M-GGUF

3B • Updated Jul 13, 2024 • 1

Outbox/Hebrew-Mistral-7B-200K-Q4_K_M-GGUF

8B • Updated Jul 13, 2024 • 1

NikolayKozloff/Tiger-Gemma-9B-v1-Q8_0-GGUF

9B • Updated Jul 13, 2024 • 2 • 1

NikolayKozloff/Tiger-Gemma-9B-v1-Q5_0-GGUF

9B • Updated Jul 13, 2024 • 3 • 1

NikolayKozloff/Tiger-Gemma-9B-v1-Q4_0-GGUF

9B • Updated Jul 13, 2024 • 13 • 1

NikolayKozloff/Tiger-Gemma-9B-v1-IQ4_NL-GGUF

9B • Updated Jul 13, 2024 • 4 • 1

balrogbob/open_llama_3b_v2-python-instruct-0.1.2-IQ3_XXS-GGUF

3B • Updated Jul 13, 2024 • 4

balrogbob/open_llama_3b_v2-python-instruct-0.1.2-Q4_K_M-GGUF

3B • Updated Jul 13, 2024 • 1

ironlanderl/TinyLlama_v1.1-Q8_0-GGUF

1B • Updated Jul 13, 2024 • 1

fernandoruiz/madlad400-3b-mt-Q4_K_M-GGUF

3B • Updated Jul 13, 2024 • 1

fernandoruiz/madlad400-3b-mt-Q4_K_S-GGUF

Translation • 3B • Updated Jul 13, 2024 • 47

balrogbob/open_llama_3b_v2-python-instruct-0.1.3-IQ3_XXS-GGUF

3B • Updated Jul 14, 2024 • 5

Kondara/cendol-llama2-7b-chat-Q4_K_M-GGUF

7B • Updated Jul 14, 2024

OlivierP/Hathor_Stable-v0.2-L3-8B-Q4_K_M-GGUF

8B • Updated Jul 14, 2024

tklohj/merged_8b_llama-Q4_K_M-GGUF

8B • Updated Jul 14, 2024

tritiumoxide/madlad400-7b-mt-bt-Q2_K-GGUF

Translation • 8B • Updated Jul 14, 2024 • 7

Kondara/cendol-llama2-13b-merged-inst-Q4_K_M-GGUF

13B • Updated Jul 14, 2024

fernandoruiz/gemma-2-27b-it-Q4_K_M-GGUF

Text Generation • 27B • Updated Jul 14, 2024 • 2

fernandoruiz/gemma-2-27b-it-Q4_K_S-GGUF

Text Generation • 27B • Updated Jul 14, 2024 • 2

v000000/TripletBoreas-7B-t0.0001-Q5_K_S-GGUF

7B • Updated Jul 14, 2024

Tech-Meld/NuminaMath-7B-TIR-Q4_K_M-GGUF

Text Generation • 7B • Updated Jul 14, 2024 • 4

pythonplayer123/gemma-2-27b-it-Q4_K_M-GGUF

Text Generation • 27B • Updated Jul 14, 2024 • 10 • 1

fernandoruiz/Phi-3-mini-4k-instruct-Q4_K_M-GGUF

Text Generation • 4B • Updated Jul 14, 2024 • 1

v000000/DupletBoreas-7B-t0.0001-Q5_K_S-GGUF

7B • Updated Jul 14, 2024

fernandoruiz/Phi-3-mini-4k-instruct-Q4_K_S-GGUF

Text Generation • 4B • Updated Jul 14, 2024

v000000/DupletBoreas-7B-t0.0001-Q8_0-GGUF

7B • Updated Jul 14, 2024

fernandoruiz/Phi-3-medium-4k-instruct-Q4_K_M-GGUF

Text Generation • 14B • Updated Jul 14, 2024

fernandoruiz/Phi-3-medium-4k-instruct-Q4_K_S-GGUF

Text Generation • 14B • Updated Jul 14, 2024 • 3

v000000/TripletBoreas-7B-t0.0001-Q8_0-GGUF

7B • Updated Jul 14, 2024