Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Nscale
Fireworks
Cohere
SambaNova
Novita
Featherless AI
Hyperbolic
Nebius AI Studio
Cerebras
Replicate
Together AI
fal
HF Inference API
Misc
Reset Misc
llama-cpp
Inference Endpoints
Merge
text-generation-inference
Eval Results
4-bit precision
Mixture of Experts
8-bit precision
custom_code
text-embeddings-inference
Carbon Emissions
Apply filters
Models
20,538
Full-text search
Edit filters
Sort: Trending
Active filters:
llama-cpp
Clear all
utterlygreat/omost-dolphin-2.9-llama3-8b-Q8_0-GGUF
Updated
Jul 10, 2024
•
3
kscommhit/Llama3-ChatQA-1.5-8B-Q8_0-GGUF
Text Generation
•
Updated
Jul 10, 2024
•
3
utterlygreat/omost-dolphin-2.9-llama3-8b-Q6_K-GGUF
Updated
Jul 10, 2024
•
2
NikolayKozloff/NuminaMath-7B-TIR-Q8_0-GGUF
Text Generation
•
Updated
Jul 10, 2024
•
1
•
1
NikolayKozloff/NuminaMath-7B-TIR-Q5_0-GGUF
Text Generation
•
Updated
Jul 10, 2024
•
1
•
1
NikolayKozloff/NuminaMath-7B-TIR-Q4_0-GGUF
Text Generation
•
Updated
Jul 10, 2024
•
2
•
1
NikolayKozloff/NuminaMath-7B-TIR-IQ4_NL-GGUF
Text Generation
•
Updated
Jul 10, 2024
•
1
•
1
genevera/mistral-orthogonalized-Q5_K_S-GGUF
Updated
Jul 11, 2024
genevera/mistral-orthogonalized-Q8_0-GGUF
Updated
Jul 11, 2024
•
2
nvhf/chatgpt_paraphraser_on_T5_base-Q6_K-GGUF
Text2Text Generation
•
Updated
Jul 11, 2024
•
22
arrio/Qwen2-1.5B-Instruct-Q4_K_S-GGUF
Text Generation
•
Updated
Jul 11, 2024
•
2
arrio/Gemma-2-9B-Chinese-Chat-Q2_K-GGUF
Text Generation
•
Updated
Jul 11, 2024
•
3
yichen0104/ReluLLaMA-7B-Q4_K_M-GGUF
Updated
Jul 11, 2024
•
4
Fizzarolli/writer-8b-Q4_K_S-GGUF
Updated
Jul 11, 2024
•
35
mchl914/Llama-3-Taiwan-8B-Instruct-Q8_0-GGUF
Text Generation
•
Updated
Jul 11, 2024
•
3
mchl914/Llama3-TAIDE-LX-8B-Chat-Alpha1-Q8_0-GGUF
Updated
Jul 11, 2024
•
6
qizc/Phi-3-mini-4k-instruct-Q2_K-GGUF
Text Generation
•
Updated
Jul 11, 2024
•
21
MisterSP/AlphaMist7B-slr-v4-slow2-Q4_K_M-GGUF
Updated
Jul 11, 2024
•
1
•
1
martintomov/Qwen2-7B-Instruct-Q4_K_M-GGUF
Text Generation
•
Updated
Jul 11, 2024
•
7
jackchoucn/Gemma-2-9B-Chinese-Chat-Q8_0-GGUF
Text Generation
•
Updated
Jul 11, 2024
•
3
amirm/Meta-Llama-3-8B-Instruct-Q2_K-GGUF
Text Generation
•
Updated
Jul 11, 2024
•
8
amirm/Meta-Llama-3-8B-Q2_K-GGUF
Text Generation
•
Updated
Jul 11, 2024
•
3
Stark2008/GutenLaserPi-Q6_K-GGUF
Updated
Jul 11, 2024
•
1
Stark2008/Qwen1.5-14B-Chat-Q3_K_S-GGUF
Text Generation
•
Updated
Jul 11, 2024
•
4
vahhab70/CodeQwen1.5-7B-Chat-Q4_K_M-GGUF
Text Generation
•
Updated
Jul 11, 2024
•
3
HeRksTAn/Meta-Llama-3-8B-Q4_K_M-GGUF
Text Generation
•
Updated
Jul 11, 2024
•
3
sdkramer10/Meta-Llama-3-8B-Instruct-Q4_K_M-GGUF
Text Generation
•
Updated
Jul 12, 2024
•
27
Kolapsicle/llama-3-nvidia-ChatQA-1.5-8B-Q5_K_M-GGUF
Text Generation
•
Updated
Jul 11, 2024
•
3
netcat420/MFANNv0.17-Q4_K_M-GGUF
Updated
Jul 11, 2024
•
1
MugenYume/TinyHermes-phi-3-mini-4k-instruct-Q4_K_M-GGUF
Text Generation
•
Updated
Jul 11, 2024
•
4
Previous
1
...
88
89
90
91
92
...
100
Next