Hugging Face
Models: 420
Sort: Trending
Active filters: vllm
neuralmagic/granite-3.1-2b-base-quantized.w4a16 • Text Generation • Updated 16 days ago • 71
neuralmagic/granite-3.1-2b-base-FP8-dynamic • Text Generation • Updated 16 days ago • 49
neuralmagic/granite-3.1-8b-base-quantized.w8a8 • Text Generation • Updated 16 days ago • 25
neuralmagic/granite-3.1-8b-base-quantized.w4a16 • Text Generation • Updated 16 days ago • 55
neuralmagic/granite-3.1-8b-base-FP8-dynamic • Updated 16 days ago
NeoChen1024/Ministral-8B-Instruct-2410-W8A8 • Updated 30 days ago • 7
neuralmagic/Llama-3.3-70B-Instruct-quantized.w8a8 • Text Generation • Updated 24 days ago • 1.93k • 4
matrixportal/L3-Aspire-Heart-Matrix-8B-GGUF • Text Generation • Updated 24 days ago • 73
tensorblock/Ministral-8B-Instruct-2410-GGUF • Updated 22 days ago • 374
ReadyArt/Mistral-Small-Instruct-2409_EXL2_6.0bpw_H8 • Updated 20 days ago • 7
ReadyArt/Mistral-Small-Instruct-2409_EXL2_5.0bpw_H8 • Updated 20 days ago • 12
ReadyArt/Mistral-Small-Instruct-2409_EXL2_4.65bpw_H8 • Updated 20 days ago • 10
ReadyArt/Mistral-Small-Instruct-2409_EXL2_4.0bpw_H8 • Updated 20 days ago • 9
ReadyArt/Mistral-Small-Instruct-2409_EXL2_3.0bpw_H8 • Updated 20 days ago • 5
nikitagreb/test-upload • Text Classification • Updated 2 days ago • 19
FlorianJc/DeepSeek-R1-Distill-Llama-8B-vllm-fp8 • Text Generation • Updated 17 days ago • 227
mlx-community/Mistral-Small-24B-Instruct-2501-6bit • Updated 16 days ago • 168
sm54/Mistral-Small-24B-Instruct-2501-Q4_K_M-GGUF • Updated 16 days ago • 147
sm54/Mistral-Small-24B-Instruct-2501-Q5_K_M-GGUF • Updated 16 days ago • 120
phate334/Mistral-Small-24B-Instruct-2501-Q4_K_M-GGUF • Updated 16 days ago • 38
sm54/Mistral-Small-24B-Instruct-2501-Q6_K-GGUF • Updated 16 days ago • 171
tensorblock/Mistral-Small-24B-Instruct-2501-GGUF • Updated 16 days ago • 403
Triangle104/Mistral-Small-24B-Instruct-2501-Q4_K_S-GGUF • Updated 16 days ago • 57
Triangle104/Mistral-Small-24B-Instruct-2501-Q4_K_M-GGUF • Updated 16 days ago • 40
MikeRoz/mistralai_Mistral-Small-24B-Instruct-2501-6.0bpw-h6-exl2 • Updated 16 days ago • 182
Triangle104/Mistral-Small-24B-Instruct-2501-Q5_K_M-GGUF • Updated 16 days ago • 31
spmurrayzzz/Mistral-Small-24B-Instruct-2501-Q4_K_M-GGUF • Updated 16 days ago • 32
Triangle104/Mistral-Small-24B-Instruct-2501-Q6_K-GGUF • Updated 16 days ago • 254
Triangle104/Mistral-Small-24B-Instruct-2501-Q8_0-GGUF • Updated 16 days ago • 30
MikeRoz/mistralai_Mistral-Small-24B-Instruct-2501-8.0bpw-h8-exl2 • Updated 16 days ago • 83