Active filters: vllm
neuralmagic/DeepSeek-Coder-V2-Lite-Instruct-FP8 • Text Generation • 6.1k • 6
neuralmagic/DeepSeek-Coder-V2-Lite-Base-FP8 • Text Generation • 41
mgoin/Mistral-Nemo-Instruct-2407-FP8-Dynamic • Text Generation • 116
mgoin/Mistral-Nemo-Instruct-2407-FP8-KV • Text Generation • 3
neuralmagic/Mistral-Nemo-Instruct-2407-FP8 • Text Generation • 33.6k • 17
FlorianJc/Mistral-Nemo-Instruct-2407-vllm-fp8 • Text Generation • 1.24k • 8
neuralmagic/DeepSeek-Coder-V2-Base-FP8 • Text Generation • 83
neuralmagic/DeepSeek-Coder-V2-Instruct-FP8 • Text Generation • 246 • 7
mgoin/Minitron-4B-Base-FP8 • Text Generation • 1.35k • 3
mgoin/Minitron-8B-Base-FP8 • Text Generation • 11 • 3
mgoin/nemotron-3-8b-chat-4k-sft-hf • Text Generation • 15
neuralmagic/Meta-Llama-3.1-8B-Instruct-FP8-dynamic • Text Generation • 3.48k • 5
neuralmagic/Meta-Llama-3.1-405B-Instruct-FP8 • Text Generation • 3.87k • 31
neuralmagic/Meta-Llama-3.1-405B-Instruct-FP8-dynamic • Text Generation • 239 • 14
neuralmagic/Meta-Llama-3.1-8B-Instruct-quantized.w8a16 • Text Generation • 3.98k • 9
mgoin/Nemotron-4-340B-Base-hf • Text Generation • 8 • 1
mgoin/Nemotron-4-340B-Base-hf-FP8 • Text Generation • 69 • 2
neuralmagic/Meta-Llama-3.1-70B-Instruct-quantized.w8a16 • Text Generation • 450 • 4
mgoin/Nemotron-4-340B-Instruct-hf • Text Generation • 60 • 4
mgoin/Nemotron-4-340B-Instruct-hf-FP8 • Text Generation • 721 • 3
FlorianJc/ghost-8b-beta-vllm-fp8 • Text Generation • 6
FlorianJc/Meta-Llama-3.1-8B-Instruct-vllm-fp8 • Text Generation • 224
neuralmagic/Meta-Llama-3.1-70B-Instruct-quantized.w8a8 • Text Generation • 7.66k • 19
neuralmagic/Meta-Llama-3.1-70B-FP8 • Text Generation • 363 • 1
neuralmagic/Meta-Llama-3.1-8B-quantized.w8a16 • Text Generation • 30.8k • 1
neuralmagic/Meta-Llama-3.1-8B-quantized.w8a8 • Text Generation • 106 • 2
neuralmagic/starcoder2-15b-FP8 • Text Generation • 180
neuralmagic/starcoder2-7b-FP8 • Text Generation • 87
neuralmagic/starcoder2-3b-FP8 • Text Generation • 115
neuralmagic/Meta-Llama-3.1-405B-FP8 • Text Generation • 150