-
-
-
-
-
-
Inference Providers
Active filters:
dpo
TheBloke/CapybaraHermes-2.5-Mistral-7B-GGUF
7B
•
Updated
•
8.43k
•
121
TheBloke/SauerkrautLM-Mixtral-8x7B-GGUF
Text Generation
•
47B
•
Updated
•
940
•
9
argilla/CapybaraHermes-2.5-Mistral-7B
7B
•
Updated
•
17
•
70
mlabonne/NeuralDaredevil-8B-abliterated
Text Generation
•
8B
•
Updated
•
15k
•
•
220
QuantFactory/NeuralDaredevil-8B-abliterated-GGUF
Text Generation
•
8B
•
Updated
•
4.14k
•
70
HumanLLMs/Human-Like-Mistral-Nemo-Instruct-2407
Text Generation
•
12B
•
Updated
•
99
•
•
18
SmallDoge/Doge-320M-Instruct
Question Answering
•
0.3B
•
Updated
•
82
•
4
emretmrk/smolvlm-trl-dpo
AmberYifan/Qwen2.5-14B-Instruct-wildfeedback-RPO-DRIFT-iter1-4k
Text Generation
•
0.0B
•
Updated
•
15
•
1
mradermacher/Qwen2.5-14B-Instruct-wildfeedback-RPO-DRIFT-iter1-4k-GGUF
15B
•
Updated
•
257
•
1
AmberYifan/Qwen2.5-14B-Instruct-wildfeedback-RPO-DRIFT-iter2-4k
Text Generation
•
0.0B
•
Updated
•
6
•
1
mradermacher/Qwen2.5-14B-Instruct-wildfeedback-RPO-DRIFT-iter2-4k-GGUF
15B
•
Updated
•
229
•
1
AmberYifan/Qwen2.5-14B-Instruct-ultrafeedback-spin-iter1-RPO
Text Generation
•
0.0B
•
Updated
•
13
•
1
mradermacher/Qwen2.5-14B-Instruct-ultrafeedback-spin-iter1-RPO-GGUF
15B
•
Updated
•
259
•
1
AmberYifan/Qwen2.5-14B-Instruct-ultrafeedback-iterdpo-iter2-RPO
Text Generation
•
0.0B
•
Updated
•
4
•
1
AmberYifan/Qwen2.5-14B-Instruct-wildfeedback-RPO-iterDPO-iter1-4k
Text Generation
•
0.0B
•
Updated
•
17
•
1
mradermacher/Qwen2.5-14B-Instruct-ultrafeedback-iterdpo-iter2-RPO-GGUF
15B
•
Updated
•
2.07k
•
1
mradermacher/Qwen2.5-14B-Instruct-wildfeedback-RPO-iterDPO-iter1-4k-GGUF
15B
•
Updated
•
1.65k
•
1
lyogavin/Anima33B-DPO-Belle-1k
Text Generation
•
Updated
•
1
lyogavin/Anima33B-DPO-Belle-1k-merged
Text Generation
•
Updated
•
14
•
12
daekeun-ml/Llama-2-ko-DPO-13B
Text Generation
•
13B
•
Updated
•
825
•
19
lewtun/zephyr-7b-dpo-full
Text Generation
•
7B
•
Updated
•
5
alignment-handbook/zephyr-7b-dpo-full
Text Generation
•
7B
•
Updated
•
69
•
3
alignment-handbook/zephyr-7b-dpo-qlora
Updated
•
20
•
9
argilla/notus-7b-v1-lora
Text Generation
•
7B
•
Updated
•
5
•
7
argilla/notus-7b-v1-lora-adapter
Text Generation
•
Updated
•
3
argilla/notus-7b-v1
Text Generation
•
7B
•
Updated
•
147
•
122
ContextualAI/archangel_sft_pythia1-4b
Text Generation
•
1B
•
Updated
•
7
ContextualAI/archangel_sft_pythia2-8b
Text Generation
•
3B
•
Updated
•
42
•
1
ContextualAI/archangel_sft_pythia6-9b
Text Generation
•
7B
•
Updated
•
5