Hugging Face
Models: 112
Sort: Trending
Active filters: rl
asedmammad/Contextual_KTO_Mistral_PairRM-GGUF • Updated Mar 11, 2024 • 376 • 2
mradermacher/archangel_sft-kto_llama30b-GGUF • Updated May 31, 2024 • 289 • 1
mradermacher/archangel_sft-kto_llama30b-i1-GGUF • Updated Aug 2, 2024 • 503
lithiumice/motion_imitation • Updated Jan 23
tristan-deep/dqn-needle-tracker • Reinforcement Learning • Updated Sep 10, 2024 • 1 • 1
iso-ai1/isopro • Updated Nov 1, 2024
tensorblock/archangel_sft-dpo_pythia2-8b-GGUF • Updated Dec 28, 2024 • 102
tensorblock/archangel_sft_llama7b-GGUF • Updated Jan 7 • 47
tensorblock/archangel_sft-kto_llama13b-GGUF • Updated Jan 9 • 71
prithivMLmods/Sombrero-Opus-14B-Elite5 • Text Generation • Updated 10 days ago • 361 • 12
Pinkstack/Superthoughts-lite-v1 • Text Generation • Updated 3 days ago • 1.6k • 2
ayazfau/GPT2-124M-poetry-RL • Text Generation • Updated 26 days ago • 71
mradermacher/GPT2-124-poetry-RLHF-GGUF • Text Generation • Updated 24 days ago • 380
mradermacher/Superthoughts-mini-v1-3.8b-GGUF • Updated 18 days ago • 206
mradermacher/Superthoughts-mini-v1-3.8b-i1-GGUF • Updated 18 days ago • 360
mradermacher/Quasar-3.0-Max-GGUF • Updated 5 days ago • 941
mradermacher/Quasar-3.0-Max-i1-GGUF • Updated 5 days ago • 789
silx-ai/Quasar-3.3-Max • Text Generation • Updated 7 days ago • 206
mradermacher/Quasar-3.3-Max-GGUF • Updated 5 days ago • 1.01k
mradermacher/Quasar-3.3-Max-i1-GGUF • Updated 5 days ago • 913
matrixportal/EmojiLlama-3.1-8B-GGUF • Text Generation • Updated 6 days ago • 90
mradermacher/EmojiLlama-3.1-8B-GGUF • Updated about 5 hours ago • 493