Models: 192
Sort: Trending
Active filters: arxiv:2402.03300
blockblockblock/deepseek-math-7b-rl-bpw4.6 • Text Generation • Updated Apr 17, 2024 • 5
blockblockblock/deepseek-math-7b-rl-bpw4.8 • Text Generation • Updated Apr 17, 2024 • 3
blockblockblock/deepseek-math-7b-rl-bpw5 • Text Generation • Updated Apr 17, 2024 • 3
blockblockblock/deepseek-math-7b-rl-bpw5.5 • Text Generation • Updated Apr 17, 2024 • 5
blockblockblock/deepseek-math-7b-rl-bpw6 • Text Generation • Updated Apr 17, 2024 • 5
RichardErkhov/deepseek-ai_-_deepseek-math-7b-rl-4bits • Text Generation • Updated May 2, 2024 • 88
RichardErkhov/deepseek-ai_-_deepseek-math-7b-rl-8bits • Text Generation • Updated May 2, 2024 • 78
sudhanshu746/deepseek-math-7b-rl-4bit • Text Generation • Updated Jun 8, 2024 • 7
QuantFactory/deepseek-math-7b-rl-GGUF • Text Generation • Updated Jun 10, 2024 • 905 • 1
sudhanshu746/deepseek-7b-instruct-matho-finetune • Text Generation • Updated Jun 23, 2024 • 9
mav23/deepseek-math-7b-instruct-GGUF • Updated Oct 30, 2024 • 390 • 1
RichardErkhov/deepseek-ai_-_deepseek-math-7b-rl-gguf • Updated Nov 4, 2024 • 1.01k
c01zaut/deepseek-math-7b-rl-rk3588-1.1.2 • Updated Nov 28, 2024 • 2
c01zaut/deepseek-math-7b-rl-rk3588-1.1.4 • Updated Dec 29, 2024 • 4
c01zaut/deepseek-math-7b-instruct-rk3588-1.1.4 • Updated Dec 29, 2024 • 3
qgallouedec/Qwen2-0.5B-GRPO • Updated 24 days ago
nbd22/Llama-3.1-8B-Instruct-GRPO-gsm8k-ft-lora • Updated 15 days ago
sergiopaniego/Qwen2-0.5B-GRPO • Updated 12 days ago
spinech/qwen-2.5-3b-r1-countdown • Text Generation • Updated 12 days ago • 6
riddickz/Qwen2.5-1.5B-Open-R1-GRPO • Text Generation • Updated 10 days ago • 5
yooneo/qwen-0.5b-r1-aha • Updated 11 days ago
yooneo/qwen-1.5b-r1-aha • Updated 11 days ago
spinech/qwen2.5-3b-r1-rearc-stage1 • Text Generation • Updated 11 days ago • 128
hyunw3/qwen-2.5-0.5b-r1-countdown • Text Generation • Updated 11 days ago • 5
hyunw3/qwen-2.5-0.5b-r1-countdown_lr1.0e-6 • Text Generation • Updated 11 days ago • 3
mgaimm/qwen-2.5-3b-r1-countdown • Text Generation • Updated 10 days ago • 10
tuyentx/qwen-2.5-3b-r1-countdown • Text Generation • Updated 10 days ago • 9
pablo-chocobar/qwen-2.5-3b-r1-countdown • Text Generation • Updated 8 days ago • 7
Julian-Sheeper/Qwen2.5-1.5B-Open-R1-GRPO • Text Generation • Updated 10 days ago • 6
pullpull/qwen-2.5-3b-r1-countdown • Text Generation • Updated 10 days ago • 2