Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
khuang2
/
qwen-2.5-3b-r1-countdown-offline_query_gen_solvable_only__train_query_gen-ckpt_175
like
0
Text Generation
Transformers
TensorBoard
Safetensors
qwen2
Generated from Trainer
trl
grpo
conversational
text-generation-inference
Inference Endpoints
arxiv:
2402.03300
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
main
qwen-2.5-3b-r1-countdown-offline_query_gen_solvable_only__train_query_gen-ckpt_175
Commit History
Training in progress, step 451
be09575
verified
khuang2
commited on
14 days ago
Model save
fcca926
verified
khuang2
commited on
15 days ago
Training in progress, step 450
0304f69
verified
khuang2
commited on
15 days ago
Training in progress, step 425
752635e
verified
khuang2
commited on
15 days ago
Training in progress, step 400
8649f7c
verified
khuang2
commited on
15 days ago
Training in progress, step 375
4d47191
verified
khuang2
commited on
15 days ago
Training in progress, step 350
330fb98
verified
khuang2
commited on
15 days ago
Training in progress, step 325
2438394
verified
khuang2
commited on
15 days ago
Training in progress, step 300
a446fc3
verified
khuang2
commited on
15 days ago
Training in progress, step 275
8ab178f
verified
khuang2
commited on
15 days ago
Training in progress, step 250
67be2ca
verified
khuang2
commited on
15 days ago
Training in progress, step 225
0c0603d
verified
khuang2
commited on
15 days ago
Training in progress, step 200
71cd19a
verified
khuang2
commited on
15 days ago
Training in progress, step 175
520a5fb
verified
khuang2
commited on
15 days ago
Training in progress, step 150
15bb648
verified
khuang2
commited on
15 days ago
Training in progress, step 125
1ecf708
verified
khuang2
commited on
15 days ago
Training in progress, step 100
479ca7e
verified
khuang2
commited on
15 days ago
Training in progress, step 75
c5dea33
verified
khuang2
commited on
15 days ago
Training in progress, step 50
ac32303
verified
khuang2
commited on
15 days ago
Training in progress, step 25
64bca27
verified
khuang2
commited on
15 days ago
initial commit
22c27d7
verified
khuang2
commited on
15 days ago