Dongwei
/

Qwen2.5-1.5B-Open-R1-GRPO_Math_smalllr

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Qwen2.5-1.5B-Open-R1-GRPO_Math_smalllr / merges.txt

Dongwei's picture

Model save

035ae8a verified 16 days ago

history contribute delete

1.67 MB

File too large to display, you can check the raw version instead.