jan-hq/Deepseek-Qwen2.5-7B-Redistil-GRPO-cp-800

Text Generation

text-generation-inference

Inference Endpoints

1 contributor

History: 4 commits

Trained with Unsloth

b657404 verified about 17 hours ago