beyoru/MinCoder-4B-Exp

Text Generation

text-generation-inference

beyoru committed on Nov 1, 2025

Commit 8963dab · verified · 1 Parent(s): fa508a8

Update README.md

Files changed (1)

README.md +5 -0

@@ -12,6 +12,11 @@ library_name: transformers
 ## Model details
 This model is fine-tuned from Qwen3-4B-Instruct using a custom reinforcement learning (RL) framework that rewards the model for producing solutions passing automated test cases — similar to the process of programming task evaluation on LeetCode.
+<p align="center">
+  <img src="https://cdn-uploads.huggingface.co/production/uploads/65905af887944e494e37e09a/s4drmYGEYWZyt2ZUkxIpI.png" width="300">
+</p>
 Instead of relying on labeled ground truth answers, the model learns through test-case-based rewards, promoting generalization and reasoning ability in algorithmic problem-solving.
 > This is an experimental model
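
The README describes rewards that come from passing automated test cases rather than from labeled answers. The model's actual RL framework is not shown in this commit, so the following is only a minimal sketch of what such a reward could look like. The function name `reward_from_tests`, the `solve` entry point, and the pass-fraction scoring are all assumptions made for illustration, not the author's implementation.

```python
from typing import Any, Sequence, Tuple

# One case = (positional arguments, expected return value).
Case = Tuple[tuple, Any]


def reward_from_tests(
    solution_source: str,
    cases: Sequence[Case],
    entry_point: str = "solve",  # hypothetical name of the function under test
) -> float:
    """Return the fraction of cases the candidate solution passes (0.0 to 1.0)."""
    if not cases:
        return 0.0

    namespace: dict = {}
    try:
        # WARNING: exec runs untrusted code with no sandbox. A real training
        # setup would isolate this step and enforce time and memory limits.
        exec(solution_source, namespace)
        candidate = namespace[entry_point]
    except Exception:
        # Code that fails to compile or lacks the entry point earns nothing.
        return 0.0

    passed = 0
    for args, expected in cases:
        try:
            if candidate(*args) == expected:
                passed += 1
        except Exception:
            # A runtime error on one case counts as a failed case.
            pass
    return passed / len(cases)
```

Scoring by pass fraction gives partial credit to solutions that handle only some cases. An all-or-nothing reward is an equally plausible design, and the source does not say which one the model was trained with.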