Update README.md
Browse files
README.md
CHANGED
|
@@ -13,9 +13,11 @@ library_name: transformers
|
|
| 13 |
|
| 14 |
# VL-Rethinker-7B
|
| 15 |
|
|
|
|
|
|
|
| 16 |
**VL-Rethinker-7B** achieves SoTA results on various multimodal reasoning benchmarks.
|
| 17 |
|
| 18 |
-
It is trained using the **GRPO-SSR and Forced Rethinking** techniques, using meticulously curated
|
| 19 |
|
| 20 |
For details of our approach and performance comparison, please see our [paper](https://github.com/TIGER-AI-Lab/VL-Rethinker/blob/main/paper.pdf).
|
| 21 |
|
|
|
|
| 13 |
|
| 14 |
# VL-Rethinker-7B
|
| 15 |
|
| 16 |
+
**🚀 News:** <u>We release our meticulously curated collection of RL training queries for multimodal reasoning: [ViRL39K](https://huggingface.co/datasets/TIGER-Lab/ViRL39K).</u>
|
| 17 |
+
|
| 18 |
**VL-Rethinker-7B** achieves SoTA results on various multimodal reasoning benchmarks.
|
| 19 |
|
| 20 |
+
It is trained using the **GRPO-SSR and Forced Rethinking** techniques, using meticulously curated [ViRL39K](https://huggingface.co/datasets/TIGER-Lab/ViRL39K).
|
| 21 |
|
| 22 |
For details of our approach and performance comparison, please see our [paper](https://github.com/TIGER-AI-Lab/VL-Rethinker/blob/main/paper.pdf).
|
| 23 |
|