AI & ML interests
None defined yet.
Recent Activity
Papers
ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning
Think Visually, Reason Textually: Vision-Language Synergy in ARC
InternLM2 Reward Models
-
internlm/internlm2-math-plus-20b
Text Generation • 20B • Updated • 161 • 7 -
internlm/internlm2-math-plus-7b
Text Generation • 8B • Updated • 422 • 11 -
internlm/internlm2-math-plus-1_8b
Text Generation • 2B • Updated • 142 • 12 -
internlm/internlm2-math-plus-mixtral8x22b
Text Generation • 141B • Updated • 60 • 18
-
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Paper • 2508.18265 • Published • 211 -
OpenGVLab/InternVL3_5-241B-A28B
Image-Text-to-Text • 241B • Updated • 1.25k • 132 -
OpenGVLab/InternVL3_5-38B
Image-Text-to-Text • 38B • Updated • 11.7k • 38 -
OpenGVLab/InternVL3_5-30B-A3B
Image-Text-to-Text • 31B • Updated • 44.8k • 37
-
internlm/OREAL-32B
Text Generation • 33B • Updated • 148 • 24 -
internlm/OREAL-7B
Text Generation • 8B • Updated • 83 • • 20 -
internlm/OREAL-DeepSeek-R1-Distill-Qwen-7B
Text Generation • 8B • Updated • 60 • 9 -
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning
Paper • 2502.06781 • Published • 58
-
internlm/internlm-xcomposer2-4khd-7b
Visual Question Answering • Updated • 7.7k • 73 -
internlm/internlm-xcomposer2-vl-7b
Visual Question Answering • Updated • 7.82k • 83 -
internlm/internlm-xcomposer2-vl-1_8b
Visual Question Answering • Updated • 90 • 18 -
internlm/internlm-xcomposer2-7b
Text Generation • Updated • 18.2k • 31
-
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Paper • 2508.18265 • Published • 211 -
OpenGVLab/InternVL3_5-241B-A28B
Image-Text-to-Text • 241B • Updated • 1.25k • 132 -
OpenGVLab/InternVL3_5-38B
Image-Text-to-Text • 38B • Updated • 11.7k • 38 -
OpenGVLab/InternVL3_5-30B-A3B
Image-Text-to-Text • 31B • Updated • 44.8k • 37
-
internlm/OREAL-32B
Text Generation • 33B • Updated • 148 • 24 -
internlm/OREAL-7B
Text Generation • 8B • Updated • 83 • • 20 -
internlm/OREAL-DeepSeek-R1-Distill-Qwen-7B
Text Generation • 8B • Updated • 60 • 9 -
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning
Paper • 2502.06781 • Published • 58
InternLM2 Reward Models
-
internlm/internlm2-math-plus-20b
Text Generation • 20B • Updated • 161 • 7 -
internlm/internlm2-math-plus-7b
Text Generation • 8B • Updated • 422 • 11 -
internlm/internlm2-math-plus-1_8b
Text Generation • 2B • Updated • 142 • 12 -
internlm/internlm2-math-plus-mixtral8x22b
Text Generation • 141B • Updated • 60 • 18
-
internlm/internlm-xcomposer2-4khd-7b
Visual Question Answering • Updated • 7.7k • 73 -
internlm/internlm-xcomposer2-vl-7b
Visual Question Answering • Updated • 7.82k • 83 -
internlm/internlm-xcomposer2-vl-1_8b
Visual Question Answering • Updated • 90 • 18 -
internlm/internlm-xcomposer2-7b
Text Generation • Updated • 18.2k • 31