MiDashengLM: Efficient Audio Understanding with General Audio Captions Paper • 2508.03983 • Published 5 days ago • 6
Enhancing Vision-Language Model Training with Reinforcement Learning in Synthetic Worlds for Real-World Success Paper • 2508.04280 • Published 5 days ago • 34
Training Long-Context, Multi-Turn Software Engineering Agents with Reinforcement Learning Paper • 2508.03501 • Published 6 days ago • 44
Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens Paper • 2508.01191 • Published 9 days ago • 185
Don't Overthink It: A Survey of Efficient R1-style Large Reasoning Models Paper • 2508.02120 • Published 7 days ago • 10
Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation Paper • 2508.05635 • Published 3 days ago • 64
Efficient Agents: Building Effective Agents While Reducing Cost Paper • 2508.02694 • Published 17 days ago • 70
Agent Lightning: Train ANY AI Agents with Reinforcement Learning Paper • 2508.03680 • Published 5 days ago • 46
CoTox: Chain-of-Thought-Based Molecular Toxicity Reasoning and Prediction Paper • 2508.03159 • Published 6 days ago • 22
OpenMed/OpenMed-NER-PharmaDetect-SuperMedical-125M Token Classification • 0.1B • Updated 6 days ago • 177k • 4
OpenMed/OpenMed-NER-PharmaDetect-SuperClinical-434M Token Classification • 0.4B • Updated 6 days ago • 367k • 9
AgroBench: Vision-Language Model Benchmark in Agriculture Paper • 2507.20519 • Published 14 days ago • 6
Phi-Ground Tech Report: Advancing Perception in GUI Grounding Paper • 2507.23779 • Published 10 days ago • 41