Submitted by Liang0223 84 On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification · 10 authors 37 4
Submitted by sundrops 59 Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation · 14 authors 38 2
Submitted by ZhangYuhan 22 Hi3DEval: Advancing 3D Generation Evaluation with Hierarchical Validity · 8 authors 3
Submitted by shuaishuaicdp 16 Are We on the Right Way for Assessing Document Retrieval-Augmented Generation? · 7 authors 14 2
Submitted by WhiteCatY 8 Can Large Multimodal Models Actively Recognize Faulty Inputs? A Systematic Evaluation Framework of Their Input Scrutiny Ability · 5 authors 2
Submitted by amazingj 6 Evaluating, Synthesizing, and Enhancing for Customer Support Conversation · 7 authors 2
Submitted by yichaodu 6 Don't Overthink It: A Survey of Efficient R1-style Large Reasoning Models · 11 authors 14 2
Submitted by SiriusL 5 InfiAlign: A Scalable and Sample-Efficient Framework for Aligning LLMs to Enhance Reasoning Capabilities · 7 authors 3
Submitted by HenghuiDing 4 MOSEv2: A More Challenging Dataset for Video Object Segmentation in Complex Scenes · 8 authors 2
Submitted by ChengmingX 4 StrandDesigner: Towards Practical Strand Generation with Sketch Guidance · 9 authors 3 2
Submitted by yxl66666 2 Visual Document Understanding and Question Answering: A Multi-Agent Collaboration Framework with Test-Time Scaling · 9 authors 2
Submitted by amanchadha 1 PRvL: Quantifying the Capabilities and Risks of Large Language Models for PII Redaction · 6 authors 2
Submitted by ZhengChen1999 1 Steering One-Step Diffusion Model with Fidelity-Rich Decoder for Fast Image Compression · 6 authors 2
Submitted by mnandwana 1 REINA: Regularized Entropy Information-Based Loss for Efficient Simultaneous Speech Translation · 4 authors 2
Submitted by amanchadha 1 I Think, Therefore I Am Under-Qualified? A Benchmark for Evaluating Linguistic Shibboleth Detection in LLM Hiring Evaluations · 4 authors 2
Submitted by reshmighosh 1 Hop, Skip, and Overthink: Diagnosing Why Reasoning Models Fumble during Multi-Hop Analysis · 10 authors 2
Submitted by fengyiwu 1 RPCANet++: Deep Interpretable Robust PCA for Sparse Object Segmentation · 7 authors 68 2
Submitted by liuziyan 1 I2CR: Intra- and Inter-modal Collaborative Reflections for Multimodal Entity Linking · 9 authors 2
Submitted by Zihao1 - Attention Basin: Why Contextual Position Matters in Large Language Models · 9 authors 2
Submitted by nielsr - Unlocking the Potential of MLLMs in Referring Expression Segmentation via a Light-weight Mask Decoder · 5 authors 2