Submitted by taesiri 73 OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling · 19 authors 175 2
Submitted by xhyandwyy 35 UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning · 11 authors 5.64k 2
Submitted by taesiri 23 InternScenes: A Large-scale Simulatable Indoor Scene Dataset with Realistic Layouts · 12 authors 123 2
Submitted by taesiri 10 LazyDrag: Enabling Stable Drag-Based Editing on Multi-Modal Diffusion Transformers via Explicit Correspondence · 7 authors 2
Submitted by Iman998 9 SearchInstruct: Enhancing Domain Adaptation via Retrieval-Based Instruction Dataset Creation · 3 authors 8 2
Submitted by ylu610 8 Learning to Optimize Multi-Objective Alignment Through Dynamic Reward Weighting · 9 authors 4 2
Submitted by ottogin 8 Locality in Image Diffusion Models Emerges from Data Statistics · 4 authors 13 2
Submitted by kaiyangzhou 5 Measuring Epistemic Humility in Multimodal Large Language Models · 4 authors 7 3
Submitted by Macro 3 Look Again, Think Slowly: Enhancing Visual Reflection in Vision-Language Models · 6 authors 2
Submitted by gauravfs-14 3 CognitiveSky: Scalable Sentiment and Narrative Analysis for Decentralized Social Media · 3 authors 4 3
Submitted by taesiri 2 PersonaX: Multimodal Datasets with LLM-Inferred Behavior Traits · 10 authors 2
Submitted by Eureka-Leo 1 Dr.V: A Hierarchical Perception-Temporal-Cognition Framework to Diagnose Video Hallucination by Fine-grained Spatial-Temporal Grounding · 15 authors 2
Submitted by UVSKKR 1 EthicsMH: A Pilot Benchmark for Ethical Reasoning in Mental Health AI · 1 author 2
Submitted by AnirbanSaha 1 ClaimIQ at CheckThat! 2025: Comparing Prompted and Fine-Tuned Language Models for Verifying Numerical Claims · 4 authors 2
Submitted by amanchadha 1 FuseCodec: Semantic-Contextual Fusion and Supervision for Neural Codecs · 9 authors 2
Submitted by yixuantt 1 GAPrune: Gradient-Alignment Pruning for Domain-Aware Embeddings · 2 authors 1 2
Submitted by mayankagarwal - ToolRM: Outcome Reward Models for Tool-Calling Large Language Models · 7 authors 2
Submitted by Cloudriver - LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction · 14 authors 2