new

Get trending papers in your email inbox once a day!

Get trending papers in your email inbox!

Daily Papers

byAK and the research community

Dec 29

Submitted by

myyzzzoooo

InsertAnywhere: Bridging 4D Scene Geometry and Diffusion Models for Realistic Video Object Insertion

kaist-ai

Submitted by

BishopGorov

Mindscape-Aware Retrieval Augmented Generation for Improved Long Context Understanding

tencent

Submitted by

taesiri

MAI-UI Technical Report: Real-World Centric Foundation GUI Agents

AlibabaTongyiLab

Submitted by

Andrew613

UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture

·
15 authors

Submitted by

zhengli1013

ProEdit: Inversion-based Editing From Prompts Done Right

·
7 authors

Submitted by

fanqiNO1

TimeBill: Time-Budgeted Inference for Large Language Models

·
3 authors

3

Submitted by

zss01

See Less, See Right: Bi-directional Perceptual Shaping For Multimodal Reasoning

IIGroup

Tsinghua IIGroup

Submitted by

Andrew613

Omni-Weather: Unified Multimodal Foundation Model for Weather Generation and Understanding

·
13 authors

2

Submitted by

m-Just

InSight-o3: Empowering Multimodal Foundation Models with Generalized Visual Search

·
10 authors

Submitted by

wenzhengzeng

SlideTailor: Personalized Presentation Slide Generation for Scientific Papers

NationalUniversityofSingapore

National University of Singapore

Submitted by

taesiri

SWE-RM: Execution-free Feedback For Software Engineering Agents

·
9 authors

Submitted by

taesiri

SVBench: Evaluation of Video Generation Models on Social Reasoning

·
7 authors

Submitted by

dronperminov

A 58-Addition, Rank-23 Scheme for General 3x3 Matrix Multiplication

·
1 authors

Submitted by

txy

Rethinking Sample Polarity in Reinforcement Learning with Verifiable Rewards

·
8 authors

Submitted by

BK-Lee

Masking Teacher and Reinforcing Student for Distilling Vision-Language Models

nvidia