Yuxin Chen's picture

6

Yuxin Chen

Uasonchen

·

AI & ML interests

None yet

Recent Activity

authored a paper about 16 hours ago

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

upvoted a paper 1 day ago

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

upvoted a paper 7 days ago

StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors

View all activity

Organizations

authored a paper about 16 hours ago

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

Paper • 2512.20557 • Published 3 days ago • 45

authored 6 papers 3 months ago

EA-VTR: Event-Aware Video-Text Retrieval

Paper • 2407.07478 • Published Jul 10, 2024 • 1

Taming Rectified Flow for Inversion and Editing

Paper • 2411.04746 • Published Nov 7, 2024

DOGE: Towards Versatile Visual Document Grounding and Referring

Paper • 2411.17125 • Published Nov 26, 2024

Mono2Stereo: A Benchmark and Empirical Study for Stereo Conversion

Paper • 2503.22262 • Published Mar 28 • 1

MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO

Paper • 2505.13031 • Published May 19 • 4

How Far are VLMs from Visual Spatial Intelligence? A Benchmark-Driven Perspective

Paper • 2509.18905 • Published Sep 23 • 29