Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Yuxin Chen's picture
6

Yuxin Chen

Uasonchen
21world's profile picture
·

AI & ML interests

None yet

Recent Activity

authored a paper about 16 hours ago
Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models
upvoted a paper 1 day ago
Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models
upvoted a paper 7 days ago
StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors
View all activity

Organizations

ARC Lab, Tencent PCG's profile picture

authored a paper about 16 hours ago

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

Paper • 2512.20557 • Published 3 days ago • 45
authored 6 papers 3 months ago

EA-VTR: Event-Aware Video-Text Retrieval

Paper • 2407.07478 • Published Jul 10, 2024 • 1

Taming Rectified Flow for Inversion and Editing

Paper • 2411.04746 • Published Nov 7, 2024

DOGE: Towards Versatile Visual Document Grounding and Referring

Paper • 2411.17125 • Published Nov 26, 2024

Mono2Stereo: A Benchmark and Empirical Study for Stereo Conversion

Paper • 2503.22262 • Published Mar 28 • 1

MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO

Paper • 2505.13031 • Published May 19 • 4

How Far are VLMs from Visual Spatial Intelligence? A Benchmark-Driven Perspective

Paper • 2509.18905 • Published Sep 23 • 29
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs