A fine-grained visual reasoning benchmark. (More question types are shown in the extension dataset.)
Sicheng Feng
FSCCS
Recent Activity
Upvoted a paper about 8 hours ago: Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems
Upvoted a paper 9 days ago: WorldWarp: Propagating 3D Geometry with Asynchronous Video Diffusion