HoT: Highlighted Chain of Thought for Referencing Supporting Facts from Inputs Paper • 2503.02003 • Published 10 days ago • 42
ZeroBench: An Impossible Visual Benchmark for Contemporary Large Multimodal Models Paper • 2502.09696 • Published 28 days ago • 39
VideoGameBunny: Towards vision assistants for video games Paper • 2407.15295 • Published Jul 21, 2024 • 22
Zoom is what you need: An empirical study of the power of zoom and spatial biases in image classification Paper • 2304.05538 • Published Apr 11, 2023 • 2