Submitted by ziyaosg · 20 · TOMATO: Assessing Visual Temporal Reasoning Capabilities in Multimodal Foundation Models · 7 authors · 2
Submitted by Ray2333 · 15 · DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models · 6 authors · 5
Submitted by Franck-Dernoncourt · 12 · Survey of User Interface Design and Interaction Techniques in Generative AI Applications · 13 authors · 2
Submitted by LogicTrainer · 9 · CityGaussianV2: Efficient and Geometrically Accurate Reconstruction for Large-Scale Scenes · 5 authors · 2
Submitted by DavidNguyen · 8 · LIBMoE: A Library for comprehensive benchmarking Mixture of Experts in Large Language Models · 5 authors · 2
Submitted by songkey · 8 · HelloMeme: Integrating Spatial Knitting Attentions to Embed High-Level and Fidelity-Rich Conditions in Diffusion Models · 7 authors · 2
Submitted by sascha-kirch · 7 · SambaMixer: State of Health Prediction of Li-ion Batteries using Mamba State Space Models · 4 authors · 2
Submitted by Franck-Dernoncourt · 7 · GRS-QA -- Graph Reasoning-Structured Question Answering Dataset · 10 authors · 2
Submitted by ZenMoore · 6 · M2rc-Eval: Massively Multilingual Repository-level Code Completion Evaluation · 16 authors · 2