Submitted by akhaliq 107 Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models · 51 authors 4
Submitted by koalazf99 61 Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale · 5 authors 4
Submitted by akhaliq 15 DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion · 6 authors 3
Submitted by michaal94 14 AIM 2024 Sparse Neural Rendering Challenge: Dataset and Benchmark · 6 authors 2
Submitted by OAOA 13 Degradation-Guided One-Step Image Super-Resolution with Diffusion Priors · 5 authors 5
Submitted by akhaliq 12 HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks at Scale · 3 authors 2
Submitted by akhaliq 12 Synchronize Dual Hands for Physics-Based Dexterous Guitar Playing · 2 authors 2
Submitted by chuanenlin 11 NoTeeline: Supporting Real-Time Notetaking from Keypoints with Large Language Models · 5 authors 2
Submitted by akhaliq 7 TalkinNeRF: Animatable Neural Fields for Full-Body Talking Humans · 6 authors 2