LLaVA-o1: Let Vision Language Models Reason Step-by-Step • Paper • 2411.10440 • Published Nov 15, 2024 • 114
Motion Prompting: Controlling Video Generation with Motion Trajectories • Paper • 2412.02700 • Published Dec 3, 2024 • 15
SwiftEdit: Lightning Fast Text-Guided Image Editing via One-Step Diffusion • Paper • 2412.04301 • Published Dec 5, 2024 • 36