CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers Paper • 2502.06527 • Published 1 day ago • 6
MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation Paper • 2502.04299 • Published 5 days ago • 14
MakeAnything: Harnessing Diffusion Transformers for Multi-Domain Procedural Sequence Generation Paper • 2502.01572 • Published 8 days ago • 20
SliderSpace: Decomposing the Visual Capabilities of Diffusion Models Paper • 2502.01639 • Published 8 days ago • 24
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models Paper • 2502.01061 • Published 9 days ago • 168
PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding Paper • 2501.16411 • Published 15 days ago • 17
Chat With Janus-Pro-7B 🌍 A unified multimodal understanding and generation model. Running on Zero • 1.68k
Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step Paper • 2501.13926 • Published 19 days ago • 34
Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments Paper • 2501.10893 • Published 24 days ago • 23
Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks Paper • 2501.11733 • Published 22 days ago • 27
GameFactory: Creating New Games with Generative Interactive Videos Paper • 2501.08325 • Published 28 days ago • 61