- Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation
  Paper • 2408.09787 • Published • 8
- Factorized-Dreamer: Training A High-Quality Video Generator with Limited and Low-Quality Data
  Paper • 2408.10119 • Published • 17
- ControlNeXt: Powerful and Efficient Control for Image and Video Generation
  Paper • 2408.06070 • Published • 53
- Tora: Trajectory-oriented Diffusion Transformer for Video Generation
  Paper • 2407.21705 • Published • 27
Collections including paper arxiv:2403.13535

- ID-Animator: Zero-Shot Identity-Preserving Human Video Generation
  Paper • 2404.15275 • Published
- IDAdapter: Learning Mixed Features for Tuning-Free Personalization of Text-to-Image Models
  Paper • 2403.13535 • Published • 22
- PhotoVerse: Tuning-Free Image Customization with Text-to-Image Diffusion Models
  Paper • 2309.05793 • Published • 50
- GHOST 2.0: generative high-fidelity one shot transfer of heads
  Paper • 2502.18417 • Published • 63

- InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation
  Paper • 2404.19427 • Published • 72
- ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving
  Paper • 2404.16771 • Published • 18
- Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control
  Paper • 2405.12970 • Published • 25
- FlashFace: Human Image Personalization with High-fidelity Identity Preservation
  Paper • 2403.17008 • Published • 20

- Generative AI meets 3D: A Survey on Text-to-3D in AIGC Era
  Paper • 2305.06131 • Published • 2
- Perpetual Humanoid Control for Real-time Simulated Avatars
  Paper • 2305.06456 • Published • 1
- Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold
  Paper • 2305.10973 • Published • 35
- LDM3D: Latent Diffusion Model for 3D
  Paper • 2305.10853 • Published • 10

- TC4D: Trajectory-Conditioned Text-to-4D Generation
  Paper • 2403.17920 • Published • 18
- VP3D: Unleashing 2D Visual Prompt for Text-to-3D Generation
  Paper • 2403.17001 • Published • 6
- GaussianFlow: Splatting Gaussian Dynamics for 4D Content Creation
  Paper • 2403.12365 • Published • 11
- IDAdapter: Learning Mixed Features for Tuning-Free Personalization of Text-to-Image Models
  Paper • 2403.13535 • Published • 22

- Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis
  Paper • 2401.09048 • Published • 10
- Improving fine-grained understanding in image-text pre-training
  Paper • 2401.09865 • Published • 17
- Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
  Paper • 2401.10891 • Published • 60
- Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild
  Paper • 2401.13627 • Published • 74