DeepHermes Collection Preview models of hybrid reasoner Hermes series • 6 items • Updated about 6 hours ago • 13
PC-Agent: A Hierarchical Multi-Agent Collaboration Framework for Complex Task Automation on PC Paper • 2502.14282 • Published 22 days ago • 20
AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning Paper • 2503.07608 • Published 3 days ago • 16
VideoPainter Collection Any-length Video Inpainting and Editing with Plug-and-Play Context Control • 4 items • Updated 4 days ago • 2
GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control Paper • 2503.03751 • Published 8 days ago • 19
VidCRAFT3: Camera, Object, and Lighting Control for Image-to-Video Generation Paper • 2502.07531 • Published about 1 month ago • 13
Magic 1-For-1: Generating One Minute Video Clips within One Minute Paper • 2502.07701 • Published about 1 month ago • 34
Animate Your Motion: Turning Still Images into Dynamic Videos Paper • 2403.10179 • Published Mar 15, 2024 • 3
Making Flow-Matching-Based Zero-Shot Text-to-Speech Laugh as You Like Paper • 2402.07383 • Published Feb 12, 2024 • 16
Temporal Preference Optimization Collection Temporal Preference Optimization for Long-form Video Understanding • 3 items • Updated Jan 19 • 5
Textoon: Generating Vivid 2D Cartoon Characters from Text Descriptions Paper • 2501.10020 • Published Jan 17 • 22
view article Article MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era By MiniMax-AI • Jan 15 • 43
EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss Paper • 2402.05008 • Published Feb 7, 2024 • 22