Embodied Red Teaming for Auditing Robotic Foundation Models Paper • 2411.18676 • Published Nov 27, 2024 • 1
Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT Paper • 2502.06782 • Published 1 day ago • 8 • 1
CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers Paper • 2502.06527 • Published 1 day ago • 5 • 1
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling Paper • 2502.06703 • Published 1 day ago • 64 • 4
FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video Generation Paper • 2502.05179 • Published 4 days ago • 17 • 3
Goku: Flow Based Video Generative Foundation Models Paper • 2502.04896 • Published 4 days ago • 58 • 7
On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices Paper • 2502.04363 • Published 7 days ago • 9 • 3
Linear Correlation in LM's Compositional Generalization and Hallucination Paper • 2502.04520 • Published 5 days ago • 9 • 3
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper • 2502.05171 • Published 4 days ago • 41 • 9
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper • 2502.05171 • Published 4 days ago • 41 • 9
Generating Symbolic World Models via Test-time Scaling of Large Language Models Paper • 2502.04728 • Published 5 days ago • 15 • 2
Step Back to Leap Forward: Self-Backtracking for Boosting Reasoning of Language Models Paper • 2502.04404 • Published 6 days ago • 14 • 2
CodeSteer: Symbolic-Augmented Language Models via Code/Text Guidance Paper • 2502.04350 • Published 7 days ago • 8 • 3
Towards Physical Understanding in Video Generation: A 3D Point Regularization Approach Paper • 2502.03639 • Published 6 days ago • 8 • 3
MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation Paper • 2502.04299 • Published 5 days ago • 14 • 3
Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2 Paper • 2502.03544 • Published 6 days ago • 37 • 5
Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis Paper • 2502.04128 • Published 5 days ago • 19 • 4