Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models Paper • 2503.09573 • Published about 23 hours ago • 23
Automated Movie Generation via Multi-Agent CoT Planning Paper • 2503.07314 • Published 3 days ago • 34
Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching Paper • 2503.05179 • Published 6 days ago • 42
Forgetting Transformer: Softmax Attention with a Forget Gate Paper • 2503.02130 • Published 10 days ago • 26
The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation Paper • 2503.04606 • Published 7 days ago • 7
GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control Paper • 2503.03751 • Published 8 days ago • 19
Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs Paper • 2503.01743 • Published 10 days ago • 72
Societal Alignment Frameworks Can Improve LLM Alignment Paper • 2503.00069 • Published 14 days ago • 16
Discrete-Time Hybrid Automata Learning: Legged Locomotion Meets Skateboarding Paper • 2503.01842 • Published 10 days ago • 2
Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers Paper • 2503.00865 • Published 11 days ago • 58
SurveyX: Academic Survey Automation via Large Language Models Paper • 2502.14776 • Published 21 days ago • 92
Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity Paper • 2502.13063 • Published 23 days ago • 66
Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation Paper • 2502.13145 • Published 23 days ago • 36
Phantom: Subject-consistent Video Generation via Cross-modal Alignment Paper • 2502.11079 • Published 25 days ago • 52
The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding Paper • 2502.08946 • Published 29 days ago • 184
InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU Paper • 2502.08910 • Published 29 days ago • 143
Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance Paper • 2502.08127 • Published 29 days ago • 50
TransMLA: Multi-head Latent Attention Is All You Need Paper • 2502.07864 • Published 30 days ago • 47