RewardSDS: Aligning Score Distillation via Reward-Weighted Sampling Paper • 2503.09601 • Published 1 day ago • 10
GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training Paper • 2503.08525 • Published 2 days ago • 10
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models Paper • 2503.09573 • Published 1 day ago • 31
Reangle-A-Video: 4D Video Generation as Video-to-Video Translation Paper • 2503.09151 • Published 1 day ago • 24
VLog: Video-Language Models by Generative Retrieval of Narration Vocabulary Paper • 2503.09402 • Published 1 day ago • 4
Cost-Optimal Grouped-Query Attention for Long-Context LLMs Paper • 2503.09579 • Published 1 day ago • 3
Alias-Free Latent Diffusion Models:Improving Fractional Shift Equivariance of Diffusion Latent Space Paper • 2503.09419 • Published 1 day ago • 3
Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation Paper • 2503.06594 • Published 4 days ago • 4
AnyMoLe: Any Character Motion In-betweening Leveraging Video Diffusion Models Paper • 2503.08417 • Published 2 days ago • 6
Benchmarking AI Models in Software Engineering: A Review, Search Tool, and Enhancement Protocol Paper • 2503.05860 • Published 6 days ago • 7
^RFLAV: Rolling Flow matching for infinite Audio Video generation Paper • 2503.08307 • Published 3 days ago • 8