Submitted by akhaliq 116 LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens · 8 authors 20
Submitted by akhaliq 47 YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information · 3 authors 3
Submitted by akhaliq 20 Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis · 11 authors 1
Submitted by akhaliq 19 In deep reinforcement learning, a pruned network is a good network · 3 authors 1
Submitted by akhaliq 11 Music Style Transfer with Time-Varying Inversion of Diffusion Models · 6 authors 1
Submitted by akhaliq 10 ToDo: Token Downsampling for Efficient Generation of High-Resolution Images · 3 authors 1
Submitted by akhaliq 10 BBA: Bi-Modal Behavioral Alignment for Reasoning with Large Vision-Language Models · 8 authors 1
Submitted by akhaliq 7 Ouroboros: Speculative Decoding with Large Model Enhanced Drafting · 6 authors 1