Lossless Acceleration of Large Language Models with Hierarchical Drafting based on Temporal Locality in Speculative Decoding Paper • 2502.05609 • Published 6 days ago • 13
Revisiting In-Context Learning with Long Context Language Models Paper • 2412.16926 • Published Dec 22, 2024 • 30
VideoRAG: Retrieval-Augmented Generation over Video Corpus Paper • 2501.05874 • Published Jan 10 • 67
EXIT: Context-Aware Extractive Compression for Enhancing Retrieval-Augmented Generation Paper • 2412.12559 • Published Dec 17, 2024 • 1