Learning Video Representations without Natural Videos • Paper • 2410.24213 • Published Oct 31, 2024
ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning • Paper • 2410.17779 • Published Oct 23, 2024