R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model Paper β’ 2503.05132 β’ Published 7 days ago β’ 46
LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper β’ 2411.10440 β’ Published Nov 15, 2024 β’ 114
view article Article Decoding Strategies in Large Language Models By mlabonne β’ Oct 29, 2024 β’ 47
Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal Language Models Paper β’ 2307.14539 β’ Published Jul 26, 2023 β’ 2 β’ 1