Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs • Paper • 2412.21187 • Published Dec 30, 2024
MSViT: Dynamic Mixed-Scale Tokenization for Vision Transformers • Paper • 2307.02321 • Published Jul 5, 2023