DeepRAG: Thinking to Retrieval Step by Step for Large Language Models Paper • 2502.01142 • Published 9 days ago • 21
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 7 days ago • 154
Preference Leakage: A Contamination Problem in LLM-as-a-judge Paper • 2502.01534 • Published 9 days ago • 34
Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs Paper • 2501.18585 • Published 12 days ago • 51
Large Language Models Think Too Fast To Explore Effectively Paper • 2501.18009 • Published 13 days ago • 22
GuardReasoner: Towards Reasoning-based LLM Safeguards Paper • 2501.18492 • Published 13 days ago • 80
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding Paper • 2501.13106 • Published 20 days ago • 79