The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding Paper • 2502.08946 • Published 3 days ago • 104
Optimizing Large Language Model Training Using FP4 Quantization Paper • 2501.17116 • Published 18 days ago • 34
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published 18 days ago • 105
Exploring the sustainable scaling of AI dilemma: A projective study of corporations' AI environmental impacts Paper • 2501.14334 • Published 23 days ago • 17
Large Language Models Think Too Fast To Explore Effectively Paper • 2501.18009 • Published 17 days ago • 23
MatAnyone: Stable Video Matting with Consistent Memory Propagation Paper • 2501.14677 • Published 22 days ago • 29
SafeRAG: Benchmarking Security in Retrieval-Augmented Generation of Large Language Model Paper • 2501.18636 • Published 18 days ago • 25
Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs Paper • 2412.21187 • Published Dec 30, 2024 • 37
On the Compositional Generalization of Multimodal LLMs for Medical Imaging Paper • 2412.20070 • Published Dec 28, 2024 • 45
Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization Paper • 2412.18525 • Published Dec 24, 2024 • 73
GameFactory: Creating New Games with Generative Interactive Videos Paper • 2501.08325 • Published Jan 14, 2025 • 62
Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament Paper • 2501.13007 • Published 24 days ago • 20
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 25 days ago • 319
Control LLM: Controlled Evolution for Intelligence Retention in LLM Paper • 2501.10979 • Published 28 days ago • 6
Hallucinations Can Improve Large Language Models in Drug Discovery Paper • 2501.13824 • Published 23 days ago • 9
AdaIR: Adaptive All-in-One Image Restoration via Frequency Mining and Modulation Paper • 2403.14614 • Published Mar 21, 2024 • 3
RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques Paper • 2501.14492 • Published 23 days ago • 30