Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs Paper β’ 2501.18585 β’ Published 15 days ago β’ 52 β’ 11
Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs Paper β’ 2501.18585 β’ Published 15 days ago β’ 52
view article Article FuseO1-Preview: System-II Reasoning Fusion of LLMs By Wanfq and 4 others β’ 25 days ago β’ 14
Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models Paper β’ 2501.11873 β’ Published 24 days ago β’ 63
view article Article πΊπ¦ββ¬ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs By wolfram β’ Dec 4, 2024 β’ 76