Tiny Time Mixers (TTMs): Fast Pre-trained Models for Enhanced Zero/Few-Shot Forecasting of Multivariate Time Series Paper • 2401.03955 • Published Jan 8, 2024 • 8
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling Paper • 2502.06703 • Published 1 day ago • 83
Papers Collection Large Language Model (LLM) and NLP-related papers. • 190 items • Updated 3 days ago • 9
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence Paper • 2406.11931 • Published Jun 17, 2024 • 63
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 21 days ago • 315
Multi-Grained Knowledge Retrieval for End-to-End Task-Oriented Dialog Paper • 2305.10149 • Published May 17, 2023 • 2
Weighted-Reward Preference Optimization for Implicit Model Fusion Paper • 2412.03187 • Published Dec 4, 2024 • 10
Article FuseO1-Preview: System-II Reasoning Fusion of LLMs By Wanfq and 4 others • 22 days ago • 13