CodeI/O Collection Collection for CodeI/O @ https://codei-o.github.io/ • 15 items • Updated 1 day ago • 3
CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction Paper • 2502.07316 • Published 3 days ago • 28
QuEST: Stable Training of LLMs with 1-Bit Weights and Activations Paper • 2502.05003 • Published 7 days ago • 38
Reasoning Datasets Collection Distilled synthetic Reasoning datasets • 7 items • Updated 12 days ago • 51
view article Article Process Reinforcement through Implicit Rewards By ganqu and 1 other • Jan 3 • 23
Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models Paper • 2501.13629 • Published 22 days ago • 44