view article Article Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial By open-r1 • 12 days ago • 33
PC Agent: While You Sleep, AI Works -- A Cognitive Journey into Digital World Paper • 2412.17589 • Published Dec 23, 2024 • 12
Zero-shot Model-based Reinforcement Learning using Large Language Models Paper • 2410.11711 • Published Oct 15, 2024 • 8