Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning Paper • 2502.06060 • Published 5 days ago • 29
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper • 2502.05171 • Published 7 days ago • 89
SYNTHETIC-1 Collection A collection of tasks & verifiers for reasoning datasets • 5 items • Updated 8 days ago • 33
Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control • 11 days ago • 95
Self-supervised Quantized Representation for Seamlessly Integrating Knowledge Graphs with Large Language Models Paper • 2501.18119 • Published 15 days ago • 24