CoSER: Coordinating LLM-Based Persona Simulation of Established Roles • Paper • 2502.09082 • Published 1 day ago • 16
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning • Paper • 2501.12948 • Published 23 days ago • 318
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training • Paper • 2501.11425 • Published 26 days ago • 91