HARP: A challenging human-annotated math reasoning benchmark Paper • 2412.08819 • Published Dec 11, 2024
WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training Paper • 2501.18511 • Published 12 days ago