Lewis Tunstall

lewtun

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

lewtun's activity

New activity in open-r1/OpenR1-Math-220k about 11 hours ago

Update README.md

#1 opened 1 day ago by

davidberenstein1957

liked a dataset about 11 hours ago

Anthropic/EconomicIndex

Viewer • Updated 1 day ago • 3.51k • 3.7k • 76

upvoted a collection about 11 hours ago

OpenR1-Math

Collection

Dataset and SFT model distilled from DeepSeek-R1. Check out our blog post for more details: https://huggingface.co/blog/open-r1/update-2 • 2 items • Updated about 12 hours ago • 2

liked a model about 11 hours ago

open-r1/OpenR1-Qwen-7B

Text Generation • Updated about 13 hours ago • 315 • 13

updated 2 collections about 12 hours ago

OpenR1-Math

Collection

Dataset and SFT model distilled from DeepSeek-R1. Check out our blog post for more details: https://huggingface.co/blog/open-r1/update-2 • 2 items • Updated about 12 hours ago • 2

🧠 Reasoning datasets

Collection

Datasets with reasoning traces for math and code released by the community • 11 items • Updated about 12 hours ago • 48

liked a dataset about 12 hours ago

simplescaling/s1K

Viewer • Updated about 20 hours ago • 1k • 2.67k • 147

updated a model about 13 hours ago

open-r1/OpenR1-Qwen-7B

Text Generation • Updated about 13 hours ago • 315 • 13

posted an update 1 day ago

Introducing OpenR1-Math-220k!

open-r1/OpenR1-Math-220k

The community has been busy distilling DeepSeek-R1 from inference providers, but we decided to have a go at doing it ourselves from scratch 💪

What’s new compared to existing reasoning datasets?

♾ Based on AI-MO/NuminaMath-1.5: We focus on math reasoning traces and generate answers for problems in NuminaMath 1.5, an improved version of the popular NuminaMath-CoT dataset.

🐳 800k R1 reasoning traces: We generate two answers for each of 400k problems using DeepSeek-R1. The filtered dataset contains 220k problems with correct reasoning traces.

📀 512 H100s running locally: Instead of relying on an API, we leverage vLLM and SGLang to run generations locally on our science cluster, generating 180k reasoning traces per day.

⏳ Automated filtering: We apply Math Verify to retain only problems with at least one correct answer. We also leverage Llama3.3-70B-Instruct as a judge to retrieve more correct examples (e.g. for cases with malformed answers that can’t be verified with a rule-based parser).

📊 We match the performance of DeepSeek-R1-Distill-Qwen-7B by finetuning Qwen2.5-Math-7B-Instruct on our dataset.

🔎 Read our blog post for all the nitty-gritty details: https://huggingface.co/blog/open-r1/update-2
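
The filtering step described above (keep a problem if at least one of its generated answers is correct, and fall back to a judge when the rule-based check fails) can be sketched roughly as follows. This is a minimal illustration, not the pipeline used for the dataset: `Problem`, `rule_based_match`, `keep_verified`, and the `judge` callable are hypothetical names, and the exact-fraction comparison is only a stand-in for Math Verify, not its API.

```python
from dataclasses import dataclass
from fractions import Fraction
from typing import Callable, List, Optional


@dataclass
class Problem:
    question: str
    reference: str          # ground-truth final answer
    generations: List[str]  # final answers extracted from each generated trace


def rule_based_match(candidate: str, reference: str) -> bool:
    """Hypothetical stand-in for a rule-based checker such as Math Verify.

    Compares both strings as exact rationals, so "0.5" matches "1/2".
    Anything that does not parse as a number counts as unverifiable.
    """
    try:
        return Fraction(candidate.strip()) == Fraction(reference.strip())
    except (ValueError, ZeroDivisionError):
        return False


# Assumed judge signature: (question, candidate, reference) -> bool.
Judge = Callable[[str, str, str], bool]


def keep_verified(problems: List[Problem], judge: Optional[Judge] = None) -> List[Problem]:
    """Keep problems that have at least one answer accepted as correct."""
    kept = []
    for p in problems:
        # First pass: cheap rule-based verification of every generation.
        ok = any(rule_based_match(g, p.reference) for g in p.generations)
        # Second pass: only when the parser accepts nothing, ask the judge
        # (in the post, an LLM) about answers the parser could not handle.
        if not ok and judge is not None:
            ok = any(judge(p.question, g, p.reference) for g in p.generations)
        if ok:
            kept.append(p)
    return kept
```

Calling `keep_verified(problems)` applies the rule-based pass only. Passing a `judge` callable adds the second pass, which is where malformed but correct answers would be recovered.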