Kaiyan Zhang

iseesaw

iseesaw's activity

authored a paper about 14 hours ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published 9 days ago • 53

authored a paper about 15 hours ago

MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding

Paper • 2501.18362 • Published 13 days ago • 19

upvoted 3 articles about 17 hours ago

Article • Open-source DeepResearch – Freeing our search agents • 8 days ago • 919

Article • What is test-time compute and how to scale it? • and 1 other • 5 days ago • 18

Article • Open R1: Update #2 • and 6 others • 1 day ago • 126

liked a dataset about 18 hours ago

open-r1/OpenR1-Math-220k

Viewer • Updated about 19 hours ago • 225k • 260 • 160

liked a dataset about 21 hours ago

AI-MO/NuminaMath-1.5

Viewer • Updated 1 day ago • 896k • 146 • 68

authored a paper about 22 hours ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published 1 day ago • 83

upvoted a paper about 22 hours ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published 1 day ago • 83

upvoted a paper 12 days ago

MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding

Paper • 2501.18362 • Published 13 days ago • 19

upvoted an article about 1 month ago

Article • Process Reinforcement through Implicit Rewards • and 1 other • Jan 3 • 23

upvoted a collection about 1 month ago

Reasoning Datasets

Collection • Reasoning datasets that are trending 🔥 • 10 items • Updated Jan 3 • 24

authored a paper about 1 month ago

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

Paper • 2412.17739 • Published Dec 23, 2024 • 40

upvoted a paper about 2 months ago

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

Paper • 2412.17739 • Published Dec 23, 2024 • 40

commented on a paper about 2 months ago

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

Paper • 2412.17739 • Published Dec 23, 2024 • 40

authored a paper about 2 months ago

How to Synthesize Text Data without Model Collapse?

Paper • 2412.14689 • Published Dec 19, 2024 • 49

liked a Space about 2 months ago

Space • Anychat 🏢 • 1.88k

liked a Space 2 months ago

Space • QwQ-32B-Preview 🔍 • 877

authored a paper 2 months ago

Free Process Rewards without Process Labels

Paper • 2412.01981 • Published Dec 2, 2024 • 32

upvoted a paper 2 months ago

Free Process Rewards without Process Labels

Paper • 2412.01981 • Published Dec 2, 2024 • 32