39 194 48

KABI

dongguanting

https://dongguanting.github.io/

AI & ML interests

Reasoning and Alignment for Large Language Models

Recent Activity

liked a model 4 days ago

dongguanting/QwQ-32B-AEPO-DeepSearch

upvoted a paper 5 days ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

upvoted a paper 10 days ago

Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience

View all activity

Organizations

liked a model 4 days ago

dongguanting/QwQ-32B-AEPO-DeepSearch

Text Generation • 33B • Updated 12 days ago • 13 • 1

upvoted a paper 5 days ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 85

upvoted a paper 10 days ago

Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience

Paper • 2512.17260 • Published 13 days ago • 48

liked a model 12 days ago

dongguanting/Qwen3-8B-AEPO-DeepSearch

Text Generation • 8B • Updated 12 days ago • 22 • 2

updated 2 models 12 days ago

dongguanting/Qwen3-8B-AEPO-DeepSearch

Text Generation • 8B • Updated 12 days ago • 22 • 2

dongguanting/QwQ-32B-AEPO-DeepSearch

Text Generation • 33B • Updated 12 days ago • 13 • 1

updated a collection 12 days ago

AEPO

Collection

The official datasets and model checkpoints of AEPO • 5 items • Updated 12 days ago • 4

updated a model 12 days ago

dongguanting/QwQ-32B-ARPO-DeepSearch

33B • Updated 12 days ago • 9

updated a collection 12 days ago

ARPO

Collection

The official datasets and model checkpoints of ARPO • 10 items • Updated 12 days ago • 6

upvoted a paper 16 days ago

Memory in the Age of AI Agents

Paper • 2512.13564 • Published 17 days ago • 120

published 2 models 17 days ago

dongguanting/QwQ-32B-ARPO-DeepSearch

33B • Updated 12 days ago • 9

dongguanting/QwQ-32B-AEPO-DeepSearch

Text Generation • 33B • Updated 12 days ago • 13 • 1

upvoted 2 papers 17 days ago

Thinking with Images via Self-Calling Agent

Paper • 2512.08511 • Published 23 days ago • 21

Long-horizon Reasoning Agent for Olympiad-Level Mathematical Problem Solving

Paper • 2512.10739 • Published 21 days ago • 45

upvoted a paper 29 days ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 279

upvoted 3 papers about 1 month ago

upvoted a paper about 2 months ago

DeepEyesV2: Toward Agentic Multimodal Model

Paper • 2511.05271 • Published Nov 7, 2025 • 42

KABI

AI & ML interests

Recent Activity

Organizations

dongguanting's activity