Xiang Fu

craigxiangfu

https://fufoundation.co

AI & ML interests

None yet

craigxiangfu's activity

upvoted a paper 3 days ago

NaturalReasoning: Reasoning in the Wild with 2.8M Challenging Questions

Paper • 2502.13124 • Published 23 days ago • 5

liked a dataset 5 days ago

facebook/natural_reasoning

Viewer • Updated 20 days ago • 1.15M • 10.7k • 397

upvoted a paper 7 days ago

Self-rewarding correction for mathematical reasoning

Paper • 2502.19613 • Published 15 days ago • 76

upvoted a collection 9 days ago

Qwen2.5

Collection

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated 15 days ago • 560

liked a model 9 days ago

meta-llama/Llama-3.3-70B-Instruct

Text Generation • Updated Dec 21, 2024 • 855k • 2.13k

upvoted a paper 13 days ago

Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?

Paper • 2502.19361 • Published 15 days ago • 26

upvoted a paper 14 days ago

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 93

upvoted a paper 18 days ago

BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning

Paper • 2501.03226 • Published Jan 6 • 41

liked a Space 21 days ago

The Ultra-Scale Playbook

🌌 The ultimate guide to training LLM on large GPU Clusters • 2.23k

upvoted a paper 24 days ago

Diverse Inference and Verification for Advanced Reasoning

Paper • 2502.09955 • Published 27 days ago • 17

upvoted 4 papers about 2 months ago

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Paper • 2501.09686 • Published Jan 16 • 37

upvoted 2 papers 5 months ago

Can Models Learn Skill Composition from Examples?

Paper • 2409.19808 • Published Sep 29, 2024 • 10

Differential Transformer

Paper • 2410.05258 • Published Oct 7, 2024 • 171

upvoted a paper 6 months ago

Agent Workflow Memory

Paper • 2409.07429 • Published Sep 11, 2024 • 29

liked a dataset 9 months ago

HuggingFaceFW/fineweb-edu

Viewer • Updated Jan 31 • 3.3B • 516k • 649

liked 2 models 9 months ago

meta-llama/Meta-Llama-3-8B

Text Generation • Updated Sep 27, 2024 • 421k • 6.08k

meta-llama/Meta-Llama-3-70B

Text Generation • Updated Sep 27, 2024 • 33.3k • 852