wei

fengwei

AI & ML interests

None yet

Organizations

None yet

fengwei's activity

upvoted a paper 1 day ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 7 days ago • 154

upvoted 6 papers 7 days ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published 11 days ago • 99

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published 9 days ago • 109

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published 21 days ago • 91

Chain-of-Retrieval Augmented Generation

Paper • 2501.14342 • Published 19 days ago • 48

Baichuan-Omni-1.5 Technical Report

Paper • 2501.15368 • Published 17 days ago • 54

Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published 17 days ago • 55

upvoted a paper 17 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 21 days ago • 315

upvoted a paper 27 days ago

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published 29 days ago • 54

upvoted 2 papers about 1 month ago

GeAR: Generation Augmented Retrieval

Paper • 2501.02772 • Published Jan 6 • 23

YuLan-Mini: An Open Data-efficient Language Model

Paper • 2412.17743 • Published Dec 23, 2024 • 64

upvoted 4 papers about 2 months ago

RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response

Paper • 2412.14922 • Published Dec 19, 2024 • 85

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 106

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 128

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 345

upvoted 5 papers 3 months ago

GPT or BERT: why not both?

Paper • 2410.24159 • Published Oct 31, 2024 • 14

Personalization of Large Language Models: A Survey

Paper • 2411.00027 • Published Oct 29, 2024 • 31

OS-ATLAS: A Foundation Action Model for Generalist GUI Agents

Paper • 2410.23218 • Published Oct 30, 2024 • 47

AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents

Paper • 2410.24024 • Published Oct 31, 2024 • 49

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

Paper • 2411.02337 • Published Nov 4, 2024 • 35