YangWang92

yangwang92

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

unakar666/qwen3-1.7B-adamw

upvoted an article 2 days ago

Introducing Falcon H1R 7B

Article • 2 days ago • 50

upvoted a collection 2 days ago

Spectral-Sphere-Optimizer

Paper-related Model Checkpoints for Reproduction • 4 items • Updated 2 days ago • 3

upvoted a paper 16 days ago

Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience

Paper • 2512.17260 • Published 20 days ago • 48

upvoted a paper 21 days ago

Universal Reasoning Model

Paper • 2512.14693 • Published 22 days ago • 41

upvoted a paper 22 days ago

QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management

Paper • 2512.12967 • Published 24 days ago • 103

upvoted a paper about 1 month ago

Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and Generation

Paper • 2510.24821 • Published Oct 28, 2025 • 38

upvoted 2 papers about 2 months ago

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models

Paper • 2511.08577 • Published Nov 11, 2025 • 105

Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence

Paper • 2511.07384 • Published Nov 10, 2025 • 16

upvoted a collection about 2 months ago

Retrofitting Recurrence

40 items • Updated Nov 11, 2025 • 6

upvoted a paper about 2 months ago

DRIVE: Data Curation Best Practices for Reinforcement Learning with Verifiable Reward in Competitive Code Generation

Paper • 2511.06307 • Published Nov 9, 2025 • 51

upvoted a collection 2 months ago

LLaDA 2.0

7 items • Updated 14 days ago • 39

upvoted 4 papers 3 months ago

Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free

Paper • 2505.06708 • Published May 10, 2025 • 10

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13, 2025 • 176

Diffusion Transformers with Representation Autoencoders

Paper • 2510.11690 • Published Oct 13, 2025 • 165

Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs

Paper • 2507.07996 • Published Jul 10, 2025 • 34

upvoted a collection 3 months ago

L1

L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning • 7 items • Updated Jul 13, 2025 • 8

upvoted 2 papers 3 months ago

TOUCAN: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments

Paper • 2510.01179 • Published Oct 1, 2025 • 25

Soft Tokens, Hard Truths

Paper • 2509.19170 • Published Sep 23, 2025 • 15

upvoted a collection 4 months ago

Qwen3-Next

4 items • Updated 8 days ago • 171

upvoted a paper 4 months ago

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Paper • 2508.14444 • Published Aug 20, 2025 • 39