GAIR-ProX

community

https://gair-nlp.github.io/ProX/

AI & ML interests

NLP Research

Recent Activity

koalazf99 updated a dataset about 4 hours ago

gair-prox/DCLM-pro

Pengfei authored a paper 8 days ago

LIMO: Less is More for Reasoning

lockon authored a paper 23 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

View all activity

gair-prox's activity

koalazf99

updated a dataset about 4 hours ago

gair-prox/DCLM-pro

Viewer • Updated about 6 hours ago • 137M • 1

Pengfei

authored a paper 8 days ago

LIMO: Less is More for Reasoning

Paper • 2502.03387 • Published 9 days ago • 49

lockon

authored a paper 23 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 23 days ago • 318

Pengfei

authored a paper about 1 month ago

O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning

Paper • 2501.06458 • Published Jan 11 • 29

lockon

authored a paper about 2 months ago

Diving into Self-Evolving Training for Multimodal Reasoning

Paper • 2412.17451 • Published Dec 23, 2024 • 43

koalazf99

authored a paper about 2 months ago

Diving into Self-Evolving Training for Multimodal Reasoning

Paper • 2412.17451 • Published Dec 23, 2024 • 43

Pengfei

authored a paper 3 months ago

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Paper • 2411.16489 • Published Nov 25, 2024 • 43

SivilTaram

authored 3 papers 3 months ago

When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training

Paper • 2411.13476 • Published Nov 20, 2024 • 15

Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

Paper • 2407.10956 • Published Jul 15, 2024 • 6

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Paper • 2411.04905 • Published Nov 7, 2024 • 115

SivilTaram

authored a paper 4 months ago

Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates

Paper • 2410.07137 • Published Oct 9, 2024 • 7

SivilTaram

authored a paper 5 months ago

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25, 2024 • 61

Pengfei

authored a paper 5 months ago

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25, 2024 • 61

lockon

authored a paper 5 months ago

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25, 2024 • 61

SinclairWang

authored a paper 5 months ago

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25, 2024 • 61

koalazf99

authored a paper 5 months ago

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25, 2024 • 61

Pengfei

authored a paper 6 months ago

OpenResearcher: Unleashing AI for Accelerated Scientific Research

Paper • 2408.06941 • Published Aug 13, 2024 • 32

SinclairWang

authored a paper 7 months ago

Data Contamination Report from the 2024 CONDA Shared Task

Paper • 2407.21530 • Published Jul 31, 2024 • 10

Pengfei

authored a paper 7 months ago

Data Contamination Report from the 2024 CONDA Shared Task

Paper • 2407.21530 • Published Jul 31, 2024 • 10

SivilTaram

authored a paper 7 months ago

Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies

Paper • 2407.13623 • Published Jul 18, 2024 • 54