DataComp

non-profit

https://www.datacomp.ai/dclm/index.html#home

dclm's activity

bencw

authored a paper about 18 hours ago

SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

Paper • 2502.09604 • Published 1 day ago • 22

AmeyaPrabhu

authored a paper 8 days ago

Great Models Think Alike and this Undermines AI Oversight

Paper • 2502.04313 • Published 8 days ago • 25

thomwolf

authored a paper 8 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 10 days ago • 162

weizechen

authored a paper 10 days ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published 11 days ago • 53

lx865712528

authored a paper 16 days ago

Optimizing Large Language Model Training Using FP4 Quantization

Paper • 2501.17116 • Published 17 days ago • 34

Wanfq

authored 2 papers 20 days ago

BlockPruner: Fine-grained Pruning for Large Language Models

Paper • 2406.10594 • Published Jun 15, 2024

ProFuser: Progressive Fusion of Large Language Models

Paper • 2408.04998 • Published Aug 9, 2024

lx865712528

authored a paper 22 days ago

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Paper • 2501.13629 • Published 23 days ago • 44

yentinglin

authored a paper 23 days ago

Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback

Paper • 2501.10799 • Published 27 days ago • 15

greglindahl

authored a paper 30 days ago

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published Jan 14 • 55

thomwolf

authored a paper 30 days ago

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published Jan 14 • 55

Lewis-Lau

authored 2 papers 30 days ago

T-Rex: Text-assisted Retrosynthesis Prediction

Paper • 2401.14637 • Published Jan 26, 2024

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11 • 83

lx865712528

authored a paper about 1 month ago

EpiCoder: Encompassing Diversity and Complexity in Code Generation

Paper • 2501.04694 • Published Jan 8 • 14

lx865712528

authored a paper about 2 months ago

Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning

Paper • 2412.15797 • Published Dec 20, 2024 • 18

orionweller

authored 5 papers about 2 months ago

NevIR: Negation in Neural Information Retrieval

Paper • 2305.07614 • Published May 12, 2023 • 1

Learning from Task Descriptions

Paper • 2011.08115 • Published Nov 16, 2020

MegaWika: Millions of reports and their sources across 50 diverse languages

Paper • 2307.07049 • Published Jul 13, 2023

Defending Against Poisoning Attacks in Open-Domain Question Answering

Paper • 2212.10002 • Published Dec 20, 2022

Learning to Reason via Program Generation, Emulation, and Search

Paper • 2405.16337 • Published May 25, 2024