PinPoint / Finetuning /docs /datacomp_models.md
anonymous-upload-neurips-2025's picture
Upload 221 files
88c922f verified

CommonPool and DataComp models

As part of DataComp, we trained models on CommonPool using various data filtering strategies. We release models for all four scales of the competition, small, medium, large and xlarge, corresponding to a pool size and number of samples seen of 12.8M, 128M, 1.28B and 12.8B, respectively.

The models are specified below, see our paper DataComp: In seearch of the next generation of multimodal datasets for more details.

xlarge scale models

large scale models

medium scale models

small scale models