This repository contains the models presented in the paper Filter Like You Test: Data-Driven Data Filtering for CLIP Pretraining. Included are FLYT and M-FLYT scoring models, as well as models trained on datasets filtered by these methods.

For usage examples and more information visit our GitHub repository.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no library tag.

Collection including formll/FLYT-models