Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
BASF-AI 's Collections
ChEmbed Training Data
ChEmbed
Chemical Data
ChemTEB Multilingual
ChemTEB Pair Classification Datasets
ChemTEB Classification Datasets
ChemTEB Clustering Datasets
ChemTEB Retrieval Datasets
ChemTEB Bitext Mining Datasets

ChEmbed Training Data

updated Jun 30

The training datasets used for training the ChEmbed family of text embedding models

Upvote
-

  • BASF-AI/dolma-chem-only-query-generated

    Viewer • Updated May 4 • 1.17M • 13

  • BASF-AI/ChemRxiv-Papers

    Viewer • Updated Apr 5 • 30.4k • 42 • 1

  • BASF-AI/ChemRxiv-Train-CC-BY

    Viewer • Updated Apr 24 • 139k • 9

  • BASF-AI/PubChem-v3

    Viewer • Updated Apr 22 • 480k • 7 • 2
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs