The training datasets used for training the ChEmbed family of text embedding models
AI & ML interests
None defined yet.
Recent Activity
View all activity
Organization Card
Edit this README.md
markdown file to author your organization card.
datasets
73
BASF-AI/uspto-synth-query-abs
Viewer
•
Updated
•
90k
•
23
BASF-AI/uspto-flattened
Viewer
•
Updated
•
7.5k
•
33
BASF-AI/USPTO-75K
Viewer
•
Updated
•
75k
•
46
BASF-AI/uspto-title-abs
Viewer
•
Updated
•
90k
•
43
BASF-AI/ChemRxiv-Train-CC-BY-v2
Viewer
•
Updated
•
138k
•
4
•
1
BASF-AI/PubChem-v4
Viewer
•
Updated
•
393k
•
11
BASF-AI/PubChem-Raw
Viewer
•
Updated
•
2.5M
•
8
BASF-AI/ChemRxivRetrieval
Viewer
•
Updated
•
79.5k
•
52
•
1
BASF-AI/dolma-chem-only-query-generated
Viewer
•
Updated
•
1.17M
•
20
BASF-AI/dolma-chemistry-only
Viewer
•
Updated
•
1.19M
•
26
•
2