File size: 454 Bytes
c3f6a60 257f430 c674353 257f430 |
1 2 3 4 5 6 7 8 9 |
---
language:
- en
---
This repo contains the BERT-Topic classifier of the work [Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining](https://arxiv.org/pdf/2410.08102).
For topic and label:
'activity': 0, 'education': 1, 'entertainment': 2, 'finance': 3, 'health': 4, 'business and industrial ': 5, 'infrastructure': 6, 'literature and art': 7, 'nature': 8, 'others': 9, 'law and government': 10, 'networking': 11, 'technology': 12 |