Create README.md
README.md
---
license: cc-by-4.0
tags:
- multi-label-classification
- text-classification
- onnx
- web-classification
- firefox-ai
- preview
language:
- multilingual
datasets:
- tshasan/multi-label-web-classification
base_model: Alibaba-NLP/gte-modernbert-base
pipeline_tag: text-classification
---

# modernBERT-URLTITLE-classifier-preview

## Model Overview

This is a **preview version** of a multi-label web classification model fine-tuned from `Alibaba-NLP/gte-modernbert-base`. It assigns one or more categories to a web page based on its URL and title. The model supports 11 labels: `Uncatergorized`, `News`, `Entertainment`, `Shop`, `Chat`, `Education`, `Government`, `Health`, `Technology`, `Work`, and `Travel`.

- **Developed by**: Taimur Hasan
- **Model Type**: Multi-label Text Classification
- **Status**: Preview (under active development)

### Architecture

- **Fine-tuning**: Unfroze the last 4 encoder layers and the pooler
- **Problem Type**: Multi-label classification
- **Output Labels**: 11 (`News`, `Entertainment`, `Shop`, `Chat`, `Education`, `Government`, `Health`, `Technology`, `Work`, `Travel`, `Uncatergorized`)
- **Input Format**: Concatenated string: `"{url}:{title}"`
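
The input format and multi-label output described above can be illustrated with a minimal sketch. It only shows how the `"{url}:{title}"` string might be built and how 11 raw logits could be turned into a set of labels. The helper names `build_input` and `decode_multilabel`, the 0.5 threshold, the per-label sigmoid, and the label index order are all assumptions made for illustration; the real order and any tuned thresholds should be read from the model's own configuration.

```python
import math

# Label names are copied from this model card. The index order below is an
# ASSUMPTION for illustration; check the model's config for the real mapping.
LABELS = [
    "News", "Entertainment", "Shop", "Chat", "Education", "Government",
    "Health", "Technology", "Work", "Travel", "Uncatergorized",
]


def build_input(url: str, title: str) -> str:
    """Concatenate URL and title in the "{url}:{title}" format."""
    return f"{url}:{title}"


def sigmoid(z: float) -> float:
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def decode_multilabel(logits, threshold=0.5):
    """Return every (label, probability) pair whose probability meets the threshold.

    Each label is scored independently, so zero, one, or several labels
    may be returned. The 0.5 threshold is a hypothetical default.
    """
    if len(logits) != len(LABELS):
        raise ValueError(f"expected {len(LABELS)} logits, got {len(logits)}")
    scored = ((label, sigmoid(z)) for label, z in zip(LABELS, logits))
    return [(label, p) for label, p in scored if p >= threshold]


if __name__ == "__main__":
    text = build_input("https://example.com/news/tech", "Example tech headline")
    print(text)
    # Made-up logits standing in for real model output.
    fake_logits = [3.0, -3.0, -3.0, -3.0, -3.0, -3.0, -3.0, 2.0, -3.0, -3.0, -3.0]
    for label, prob in decode_multilabel(fake_logits):
        print(f"{label}: {prob:.3f}")
```

In practice the string returned by `build_input` would be tokenized and passed to the model, and the resulting logits would replace the made-up values used here.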