tshasan committed on
Commit 5093161 · verified · 1 Parent(s): 30012de

Create README.md

  1. README.md +31 -0
README.md ADDED
@@ -0,0 +1,31 @@
+ ---
+ license: cc-by-4.0
+ tags:
+ - multi-label-classification
+ - text-classification
+ - onnx
+ - web-classification
+ - firefox-ai
+ - preview
+ language:
+ - multilingual
+ datasets:
+ - tshasan/multi-label-web-classification
+ base_model: Alibaba-NLP/gte-modernbert-base
+ pipeline_tag: text-classification
+ ---
+
+ # modernBERT-URLTITLE-classifier-preview
+
+ ## Model Overview
+
+ This is a **preview version** of a multi-label web classification model fine-tuned from `Alibaba-NLP/gte-modernbert-base`. It assigns one or more categories to a website based on its URL and title. The model supports 11 labels: `Uncatergorized`, `News`, `Entertainment`, `Shop`, `Chat`, `Education`, `Government`, `Health`, `Technology`, `Work`, and `Travel`.
+
+ - **Developed by**: Taimur Hasan
+ - **Model Type**: Multi-label Text Classification
+ - **Status**: Preview (under active development)
+ ### Architecture
+ - **Fine-tuning**: Unfroze the last 4 encoder layers and the pooler
+ - **Problem Type**: Multi-label classification
+ - **Output Labels**: 11 (`News`, `Entertainment`, `Shop`, `Chat`, `Education`, `Government`, `Health`, `Technology`, `Work`, `Travel`, `Uncatergorized`)
+ - **Input Format**: Concatenated string: `"{url}:{title}"`
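
The input format and multi-label output described in this README can be illustrated with a small sketch. It is not the author's inference code: the helper names `build_input` and `decode_logits`, the 0.5 threshold, the label index order, and the logit values are all assumptions made for illustration. The real label order should be read from the model's own configuration.

```python
import math

# Label names copied from the README's "Output Labels" list.
# ASSUMPTION: the index order here is illustrative; check the model config.
LABELS = [
    "News", "Entertainment", "Shop", "Chat", "Education", "Government",
    "Health", "Technology", "Work", "Travel", "Uncatergorized",
]


def build_input(url: str, title: str) -> str:
    """Concatenate URL and title in the documented "{url}:{title}" format."""
    return f"{url}:{title}"


def decode_logits(logits, threshold=0.5):
    """Score each label independently with a sigmoid and keep those at or
    above the threshold (hypothetical helper; 0.5 is an assumed cutoff)."""
    if len(logits) != len(LABELS):
        raise ValueError(f"expected {len(LABELS)} logits, got {len(logits)}")
    probs = [1.0 / (1.0 + math.exp(-z)) for z in logits]
    return [label for label, p in zip(LABELS, probs) if p >= threshold]


text = build_input("https://example.com/news", "Example headline")

# Made-up logits standing in for a model's raw output for `text`.
made_up_logits = [2.0, -1.5, -3.0, -2.0, -1.0, -4.0, -2.5, 0.3, -1.2, -3.3, -5.0]
predicted = decode_logits(made_up_logits)
print(text)
print(predicted)
```

Because each label is scored independently, a page can receive several labels at once or none at all, which is what separates multi-label classification from a single softmax choice.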