Korean-PII-Masking-BERT
GitHub Repository: alphagyuu/Korean-PII-Masking-BERT
Korean-PII-Masking-BERT is a token classification model fine-tuned on KcBERTโs TokenClassifier using a processed version of "Korean SNS" dataset from AI-Hub.
๐ฅ๏ธ Python Implementation
Tokenizer:
BertTokenizer.from_pretrained('beomi/kcbert-base', do_lower_case=False)
Model:
TFBertForTokenClassification.from_pretrained('alphagyuu/Korean-PII-Masking-BertForTokenClassification', num_labels=len(tag2idx))
LabelMap:
LabelMAP = { 'O': 'LABEL0', 'B-URL': 'LABEL1', 'I-URL': 'LABEL2', 'B-๊ณ์ ': 'LABEL3', 'I-๊ณ์ ': 'LABEL4', 'B-๊ธ์ต': 'LABEL5', 'I-๊ธ์ต': 'LABEL6', 'B-๋ฒํธ': 'LABEL7', 'I-๋ฒํธ': 'LABEL8', 'B-์์': 'LABEL9', 'I-์์': 'LABEL10', 'B-์ ์': 'LABEL11', 'I-์ ์': 'LABEL12', 'B-์ด๋ฆ': 'LABEL13', 'I-์ด๋ฆ': 'LABEL14', 'B-์ฃผ์': 'LABEL15', 'I-์ฃผ์': 'LABEL16' }
- Downloads last month
- 36
Inference Providers
NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API:
The model has no library tag.
Model tree for alphagyuu/Korean-PII-Masking-BertForTokenClassification
Base model
beomi/kcbert-base