Model Card for pii-classifier-tab-dataset

Model is a Longformer with a classification head, finetuned on Text Anonymization Benchmark (TAB) dataset for indicating if a token is part of a Personal Identifiable Information (PII) and should be masked out or not. Model output is the logits of the input sequence, where the classes are 1 (MASK) or 0 (NO-MASK), e.g. no IOB format used.

Model is used as an example in LeakPro repo. For further detail, see example notebook.

Downloads last month: 30

Safetensors

Model size

149M params

Tensor type

F32

Inference Providers NEW

This model is not currently available via any of the supported Inference Providers.

The model cannot be deployed to the HF Inference API: The model has no library tag.

Model tree for LeakPro/pii-classifier-tab-dataset

Base model

allenai/longformer-base-4096

Finetuned

(95)

this model

LeakPro
/

pii-classifier-tab-dataset

Model Card for pii-classifier-tab-dataset

Model tree for LeakPro/pii-classifier-tab-dataset

Dataset used to train LeakPro/pii-classifier-tab-dataset