|
--- |
|
language: |
|
- nl |
|
base_model: |
|
- FacebookAI/xlm-roberta-base |
|
pipeline_tag: token-classification |
|
tags: |
|
- coreference |
|
- resolution |
|
- gender |
|
- neutral |
|
- pronouns |
|
- debiading |
|
- dutch |
|
- CDA |
|
--- |
|
This Dutch coreference resolution model is based on the [wl-coref](https://github.com/vdobrovolskii/wl-coref) model. The model was debiased through CDA, in order to improve its performance on gender-neutral pronouns and neopronouns. We used [XLM-RoBERTa-base](https://huggingface.co/FacebookAI/xlm-roberta-base) as our base model, fine-tuned the model on the [SoNaR-1 corpus](https://taalmaterialen.ivdnt.org/download/tstc-sonar-corpus/), and then further fine-tuned the model on a gender-neutral version of the this corpus for debiasing purposes. We used five different seeds during our experiment, and upload all five versions of the model. |
|
|
|
For more information, see our [Github repository](https://github.com/gvanboven/Transforming_Dutch). |
|
|
|
This model was created as part of our FAccT 2024 paper. Find our published paper at https://dl.acm.org/doi/10.1145/3630106.3659049 and arxiv paper at https://arxiv.org/abs/2405.00134. |
|
|
|
## Usage |
|
For usage instructions for the wl-coref model, see [their repo](https://github.com/vdobrovolskii/wl-coref). |
|
|
|
## Citation |
|
G. van Boven, Y. Du, D. Nguyen, _Transforming Dutch: Debiasing Dutch Coreference Resolution Systems for Non-binary Pronouns_. FAccT 2024. |
|
|
|
``` |
|
@inproceedings{boven-2024-transforming, |
|
title = "Transforming Dutch: Debiasing Dutch Coreference Resolution Systems for Non-binary Pronouns", |
|
author = "van Boven, Goya and Du, Yupei and Nguyen, Dong", |
|
booktitle = "Proceedings of the 2024 Conference on Fairness, Accountability, and Transparency", |
|
month = jun, |
|
year = "2024", |
|
address = "Online and Rio de Janeiro, Brazil", |
|
publisher = "Association for Computing Machinery" |
|
} |
|
``` |