jgvb99's picture
Update README.md
869ca75 verified
---
language:
- nl
base_model:
- FacebookAI/xlm-roberta-base
pipeline_tag: token-classification
tags:
- coreference
- resolution
- gender
- neutral
- pronouns
- debiading
- dutch
- CDA
---
This Dutch coreference resolution model is based on the [wl-coref](https://github.com/vdobrovolskii/wl-coref) model. The model was debiased through CDA, in order to improve its performance on gender-neutral pronouns and neopronouns. We used [XLM-RoBERTa-base](https://huggingface.co/FacebookAI/xlm-roberta-base) as our base model, fine-tuned the model on the [SoNaR-1 corpus](https://taalmaterialen.ivdnt.org/download/tstc-sonar-corpus/), and then further fine-tuned the model on a gender-neutral version of the this corpus for debiasing purposes. We used five different seeds during our experiment, and upload all five versions of the model.
For more information, see our [Github repository](https://github.com/gvanboven/Transforming_Dutch).
This model was created as part of our FAccT 2024 paper. Find our published paper at https://dl.acm.org/doi/10.1145/3630106.3659049 and arxiv paper at https://arxiv.org/abs/2405.00134.
## Usage
For usage instructions for the wl-coref model, see [their repo](https://github.com/vdobrovolskii/wl-coref).
## Citation
G. van Boven, Y. Du, D. Nguyen, _Transforming Dutch: Debiasing Dutch Coreference Resolution Systems for Non-binary Pronouns_. FAccT 2024.
```
@inproceedings{boven-2024-transforming,
title = "Transforming Dutch: Debiasing Dutch Coreference Resolution Systems for Non-binary Pronouns",
author = "van Boven, Goya and Du, Yupei and Nguyen, Dong",
booktitle = "Proceedings of the 2024 Conference on Fairness, Accountability, and Transparency",
month = jun,
year = "2024",
address = "Online and Rio de Janeiro, Brazil",
publisher = "Association for Computing Machinery"
}
```