microsoft
/

mdeberta-v3-base

Model card Files Files and versions

DeBERTa commited on Nov 19, 2021

Commit

1d31a10

·

1 Parent(s): 20d3ada

Update README.md

Files changed (1) hide show

README.md +12 -1

README.md CHANGED Viewed

@@ -70,6 +70,17 @@ python -m torch.distributed.launch --nproc_per_node=${num_gpus} \
 If you find DeBERTa useful for your work, please cite the following paper:
 ``` latex
 @inproceedings{
 he2021deberta,
@@ -79,4 +90,4 @@ booktitle={International Conference on Learning Representations},
 year={2021},
 url={https://openreview.net/forum?id=XPZIaotutsD}
 }
-```

 If you find DeBERTa useful for your work, please cite the following paper:
+``` latex
+@misc{he2021debertav3,
+      title={DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing},
+      author={Pengcheng He and Jianfeng Gao and Weizhu Chen},
+      year={2021},
+      eprint={2111.09543},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL}
+}
+```
 ``` latex
 @inproceedings{
 he2021deberta,
 year={2021},
 url={https://openreview.net/forum?id=XPZIaotutsD}
 }
+```