ICEF-NLP/bcms-bertic-comtext-sr-legal-ner-ekavica

Token Classification

vukbatanovic committed on Dec 25, 2024

Commit 1e34a4b · verified · 1 Parent(s): 19d4bf2

Update README.md

Files changed (1)

README.md +1 -1

@@ -24,7 +24,7 @@ This model was evaluated on the task of named entity recognition in Serbian lega
 The model uses a newly developed named entity schema consisting of 21 entity types, tailored for the domain of Serbian legal texts, and encoded according the the IOB2 standard.
 The full entity list is available on the [COMtext.SR GitHub repository](https://github.com/ICEF-NLP/COMtext.SR).
-This model was compared with [SrBERTa](http://huggingface.co/nemanjaPetrovic/SrBERTa), a model specially trained on Serbian legal texts, fine-tuned for 20 epochs for named entity recognition using the [COMtext.SR.legal](https://github.com/ICEF-NLP/COMtext.SR) corpus of legal texts. Token-level accuracy and F1 (macro-averaged and per-class) were used as evaluation metrics and gold tokenized text was taken as input.
+This model was compared with [SrBERTa](http://huggingface.co/nemanjaPetrovic/SrBERTa), a model specially trained on Serbian legal texts, fine-tuned for 20 epochs for named entity recognition using the Ekavian variant of the [COMtext.SR.legal](https://github.com/ICEF-NLP/COMtext.SR) corpus of legal texts. Token-level accuracy and F1 (macro-averaged and per-class) were used as evaluation metrics and gold tokenized text was taken as input.
 Two evaluation settings for both models were considered:
 * Default - only the entity type portion of the NE tag is considered, effectively ignoring the "B-" and "I-" prefixes
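
The "Default" setting described in the README text above can be illustrated with a minimal sketch: strip the IOB2 prefix from each tag, then score token by token. This is not the authors' evaluation script. The function names are hypothetical, the `PER` and `LOC` labels are placeholders rather than entries from the actual 21-type schema, and including the `O` class in the macro average is an assumption.

```python
from collections import defaultdict


def strip_prefix(tag):
    """Map 'B-PER' / 'I-PER' to 'PER'; leave 'O' unchanged."""
    return tag[2:] if tag.startswith(("B-", "I-")) else tag


def token_accuracy(gold, pred):
    """Fraction of tokens whose predicted label equals the gold label."""
    return sum(g == p for g, p in zip(gold, pred)) / len(gold)


def macro_f1(gold, pred):
    """Unweighted mean of per-class F1 over all labels seen in gold or pred.

    Assumption: the 'O' class is averaged like any other label.
    """
    tp, fp, fn = defaultdict(int), defaultdict(int), defaultdict(int)
    for g, p in zip(gold, pred):
        if g == p:
            tp[g] += 1
        else:
            fp[p] += 1
            fn[g] += 1
    labels = set(gold) | set(pred)
    # F1 = 2*TP / (2*TP + FP + FN); the denominator is never zero here
    # because every label occurs at least once in gold or pred.
    scores = [2 * tp[l] / (2 * tp[l] + fp[l] + fn[l]) for l in labels]
    return sum(scores) / len(scores)


# Placeholder tags, not taken from the COMtext.SR schema.
gold = ["B-PER", "I-PER", "O", "B-LOC"]
pred = ["B-PER", "B-PER", "O", "O"]

# Default setting: compare entity types only, ignoring "B-" / "I-".
gold_types = [strip_prefix(t) for t in gold]
pred_types = [strip_prefix(t) for t in pred]

default_accuracy = token_accuracy(gold_types, pred_types)
default_macro_f1 = macro_f1(gold_types, pred_types)
full_tag_accuracy = token_accuracy(gold, pred)
```

In this toy input the second token is predicted as `B-PER` instead of `I-PER`. That counts as correct once the prefixes are stripped but as an error when full tags are compared, so `default_accuracy` is higher than `full_tag_accuracy`.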