s-nlp
/

russian_toxicity_classifier

Text Classification

toxic comments classification

Model card Files Files and versions Community

dardem commited on Oct 4, 2024

Commit

aecf2fe

·

verified ·

1 Parent(s): 92d6598

Update README.md

Files changed (1) hide show

README.md +11 -2

README.md CHANGED Viewed

@@ -28,8 +28,8 @@ The metrics obtained from test dataset is as follows
 from transformers import BertTokenizer, BertForSequenceClassification
 # load tokenizer and model weights
-tokenizer = BertTokenizer.from_pretrained('SkolkovoInstitute/russian_toxicity_classifier')
-model = BertForSequenceClassification.from_pretrained('SkolkovoInstitute/russian_toxicity_classifier')
 # prepare the input
 batch = tokenizer.encode('ты супер', return_tensors='pt')
@@ -38,6 +38,15 @@ batch = tokenizer.encode('ты супер', return_tensors='pt')
 model(batch)
 ```
 ## Licensing Information

 from transformers import BertTokenizer, BertForSequenceClassification
 # load tokenizer and model weights
+tokenizer = BertTokenizer.from_pretrained('s-nlp/russian_toxicity_classifier')
+model = BertForSequenceClassification.from_pretrained('s-nlp/russian_toxicity_classifier')
 # prepare the input
 batch = tokenizer.encode('ты супер', return_tensors='pt')
 model(batch)
 ```
+## Citation
+```
+@article{dementieva2022russe,
+  title={RUSSE-2022: Findings of the First Russian Detoxification Shared Task Based on Parallel Corpora},
+  author={Dementieva, Daryna and Logacheva, Varvara and Nikishina, Irina and Fenogenova, Alena and Dale, David and Krotova, Irina and Semenov, Nikita and Shavrina, Tatiana and Panchenko, Alexander}
+}
+```
 ## Licensing Information