
alexneakameni/language_detection
Text Classification
•
Updated
•
235
a variety of pre-trained language identification models
Note BERT-based language detection model trained on hac541309/open-lid-dataset, which includes 121 million sentences across 200 languages. 24.5M params
Note fine-tuned version of xlm-roberta-base on the Language Identification dataset, 20 langs. 278M params
Note fasttext model, 217 langs
Note fasttext, 172 langs
Note staticvectors, extracted from fasttext model 32.6M params
Note staticvestors, extracted from fasttext model quantized 4.09M params
Note fasttext model, 2000 langs
Note fasttext model, 189 langs