---
license: apache-2.0
metrics:
- accuracy
- f1
- precision
- recall
pipeline_tag: text-classification
tags:
- language detection
- German
- English
- French
- Spanish
- GEFS
- Language detector
datasets:
- papluca/language-identification
language:
- de
- en
- fr
- es
---

# German, English, French and Spanish Language Detector

ImranzamanML/GEFS-language-detector is a language detection model fine-tuned from the base model [xlm-roberta-base](https://huggingface.co/xlm-roberta-base) on papluca's [Language Identification](https://huggingface.co/datasets/papluca/language-identification#additional-information) dataset.

## Supported languages

The model currently supports four languages for [Theum AG](https://theum.com/en/index.htm?t=).

![row01](https://youtu.be/TjQhobopnrA)

The model supports the following languages:

- German (de)
- English (en)
- Spanish (es)
- French (fr)

## Training Results

| Epoch | Training Loss | Validation Loss |
|-------|---------------|-----------------|
| 1     | 0.002600      | 0.000148        |
| 2     | 0.001000      | 0.000015        |
| 3     | 0.000000      | 0.000011        |
| 4     | 0.001800      | 0.000009        |
| 5     | 0.002700      | 0.000016        |
| 6     | 0.001600      | 0.000012        |
| 7     | 0.001300      | 0.000009        |
| 8     | 0.001200      | 0.000008        |
| 9     | 0.000900      | 0.000007        |
| 10    | 0.000900      | 0.000007        |

## Testing Results

| Language | Precision | Recall | F1     | Accuracy |
|----------|-----------|--------|--------|----------|
| de       | 0.9997    | 0.9998 | 0.9998 | 0.9999   |
| en       | 1.0000    | 1.0000 | 1.0000 | 1.0000   |
| fr       | 0.9995    | 0.9996 | 0.9996 | 0.9996   |
| es       | 0.9994    | 0.9996 | 0.9995 | 0.9996   |
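
## Example usage

The card declares `pipeline_tag: text-classification`, so the model can probably be loaded through the Hugging Face `transformers` text-classification pipeline. The following is a minimal sketch under that assumption, not an official snippet from the model author. The helper names (`load_classifier`, `detect_language`, `demo`) and the `SUPPORTED` mapping are hypothetical and introduced only for illustration. The exact label strings the model returns are not documented here, so the code falls back to the raw label when it is not one of the four ISO codes listed above.

```python
MODEL_ID = "ImranzamanML/GEFS-language-detector"

# Languages listed in this model card (illustrative mapping, not part of the model).
SUPPORTED = {"de": "German", "en": "English", "es": "Spanish", "fr": "French"}


def load_classifier():
    """Build a text-classification pipeline; downloads the weights on first use."""
    # Imported lazily so detect_language() can be used without transformers installed.
    from transformers import pipeline

    return pipeline("text-classification", model=MODEL_ID)


def detect_language(text, classifier):
    """Return the top prediction for `text` with a readable language name.

    `classifier` is any callable that returns a list of
    {"label": ..., "score": ...} dicts, as a transformers pipeline does.
    """
    top = classifier(text)[0]
    label = top["label"]
    # Assumption: labels are ISO codes such as "de"; otherwise keep the raw label.
    language = SUPPORTED.get(label.lower(), label)
    return {"label": label, "language": language, "score": float(top["score"])}


def demo():
    """Classify one sample sentence per supported language and print the results."""
    classifier = load_classifier()
    samples = [
        "Guten Morgen, wie geht es Ihnen?",
        "Good morning, how are you?",
        "Buenos días, ¿cómo está usted?",
        "Bonjour, comment allez-vous ?",
    ]
    for sentence in samples:
        print(sentence, "->", detect_language(sentence, classifier))
```

Calling `demo()` requires the `transformers` package with a backend such as PyTorch, plus network access to download the model. Because `detect_language` accepts any callable, it can also be exercised with a stub classifier when the model is not available.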