Takalani Sesame - Tswana πŸ‡ΏπŸ‡¦

Model description

Takalani Sesame (named after the South African version of Sesame Street) is a project that aims to promote the use of South African languages in NLP, and in particular look at techniques for low-resource languages to equalise performance with larger languages around the world.

Intended uses & limitations

How to use

from transformers import AutoTokenizer, AutoModelWithLMHead

tokenizer = AutoTokenizer.from_pretrained("jannesg/takalane_ssw_roberta")

model = AutoModelWithLMHead.from_pretrained("jannesg/takalane_ssw_roberta")

Limitations and bias

Updates will be added continously to improve performance.

Training data

Data collected from https://wortschatz.uni-leipzig.de/en
Sentences: 380

Training procedure

No preprocessing. Standard Huggingface hyperparameters.

Author

Jannes Germishuys website

Downloads last month
33
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.