Language Support#
Label Sleuth supports text data in more than 150 languages. To start the system with your chosen language, use the following command:
python -m label_sleuth.start_label_sleuth --language <YOUR_LANGUAGE>
where <YOUR_LANGUAGE>
is the name of the language from the list of supported languages below. Note that if the language name consists of multiple words, it should be enclosed in double quotes.
Note
Not every machine learning model is compatible with every language. For model-language compatibility, see here.
Supported languages#
Label Sleuth supports the following languages:
Language |
---|
Afrikaans |
Albanian |
Alemannic |
Amharic |
Arabic |
Aragonese |
Armenian |
Assamese |
Asturian |
Azerbaijani |
Bashkir |
Basque |
Bavarian |
Belarusian |
Bengali |
Bihari |
Bishnupriya Manipuri |
Bosnian |
Breton |
Bulgarian |
Burmese |
Catalan |
Cebuano |
Central Bicolano |
Chechen |
Chinese |
Chuvash |
Corsican |
Croatian |
Czech |
Danish |
Divehi |
Dutch |
Eastern Punjabi |
Egyptian Arabic |
Emilian-Romagnol |
English |
Erzya |
Esperanto |
Estonian |
Fiji Hindi |
Finnish |
French |
Galician |
Georgian |
German |
Goan Konkani |
Greek |
Gujarati |
Haitian |
Hebrew |
Hill Mari |
Hindi |
Hungarian |
Icelandic |
Ido |
Ilokano |
Indonesian |
Interlingua |
Irish |
Italian |
Japanese |
Javanese |
Kannada |
Kapampangan |
Kazakh |
Khmer |
Kirghiz |
Korean |
Kurdish (Kurmanji) |
Kurdish (Sorani) |
Latin |
Latvian |
Limburgish |
Lithuanian |
Lombard |
Low Saxon |
Luxembourgish |
Macedonian |
Maithili |
Malagasy |
Malay |
Malayalam |
Maltese |
Manx |
Marathi |
Mazandarani |
Meadow Mari |
Minangkabau |
Mingrelian |
Mirandese |
Mongolian |
Nahuatl |
Neapolitan |
Nepali |
Newar |
North Frisian |
Northern Sotho |
Norwegian (Bokmål) |
Norwegian (Nynorsk) |
Occitan |
Oriya |
Ossetian |
Palatinate German |
Pashto |
Persian |
Piedmontese |
Polish |
Portuguese |
Quechua |
Romanian |
Romansh |
Russian |
Sakha |
Sanskrit |
Sardinian |
Scots |
Scottish Gaelic |
Serbian |
Serbo-Croatian |
Sicilian |
Sindhi |
Sinhalese |
Slovak |
Slovenian |
Somali |
Southern Azerbaijani |
Spanish |
Sundanese |
Swahili |
Swedish |
Tagalog |
Tajik |
Tamil |
Tatar |
Telugu |
Thai |
Tibetan |
Turkish |
Turkmen |
Ukrainian |
Upper Sorbian |
Urdu |
Uyghur |
Uzbek |
Venetian |
Vietnamese |
Volapük |
Walloon |
Waray |
Welsh |
West Flemish |
West Frisian |
Western Punjabi |
Yiddish |
Yoruba |
Zazaki |
Zeelandic |