Datasets for Different Languages
Subcategories
African Languages
Languages in the Americas
Asian Languages
European Languages
Languages in the Middle East
Languages of the Pacific Islands and Nations
Keywords
Language Keywords for the Geographic Regions
African Languages
Keywords
Afar Afrikaans Akan Amharic Baatonum Bambara Bemba (zambia) Ber Birwa Central kanuri Chichewa Chokwe Cwi bwamu Dagbani Dinka Dyula Egyptian (ancient) Fanti Fulah Ganda Ghomálá’ Hausa Herero Igbo Kabiyè Kabuverdianu Kabyle Kachin Kamba (kenya) Kanuri Kikuyu Kimbundu Kinyarwanda Ko Kongo Koyraboro senni songhai Kutu Kwere Lingala Luba Katanga Luba Lulua Luo (kenya and tanzania) Makhuwa Makonde Malagasy Mamara senoufo Mossi N’ko Ndonga Nigerian fulfulde Nigerian pidgin Ndebele Nuer Nyankole Oromo Plateau malagasy Rundi Sango Sar Seselwa creole french Shona Somali Suba Swahili Susu Swati Tachelhit Tamasheq Tamazight Tigrigna Tsonga Setswana Tumbuka Umbundu Venda West central oromo Wolaytta Wolof Xhosa Yoruba Zulu
Languages in the Americas
Keywords
Achuar Shiwiar Algonquin Arabela Asháninka Aymara Bora Candoshi Shapra Caquinte Caribbean hindustani Cashibo Cacataibo Cashinahua Central aymara Central bikol Central bontok Central mazahua Chachi Chayahuita Cherokee Chimborazo highland quichua Chácobo Cofán Cree Eastern huasteca nahuatl Eastern maroon creole Galibi carib Garifuna Guarani Haitian Highland puebla nahuatl Huastec Huichol Imbabura highland quichua Inuktitut Inupiaq Ixil Jamaican creole english K’iche’ Kalaallisut Kaqchikel Kekchí Mapudungun Mezquital otomi Mi’kmaq Murui huitoto Navajo Ngäbere Nomatsiguenga Orpo Quechua Papantla totonac Purepecha Saramaccan Sharanahua Shipibo Conibo Shuar Siona Sirionó Sranan tongo Ticuna Tzotzil Wayuu Yine Yosondúa mixtec Yucateco Zapotec
Asian Languages
Keywords
Abkhaz Altai Amis Angika Armenian Assamese Avaric Awadhi Azerbaijani Balinese Balochi Bashkir Bengali Bishnupriya Bodo (india) Bolinao Burmese Carpathian romani Cebuano Central kurdish Chechen Chhattisgarhi Chinese Chuvash Crimean tatar Dari Dhivehi Dimli (individual language) Divehi Dogri (macrolanguage) Dzongkha Eastern tamang Erzya Farsi Filipino Gilaki Goan konkani Gujarati Hakha chin Hakka chinese Halh mongolian Hindi Hinglish Iloko Ingush Iranian persian Japanese Kalmyk Kankanaey Kannada Karachay Balkar Karelian Kashmiri Kazakh Khmer Kirghiz Kirmanjki (individual language) Komi Korean Kurdish Kyrgyz Lak Lao Lezghian Limbu Lushai Magahi Maithili Malay Malayalam Manipuri Mansi Marathi Mari (russia) Mazanderani Mingrelian Mongolian Nepali Nepali (individual language) Newari North azerbaijani Northern kurdish Northern uzbek Odia Oriya Oriya (macrolanguage) Ossetian Pampanga Pangasinan Panjabi Pashto Persian Russia buriat Sanskrit Santali Saraiki Sediq Shan Sindhi Sinhala South azerbaijani Tagalog Tajik Tamil Tatar Telugu Thai Tibetan Turkish Turkmen Tuvan Udmurt Uyghur Uzbek Vietnamese Waray (philippines) Western bukidnon manobo Yakut
European Languages
Keywords
Adyghe Albanian Aragonese Arpitan Asturian Basque Bavarian Belarusian Bosnian Breton Bulgarian Catalan Cornish Corsican Croatian Czech Danish Dutch English Esperanto Estonian Faroese Finnish French Frisian Friulian Gagauz Galician German Georgian Greek Hungarian Icelandic Ido Irish Italian Kashubian Kölsch Ladin Ladino Latgalian Latin Latvian Ligurian Limburgan Limburgish Lithuanian Liv Livvi Lombard Low german Lower sorbian Luxembourgish Macedonian Maltese Manx Mari Mirandese Moksha Neapolitan Northern sami Norwegian Occitan Occitan (post 1500) Old church slavonic Old english (ca. 450 1100) Old norse Picard Piemontese Polish Portuguese Romanian Romansh Romany Russian Spanish Sardinian Saterfriesisch Scots Scottish gaelic Serbian Serbo Croatian Sicilian Silesian Slovak Slovenian Swedish Turkish Ukrainian Upper sorbian Venetian Vlax romani Volapük Walloon Welsh Yiddish
Languages in the Middle East
Keywords
Arabic Akkadian Ancient hebrew Assyrian neo Aramaic Hebrew
Languages of the Pacific Islands and Nations
Keywords
Ambulas Banjar Batak toba Benabena Betawi Bhojpuri Bine Bislama Buginese Bunama Burarra Chamorro Chuukese Dhao Doromu Koki Fiji hindi Fijian Halia Hawaiian Highland popoluca Hiri motu Iamalele Indonesian Javanese Kriol Kto Kâte Madurese Makasar Maori Marshallese Mende (papua new guinea) Minangkabau Mountain koiali Musi Muyuw Nauru Ngaju Nias Novial Pele Ata Pohnpeian Pular Rejang Samoan Sinaugoro Somba Siawari Sundanese Tahitian Tetun dili Tok pisin Tonga Warlpiri West kewa Yapese Yele