Link Search Menu Expand Document

Datasets for Different Languages

Datasets with different human languages.

Subcategories

African Languages Languages in the Americas Asian Languages European Languages Languages in the Middle East Languages of the Pacific Islands and Nations

Keywords

Language Keywords for the Geographic Regions

African Languages

Ancient and modern languages in Africa.

Subcategories

Keywords

Afar Afrikaans Akan Amharic Baatonum Bambara Bemba (zambia) Ber Birwa Central kanuri Chichewa Chokwe Cwi bwamu Dagbani Dinka Dyula Egyptian (ancient) Fanti Fulah Ganda Ghomálá’ Hausa Herero Igbo Kabiyè Kabuverdianu Kabyle Kachin Kamba (kenya) Kanuri Kikuyu Kimbundu Kinyarwanda Ko Kongo Koyraboro senni songhai Kutu Kwere Lingala Luba Katanga Luba Lulua Luo (kenya and tanzania) Makhuwa Makonde Malagasy Mamara senoufo Mossi N’ko Ndonga Nigerian fulfulde Nigerian pidgin Ndebele Nuer Nyankole Oromo Plateau malagasy Rundi Sango Sar Seselwa creole french Shona Somali Suba Swahili Susu Swati Tachelhit Tamasheq Tamazight Tigrigna Tsonga Setswana Tumbuka Umbundu Venda West central oromo Wolaytta Wolof Xhosa Yoruba Zulu

Languages in the Americas

Ancient and modern languages in the Americas.

Subcategories

Keywords

Achuar Shiwiar Algonquin Arabela Asháninka Aymara Bora Candoshi Shapra Caquinte Caribbean hindustani Cashibo Cacataibo Cashinahua Central aymara Central bikol Central bontok Central mazahua Chachi Chayahuita Cherokee Chimborazo highland quichua Chácobo Cofán Cree Eastern huasteca nahuatl Eastern maroon creole Galibi carib Garifuna Guarani Haitian Highland puebla nahuatl Huastec Huichol Imbabura highland quichua Inuktitut Inupiaq Ixil Jamaican creole english K’iche’ Kalaallisut Kaqchikel Kekchí Mapudungun Mezquital otomi Mi’kmaq Murui huitoto Navajo Ngäbere Nomatsiguenga Orpo Quechua Papantla totonac Purepecha Saramaccan Sharanahua Shipibo Conibo Shuar Siona Sirionó Sranan tongo Ticuna Tzotzil Wayuu Yine Yosondúa mixtec Yucateco Zapotec

Asian Languages

Ancient and modern languages in Asia.

Subcategories

Keywords

Abkhaz Altai Amis Angika Armenian Assamese Avaric Awadhi Azerbaijani Balinese Balochi Bashkir Bengali Bishnupriya Bodo (india) Bolinao Burmese Carpathian romani Cebuano Central kurdish Chechen Chhattisgarhi Chinese Chuvash Crimean tatar Dari Dhivehi Dimli (individual language) Divehi Dogri (macrolanguage) Dzongkha Eastern tamang Erzya Farsi Filipino Gilaki Goan konkani Gujarati Hakha chin Hakka chinese Halh mongolian Hindi Hinglish Iloko Ingush Iranian persian Japanese Kalmyk Kankanaey Kannada Karachay Balkar Karelian Kashmiri Kazakh Khmer Kirghiz Kirmanjki (individual language) Komi Korean Kurdish Kyrgyz Lak Lao Lezghian Limbu Lushai Magahi Maithili Malay Malayalam Manipuri Mansi Marathi Mari (russia) Mazanderani Mingrelian Mongolian Nepali Nepali (individual language) Newari North azerbaijani Northern kurdish Northern uzbek Odia Oriya Oriya (macrolanguage) Ossetian Pampanga Pangasinan Panjabi Pashto Persian Russia buriat Sanskrit Santali Saraiki Sediq Shan Sindhi Sinhala South azerbaijani Tagalog Tajik Tamil Tatar Telugu Thai Tibetan Turkish Turkmen Tuvan Udmurt Uyghur Uzbek Vietnamese Waray (philippines) Western bukidnon manobo Yakut

European Languages

Ancient and modern languages in Europe.

Subcategories

Keywords

Adyghe Albanian Aragonese Arpitan Asturian Basque Bavarian Belarusian Bosnian Breton Bulgarian Catalan Cornish Corsican Croatian Czech Danish Dutch English Esperanto Estonian Faroese Finnish French Frisian Friulian Gagauz Galician German Georgian Greek Hungarian Icelandic Ido Irish Italian Kashubian Kölsch Ladin Ladino Latgalian Latin Latvian Ligurian Limburgan Limburgish Lithuanian Liv Livvi Lombard Low german Lower sorbian Luxembourgish Macedonian Maltese Manx Mari Mirandese Moksha Neapolitan Northern sami Norwegian Occitan Occitan (post 1500) Old church slavonic Old english (ca. 450 1100) Old norse Picard Piemontese Polish Portuguese Romanian Romansh, Romany Russian Spanish Sardinian Saterfriesisch Scots Scottish gaelic Serbian Serbo Croatian Sicilian Silesian Slovak Slovenian Swedish Turkish Ukrainian Upper sorbian Venetian Vlax romani Volapük Walloon Welsh Yiddish

Languages in the Middle East

Ancient and modern languages from the Middle East.

Subcategories

Keywords

Arabic Akkadian Ancient hebrew Assyrian neo Aramaic Hebrew

Languages of the Pacific Islands and Nations

Ancient and modern languages in the Pacific islands, Australia, and New Zealand.

Subcategories

Keywords

Ambulas Banjar Batak toba Benabena Betawi Bhojpuri Bine Bislama Buginese Bunama Burarra Chamorro Chuukese Dhao Doromu Koki Fiji hindi Fijian Halia Hawaiian Highland popoluca Hiri motu Iamalele Indonesian Javanese Kriol Kto Kâte Madurese Makasar Maori Marshallese Mende (papua new guinea) Minangkabau Mountain koiali Musi Muyuw Nauru Ngaju Nias Novial Pele Ata Pohnpeian Pular Rejang Samoan Sinaugoro Somba Siawari Sundanese Tahitian Tetun dili Tok pisin Tonga Warlpiri West kewa Yapese Yele