How can our language datasets enhance your products?

Available languages

We offer flexible, curated datasets for 55 of the world’s major languages in off-the-shelf packages and bespoke bundles tailored to your individual requirements.

Get in touch for more information on our available languages and to discuss how our language datasets can enhance your products.

LanguagesMonoligualBilingualBilingualizedWordlistCorpus output
Afrikaansxx
Amharicxx
Arabicxx
Bengalixxx
Catalanx
Chinesex
Chinese (simplified)xx
Chinese (traditional)xx
Danishxxx
Dutchxx
Englishxxx
English (Aus)x
English (NZ)x
English (UK)xx
Farsixx
Finnishx
Frenchxxxx
Germanx
Greekx
Hebrewxxxx
Hindixxxx
Hungarianx
Igbox
Indonesianx
Italianxx
Japanesex
Kannadax
Koreanxx
Latinx
Latvianxx
Malayx
Malayalamxx
Marathixx
Northern Sothox
Norwegianxx
Persianx
Polishx
Portuguesexxx
Portuguese (Brazilian)x
Punjabixxx
Romanianxx
Russianxxxx
Scottish Gaelicx
Serbianx
Spanishxx
Swahilix
Swedishxxx
Tamilx
Telugux
Thaixxxx
Turkishxxx
Urdux
Vietnamesexx
Xhosax
Zuluxxx

Want to know more about our available languages?