Your cart is empty!
Here the term Resources refers to a set of speech or language data and descriptions in machine readable form, for the purpose of building, improving or evaluating natural language and speech algorithms or systems.
Under the Indo-Wordnet Consortium project, led by IIT Bombay, Gujarati Wordnet's Synsets (synonym set) has been developed. For each synset a POS categ..
Available Under License: Commercial Research
Under the ‘Development of Pronunciation Lexicon, Based on Experimental Study Of Phonetics And Phonemic Of Indian Languages’ project initiated by the M..
It is a voice data collected for building HTS based statistical speech synthesis for Assamese language under the project developing text-to-speech (TT..
Available Under License: CC BY-SA 2.0
Under the Indian Languages Speech Resources Development for Speech Applications project initiated by the MeitY, Govt. of India, Speech Consortium..
Bengali tree bank data is in Shakti Standard Format (SSF). SSF is a common representation for data. SSF allows information in a sentence to be represe..
It is a voice data collected for building HTS based statistical speech synthesis for Bengali language under the project developing text-to-speech (TTS..
This is a monolingual aligned corpus developed for Agriculture domain under English to Indian Language Machine Translation (EILMT) Consortium. Support..
This is a monolingual aligned corpus developed for Health domain under English to Indian Language Machine Translation (EILMT) Consortium. Supported te..
This is a monolingual aligned corpus developed for Tourism domain under English to Indian Language Machine Translation (EILMT) Consortium. Supported t..
English-Bangla Agriculture Parallel Text corpus is developed in Unicode under English to Indian Language Machine Translation (EILMT) Consort..
English-Bangla Parallel Health Text corpus is developed in Unicode under English to Indian Language Machine Translation (EILMT) Consortium. This corpu..
English-Bangla Parallel Tourism Text corpus is developed in Unicode under English to Indian Language Machine Translation (EILMT) consortium. The ..