Resources

Here, the term Resources refers to a set of speech or language data and descriptions in machine-readable form, used for building, improving, or evaluating natural language and speech algorithms or systems.

Gujarati Wordnet

Under the Indo-Wordnet Consortium project, led by IIT Bombay, the synsets (synonym sets) of the Gujarati Wordnet have been developed. For each synset a POS categ..

Available Under License:
Commercial, Research

Sample Download | size: 65.6KB | type: rar

Added on : (date not recorded)

Assamese Pronunciation Lexicon Dictionary

Under the ‘Development of Pronunciation Lexicon, Based on Experimental Study Of Phonetics And Phonemic Of Indian Languages’ project initiated by the M..

Available Under License:
Commercial, Research

Sample Download | size: 9.3MB | type: zip

Added on : 18 Jul 2019

Assamese Voice Data Female - ILTTS

Voice data collected for building HTS-based statistical speech synthesis for the Assamese language under the project developing text-to-speech (TT..

Available Under License:
CC BY-SA 2.0  

Sample Download | size: 22.3MB | type: 7z

Added on : 13 Aug 2019

Assamese Voice Data Male - ILTTS

Voice data collected for building HTS-based statistical speech synthesis for the Assamese language under the project developing text-to-speech (TT..

Available Under License:
CC BY-SA 2.0  

Sample Download | size: 22.2MB | type: 7z

Added on : 13 Aug 2019

Bengali Speech Corpus ILSRD

Under the Indian Languages Speech Resources Development for Speech Applications project initiated by MeitY, Govt. of India, Speech Consortium..

Available Under License:
Commercial, Research

Sample Download | size: 54.6MB | type: 7z

Added on : 23 Aug 2019

Bengali Treebank IIITH

The Bengali treebank data is in Shakti Standard Format (SSF), a common representation for data. SSF allows information in a sentence to be represe..

Available Under License:
Commercial, Research

Sample Download | size: 651.7KB | type: zip

Added on : 02 Aug 2019

Bengali Voice Data Female - ILTTS

Voice data collected for building HTS-based statistical speech synthesis for the Bengali language under the project developing text-to-speech (TTS..

Available Under License:
CC BY-SA 2.0  

Sample Download | size: 28.7MB | type: 7z

Added on : 07 Aug 2019

Bengali Voice Data Male - ILTTS

Voice data collected for building HTS-based statistical speech synthesis for the Bengali language under the project developing text-to-speech (TTS..

Available Under License:
CC BY-SA 2.0  

Sample Download | size: 34.1MB | type: 7z

Added on : 07 Aug 2019

Bodo Voice Data Female - ILTTS

Voice data collected for building HTS-based statistical speech synthesis for the Bodo language under the project developing text-to-speech (TTS..

Available Under License:
CC BY-SA 2.0  

Sample Download | size: 65.1MB | type: 7z

Added on : 07 Aug 2019

English Agriculture Monolingual Text-Corpus -EILMT

This is a monolingual aligned corpus developed for the Agriculture domain under the English to Indian Language Machine Translation (EILMT) Consortium. Support..

Available Under License:
Commercial, Research

Sample Download | size: 15KB | type: zip

Added on : 16 Jul 2020

English Health Monolingual Text Corpus -EILMT

This is a monolingual aligned corpus developed for the Health domain under the English to Indian Language Machine Translation (EILMT) Consortium. Supported te..

Available Under License:
Commercial, Research

Sample Download | size: 13.7KB | type: zip

Added on : 16 Jul 2020

English Tourism Monolingual Text Corpus -EILMT

This is a monolingual aligned corpus developed for the Tourism domain under the English to Indian Language Machine Translation (EILMT) Consortium. Supported t..

Available Under License:
Commercial, Research

Sample Download | size: 18.1KB | type: zip

Added on : 16 Jul 2020

English-Bangla Agriculture Parallel Text corpus-EILMT

The English-Bangla Agriculture Parallel Text corpus is developed in Unicode under the English to Indian Language Machine Translation (EILMT) Consort..

Available Under License:
Commercial, Research

Sample Download | size: 23KB | type: zip

Added on : 20 Jul 2020

English-Bangla Health Parallel Text corpus-EILMT

The English-Bangla Parallel Health Text corpus is developed in Unicode under the English to Indian Language Machine Translation (EILMT) Consortium. This corpu..

Available Under License:
Commercial, Research

Sample Download | size: 17.8KB | type: zip

Added on : 20 Jul 2020

English-Bangla Tourism Set - I Parallel Text corpus-EILMT

The English-Bangla Parallel Tourism Text corpus is developed in Unicode under the English to Indian Language Machine Translation (EILMT) Consortium. The ..

Available Under License:
Commercial, Research

Sample Download | size: 30.2KB | type: zip

Added on : 20 Jul 2020

Disclaimer: The information provided on this page has been procured through different sources. Please write back to us at nplt_support[at]cdac[dot]in in case you would like to suggest an update.