Resources

Here the term Resources refers to a set of speech or language data and descriptions in machine readable form, for the purpose of building, improving or evaluating natural language and speech algorithms or systems.

Refine Search


Bengali Treebank IIITH

Bengali Treebank IIITH

Bengali tree bank data is in Shakti Standard Format (SSF). SSF is a common representation for data. SSF allows information in a sentence to be represe..

Available Under License:
Commercial   Research  

Sample Download | size: 651.7KB | type: zip

Added on : 02 Aug 2019

Kannada Treebank IIITH

Kannada Treebank IIITH

Kannada tree bank data is in Shakti Standard Format (SSF). SSF is a common representation for data. SSF allows information in a sentence to be represe..

Available Under License:
Commercial   Research  

Sample Download | size: 629.5KB | type: rar

Added on : 02 Aug 2019

Gujarati Voice Data Male - ILTTS

Gujarati Voice Data Male - ILTTS

It is a voice data collected for building HTS based statistical speech synthesis for Gujarati language under the project developing text-to-speech (TT..

Available Under License:
CC BY-SA 2.0  

Sample Download | size: 56.7MB | type: 7z

Added on : 02 Aug 2019

Gujarati Voice Data Female - ILTTS

Gujarati Voice Data Female - ILTTS

It is a voice data collected for building HTS based statistical speech synthesis for Gujarati language under the project developing text-to-speech (TT..

Available Under License:
CC BY-SA 2.0  

Sample Download | size: 62.5MB | type: 7z

Added on : 02 Aug 2019

Malayalam Treebank IIITH

Malayalam Treebank IIITH

Malayalam treebank data is in Shakti Standard Format (SSF). SSF is a common representation for data. SSF allows information in a sentence to be repres..

Available Under License:
Commercial   Research  

Sample Download | size: 169.9KB | type: zip

Added on : 01 Aug 2019

Punjabi Pronunciation Lexicon Dictionary

Punjabi Pronunciation Lexicon Dictionary

Under the ‘Development of Pronunciation Lexicon, Based on Experimental Study Of Phonetics And Phonemic Of Indian Languages’ project initiated by the M..

Available Under License:
Commercial   Research  

Sample Download | size: 7.4MB | type: 7z

Added on : 24 Jul 2019

Manipuri Pronunciation Lexicon Dictionary

Manipuri Pronunciation Lexicon Dictionary

Under the ‘Development of Pronunciation Lexicon, Based on Experimental Study Of Phonetics And Phonemic Of Indian Languages’ project initiated by the M..

Available Under License:
Commercial   Research  

Sample Download | size: 5.4MB | type: 7z

Added on : 24 Jul 2019

Assamese Pronunciation Lexicon Dictionary

Assamese Pronunciation Lexicon Dictionary

Under the ‘Development of Pronunciation Lexicon, Based on Experimental Study Of Phonetics And Phonemic Of Indian Languages’ project initiated by the M..

Available Under License:
Commercial   Research  

Sample Download | size: 9.3MB | type: zip

Added on : 18 Jul 2019

Marathi Pronunciation Lexicon Dictionary

Marathi Pronunciation Lexicon Dictionary

Under the ‘Development of Pronunciation Lexicon, Based on Experimental Study Of Phonetics And Phonemic Of Indian Languages’ project initiated by the M..

Available Under License:
Commercial   Research  

Sample Download | size: 9.6MB | type: zip

Added on : 17 Jul 2019

Hindi - Boro Parallel Chunked Text Corpus ILCI

Hindi - Boro Parallel Chunked Text Corpus ILCI

Under the Indian Languages Corpora Initiative (ILCI) project, ILCI Consortia led by Jawaharlal Nehru University, New Delhi has created parallel corpus..

Available Under License:
Commercial   Research  

Sample Download | size: 539.2KB | type: rar

Added on : 17 Jul 2019

Hindi - Urdu Parallel Chunked Text Corpus ILCI

Hindi - Urdu Parallel Chunked Text Corpus ILCI

Under the Indian Languages Corpora Initiative (ILCI) project, ILCI Consortia led by Jawaharlal Nehru University, New Delhi has created parallel corpus..

Available Under License:
Commercial   Research  

Sample Download | size: 450.4KB | type: rar

Added on : 17 Jul 2019

Odia Wordnet

Odia Wordnet

Under the Indo-Wordnet Consortium project, led by IIT Bombay, Odia Wordnet's Synsets (synonym set) has been developed. For each synset a POS category,..

Available Under License:
Commercial   Research  

Sample Download | size: 60.6KB | type: rar

Added on : 17 Jul 2019

Hindi - Telugu Parallel Chunked Text Corpus ILCI

Hindi - Telugu Parallel Chunked Text Corpus ILCI

Under the Indian Languages Corpora Initiative (ILCI) project, ILCI Consortia led by Jawaharlal Nehru University, New Delhi has created parallel corpus..

Available Under License:
Commercial   Research  

Sample Download | size: 380KB | type: rar

Added on : 17 Jul 2019

Hindi - Bengali Parallel Chunked Text Corpus ILCI

Hindi - Bengali Parallel Chunked Text Corpus ILCI

 Under the Indian Languages Corpora Initiative (ILCI) project, ILCI Consortia led by Jawaharlal Nehru University, New Delhi has created parallel ..

Available Under License:
Commercial   Research  

Sample Download | size: 902.8KB | type: rar

Added on : 17 Jul 2019

Konkani Wordnet

Konkani Wordnet

Under the Indo-Wordnet Consortium project, led by IIT Bombay, Konkani Wordnet's Synsets (synonym set) has been developed.  For each synset a POS ..

Available Under License:
Commercial   Research  

Sample Download | size: 56KB | type: rar

Added on : 17 Jul 2019

Hindi - Marathi Parallel Chunked Text Corpus ILCI

Hindi - Marathi Parallel Chunked Text Corpus ILCI

Under the Indian Languages Corpora Initiative (ILCI) project, ILCI Consortia led by Jawaharlal Nehru University, New Delhi has created parallel corpus..

Available Under License:
Commercial   Research  

Sample Download | size: 445.5KB | type: rar

Added on : 17 Jul 2019

Kashmiri Wordnet

Kashmiri Wordnet

Under the Indo-Wordnet Consortium project, led by IIT Bombay, Kashmiri Wordnet's Synsets (synonym set) has been developed. For each synset a POS categ..

Available Under License:
Commercial   Research  

Sample Download | size: 57.5KB | type: rar

Added on : 17 Jul 2019

Hindi - Konkani Parallel Chunked Text Corpus ILCI

Hindi - Konkani Parallel Chunked Text Corpus ILCI

Under the Indian Languages Corpora Initiative (ILCI) project, ILCI Consortia led by Jawaharlal Nehru University, New Delhi has created parallel corpus..

Available Under License:
Commercial   Research  

Sample Download | size: 455.3KB | type: rar

Added on : 17 Jul 2019

Hindi - Kannada Parallel Chunked Text Corpus ILCI

Hindi - Kannada Parallel Chunked Text Corpus ILCI

Under the Indian Languages Corpora Initiative (ILCI) project, ILCI Consortia led by Jawaharlal Nehru University, New Delhi has created parallel corpus..

Available Under License:
Commercial   Research  

Sample Download | size: 451.6KB | type: rar

Added on : 17 Jul 2019

Hindi - Gujarati Parallel Chunked Text Corpus ILCI

Hindi - Gujarati Parallel Chunked Text Corpus ILCI

Under the Indian Languages Corpora Initiative (ILCI) project, ILCI Consortia led by Jawaharlal Nehru University, New Delhi has created parallel corpus..

Available Under License:
Commercial   Research  

Sample Download | size: 450.2KB | type: rar

Added on : 17 Jul 2019

Hindi - English Parallel Chunked Text Corpus ILCI

Hindi - English Parallel Chunked Text Corpus ILCI

Under the Indian Languages Corpora Initiative (ILCI) project, ILCI Consortia led by Jawaharlal Nehru University, New Delhi has created parallel corpus..

Available Under License:
Commercial   Research  

Sample Download | size: 431.4KB | type: rar

Added on : 17 Jul 2019

Hindi - Assamese Parallel Chunked Text Corpus ILCI

Hindi - Assamese Parallel Chunked Text Corpus ILCI

Under the Indian Languages Corpora Initiative (ILCI) project, ILCI Consortia led by Jawaharlal Nehru University, New Delhi has created parallel corpus..

Available Under License:
Commercial   Research  

Sample Download | size: 460.5KB | type: rar

Added on : 16 Jul 2019

Hindi - Tamil Parallel POS Tagged Text Corpus

Hindi - Tamil Parallel POS Tagged Text Corpus

Under the Indian Languages Corpora Initiative (ILCI) project, ILCI Consortia led by Jawaharlal Nehru University, New Delhi has created parallel corpus..

Available Under License:
Commercial   Research  

Sample Download | size: 256.9KB | type: rar

Added on : 16 Jul 2019

Hindi - Punjabi Parallel POS Tagged Text Corpus ILCI

Hindi - Punjabi Parallel POS Tagged Text Corpus ILCI

Under the Indian Languages Corpora Initiative (ILCI) project, ILCI Consortia led by Jawaharlal Nehru University, New Delhi has created parallel corpus..

Available Under License:
Commercial   Research  

Sample Download | size: 263.9KB | type: rar

Added on : 16 Jul 2019

Indian English  Speech Corpus ILSRD

Indian English Speech Corpus ILSRD

Under the Indian Languages Speech Resources Development for Speech Applications project initiated by the MeitY, Govt. of India, Speech Consortium led ..

Available Under License:
Commercial   Research  

Sample Download | size: 33.3MB | type: 7z

Added on : 16 Jul 2019

Hindi Speech Corpus ILSRD

Hindi Speech Corpus ILSRD

Under the Indian Languages Speech Resources Development for Speech Applications project initiated by the MeitY, Govt. of India, Speech Consortium led ..

Available Under License:
Commercial   Research  

Sample Download | size: 39.9MB | type: 7z

Added on : 16 Jul 2019

Hindi - Nepali Parallel POS Tagged Text Corpus ILCI

Hindi - Nepali Parallel POS Tagged Text Corpus ILCI

 Under the Indian Languages Corpora Initiative (ILCI) project, ILCI Consortia led by Jawaharlal Nehru University, New Delhi has created parallel ..

Available Under License:
Commercial   Research  

Sample Download | size: 258.5KB | type: rar

Added on : 15 Jul 2019

Hindi - Malayalam Parallel POS Tagged Text Corpus ILCI

Hindi - Malayalam Parallel POS Tagged Text Corpus ILCI

Under the Indian Languages Corpora Initiative (ILCI) project initiated by the MeitY, Govt. of India, ILCI Consortia led by Jawaharlal Nehru University..

Available Under License:
Commercial   Research  

Sample Download | size: 269.5KB | type: rar

Added on : 15 Jul 2019

Hindi - Telugu Parallel POS Tagged Text Corpus ILCI

Hindi - Telugu Parallel POS Tagged Text Corpus ILCI

Under the Indian Languages Corpora Initiative (ILCI) project, ILCI Consortia led by Jawaharlal Nehru University, New Delhi has created parallel corpus..

Available Under License:
Commercial   Research  

Sample Download | size: 261.5KB | type: rar

Added on : 15 Jul 2019

Urdu Wordnet

Urdu Wordnet

Under the Indo-Wordnet Consortium project, led by IIT Bombay, Urdu Wordnet's Synsets (synonym set) has been developed. For each synset a POS category,..

Available Under License:
Commercial   Research  

Sample Download | size: 59.1KB | type: rar

Added on : 15 Jul 2019

Hindi – Urdu Parallel POS Tagged Text Corpus ILCI

Hindi – Urdu Parallel POS Tagged Text Corpus ILCI

Under the Indian Languages Corpora Initiative (ILCI) project, ILCI Consortia led by Jawaharlal Nehru University, New Delhi has created parallel corpus..

Available Under License:
Commercial   Research  

Sample Download | size: 252.2KB | type: rar

Added on : 15 Jul 2019

Hindi – Marathi Parallel POS Tagged Text Corpus ILCI

Hindi – Marathi Parallel POS Tagged Text Corpus ILCI

Under the Indian Languages Corpora Initiative (ILCI) project, ILCI Consortia led by Jawaharlal Nehru University, New Delhi has created parallel corpus..

Available Under License:
Commercial   Research  

Sample Download | size: 265.8KB | type: rar

Added on : 15 Jul 2019

Hindi – Konkani Parallel POS Tagged Text Corpus ILCI

Hindi – Konkani Parallel POS Tagged Text Corpus ILCI

Under the Indian Languages Corpora Initiative (ILCI) project, ILCI Consortia led by Jawaharlal Nehru University, New Delhi has created parallel corpus..

Available Under License:
Commercial   Research  

Sample Download | size: 261KB | type: rar

Added on : 15 Jul 2019

Hindi – Kannada Parallel POS Tagged Text Corpus ILCI

Hindi – Kannada Parallel POS Tagged Text Corpus ILCI

Under the Indian Languages Corpora Initiative (ILCI) project initiated by the MeitY, Govt. of India, ILCI Consortia led by Jawaharlal Nehru University..

Available Under License:
Commercial   Research  

Sample Download | size: 264.5KB | type: rar

Added on : 15 Jul 2019

Punjabi Wordnet

Punjabi Wordnet

Under the Indo-Wordnet Consortium project, led by IIT Bombay, Punjabi Wordnet's Synsets (synonym set) has been developed. For each synset a POS catego..

Available Under License:
Commercial   Research  

Sample Download | size: 67.1KB | type: rar

Added on : 15 Jul 2019

Hindi – Bodo Parallel POS Tagged Text Corpus ILCI

Hindi – Bodo Parallel POS Tagged Text Corpus ILCI

Under the Indian Languages Corpora Initiative (ILCI) project, ILCI Consortia led by Jawaharlal Nehru University, New Delhi has created parallel corpus..

Available Under License:
Commercial   Research  

Sample Download | size: 263KB | type: rar

Added on : 15 Jul 2019

Hindi – Gujarati Parallel POS Tagged Text Corpus ILCI

Hindi – Gujarati Parallel POS Tagged Text Corpus ILCI

Under the Indian Languages Corpora Initiative (ILCI) project, ILCI Consortia led by Jawaharlal Nehru University, New Delhi has created parallel corpus..

Available Under License:
Commercial   Research  

Sample Download | size: 257.7KB | type: rar

Added on : 15 Jul 2019

Hindi – English Parallel POS Tagged Text Corpus ILCI

Hindi – English Parallel POS Tagged Text Corpus ILCI

Under the Indian Languages Corpora Initiative (ILCI) project, ILCI Consortia led by Jawaharlal Nehru University, New Delhi has created parallel corpus..

Available Under License:
Commercial   Research  

Sample Download | size: 242.7KB | type: rar

Added on : 15 Jul 2019

Hindi – Bengali Parallel POS Tagged Text Corpus ILCI

Hindi – Bengali Parallel POS Tagged Text Corpus ILCI

Under the Indian Languages Corpora Initiative (ILCI) project, ILCI Consortia led by Jawaharlal Nehru University, New Delhi has created parallel corpus..

Available Under License:
Commercial   Research  

Sample Download | size: 253.9KB | type: rar

Added on : 13 Jul 2019

Hindi – Assamese Parallel  POS Tagged Text Corpus ILCI

Hindi – Assamese Parallel POS Tagged Text Corpus ILCI

 Under the Indian Languages Corpora Initiative (ILCI) project, ILCI Consortia led by Jawaharlal Nehru University, New Delhi has created parallel ..

Available Under License:
Commercial   Research  

Sample Download | size: 262.1KB | type: rar

Added on : 13 Jul 2019

English Monolingual PoS Tagged Text Corpus ILCI

English Monolingual PoS Tagged Text Corpus ILCI

Under the Indian Languages Corpora Initiative phase –II (ILCI Phase-II) project, initiated by the MeitY, Govt. of India, Jawaharlal Nehru University, ..

Available Under License:
Commercial   Research  

Sample Download | size: 23.8KB | type: zip

Added on : 20 Jul 2020

Showing 101 to 141 of 141 (2 Pages)
Disclaimer: The information provided on this page has been procured through different sources. Please write back to us at nplt_support[at]cdac[dot]in in case you would like to suggest an update.