speech data

NLTM Pilot TTS Data for Indian Languages — Hindi, Punjabi, Tamil, and Indian English.

TTS data for Indian languages: Hindi, Punjabi, Tamil, and Indian English. Text and corresponding speech data recorded in a studio environment...

Available Under License:
CC BY-SA 2.0

Sample Download | size: 423.2MB | type: zip

Added on : 16 Aug 2021

Tags: TTS Data Speech Data Hindi TTS Data Punjabi TTS Data Tamil TTS Data Indian English TTS Data IITM

Indian English ASR Challenge Data (ASR Speech Data released under 3rd Challenge) - NLTMP

The data set comprises English read and conversational speech data along with the corresponding transcriptions. This speech data was collected by S..

Available Under License:
Research

Added on : 26 Jul 2021

Tags: Indian English ASR Challenge Data ASR Speech Data NLTM Pilot Speech Corpus Speech Corpus

Hindi ASR Challenge Data (ASR Speech Data released under 3rd Challenge) - NLTMP

The data set comprises Hindi read and conversational speech data along with the corresponding transcriptions. This speech data was collected by Spe..

Available Under License:
Research

Added on : 26 Jul 2021

Tags: Hindi ASR Challenge Data ASR Speech Data NLTM Pilot Speech Corpus Speech Corpus

Hindi ASR Challenge Data (ASR Speech Data released under 1st Challenge) - NLTMP

The data set comprises Hindi read speech data along with the corresponding transcriptions. The text data was crawled from newspapers, and then volu..

Available Under License:
Research

Sample Download | size: 66MB | type: zip

Added on : 10 Jun 2021

Tags: Hindi ASR Challenge Data ASR Speech Data NLTM Pilot

Tamil ASR Challenge Data (ASR Speech Data released under 3rd Challenge) - NLTMP

The data set comprises Tamil read and conversational speech data along with the corresponding transcriptions. This speech data was collected by Spe..

Available Under License:
Research

Added on : 26 Jul 2021

Tags: Tamil ASR Challenge Data ASR Speech Data NLTM Pilot Speech Corpus Speech Corpus

Indian English ASR Challenge Data (ASR Speech Data) - NLTM Pilot

The data set comprises Indian English read speech and lecture speech data along with the corresponding transcriptions. The read speech covers genre..

Available Under License:
Research

Sample Download | size: 23.7MB | type: tar

Added on : 10 Jun 2021

Tags: Indian English ASR Challenge Data ASR Speech Data NLTM Pilot Speech Corpus Speech Corpus

Telugu Speech Data - ASR

This corpus contains 6019 Telugu audio files from approx. 1000 native speakers. This data was prepared for Agricultural Commo..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 1.7MB | type: zip

Added on : 21 Jan 2021

Tags: ASR Telugu Speech Data

BIHARI SPEECH DATA - ASR

This corpus contains 54866 Bihari audio files from approx. 1000 native speakers. This corpus also contains word and its correspond..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 1.4MB | type: zip

Added on : 21 Jan 2021

Tags: ASR Bihari Speech Data

Bengali Speech Data – ASR

This corpus contains more than 43134 Bengali audio files from approx. 1000 native speakers. This corpus also contains word and its corre..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 981.8KB | type: zip

Added on : 12 Jan 2021

Tags: ASR Bengali Speech Data

HINDI Speech Data – ASR

This corpus contains more than 194714 Hindi audio files from approx. 1000 native speakers. This corpus also contains word and its c..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 2.7MB | type: zip

Added on : 12 Jan 2021

Tags: ASR HINDI Speech Data

Marathi Speech Data - ASR

This corpus contains more than 44521 Marathi audio files from 1500 speakers and a dic file which contains word and its corresponding phonetic..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 2.1MB | type: zip

Added on : 11 Dec 2020

Tags: ASR Marathi Speech Data

Tamil Speech Data - ASR

This corpus contains more than 88175 Tamil audio files from approx. 1000 native speakers. This corpus contains word and its correspondin..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 2.7MB | type: zip

Added on : 04 Dec 2020

Tags: ASR Tamil Speech Data

Odia Speech Data – ASR

This corpus contains more than 11940 Odia audio files from approx. 1000 native speakers. This corpus contains word and its corresponding..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 1.6MB | type: zip

Added on : 04 Dec 2020

Tags: ASR Odia Speech Data

Kannada Speech Data – ASR

This corpus contains more than 93803 Kannada audio files from 1000 native speakers and a Callflow1.dic file which contains word and its corre..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 973.4KB | type: zip

Added on : 04 Dec 2020

Tags: ASR Kannada Speech Data

HINDI (JHARKHAND) Speech Data – ASR

This corpus contains more than 36694 Hindi (Jharkhand) audio files from approx. 1000 native speakers. This corpus also contains wo..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 2MB | type: zip

Added on : 03 Dec 2020

Tags: ASR HINDI (JHARKHAND) Speech Data

Gujarati Speech Data - ASR

This corpus contains more than 46503 Gujarati audio files from approx. 1000 native speakers. This corpus also contains word and it..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 930.8KB | type: zip

Added on : 03 Dec 2020

Tags: ASR Gujarati Speech Corpus

Assamese Speech Data - ASR

This corpus contains 57975 Assamese audio files from approx. 1000 native speakers. This corpus also contains word and its correspo..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 1MB | type: zip

Added on : 03 Dec 2020

Tags: ASR ASSAMESE SPEECH DATA

Telugu Voice Data Female - ILTTS

Voice data collected for building HTS-based statistical speech synthesis for the Telugu language under the project developing text-to-speech (TTS)..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 121.1MB | type: 7z

Added on : 27 Aug 2019

Tags: Telugu Voice Data Female voice TTS text to speech

Telugu Voice Data Male - ILTTS

Voice data collected for building HTS-based statistical speech synthesis for the Telugu language under the project developing text-to-speech (TTS)..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 105.9MB | type: 7z

Added on : 26 Aug 2019

Tags: telugu voice data male voice

Tamil Voice Data Male - ILTTS

Voice data collected for building HTS-based statistical speech synthesis for the Tamil language under the project developing text-to-speech (TTS) ..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 25MB | type: 7z

Added on : 26 Aug 2019

Tags: Tamil Voice Data Male voice TTS text to speech

Tamil Voice Data Female - ILTTS

Voice data collected for building HTS-based statistical speech synthesis for the Tamil language under the project developing text-to-speech (TTS) ..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 26.9MB | type: 7z

Added on : 26 Aug 2019

Tags: Tamil voice data tts text to speech

Rajasthani Voice Data Male - ILTTS

Voice data collected for building HTS-based statistical speech synthesis for the Rajasthani language under the project developing text-to-speech (..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 24MB | type: 7z

Added on : 26 Aug 2019

Tags: Rajasthani Voice data hindi dialect tts text to speech

Rajasthani Voice Data Female - ILTTS

Voice data collected for building HTS-based statistical speech synthesis for the Rajasthani language under the project developing text-to-speech (..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 32.3MB | type: 7z

Added on : 26 Aug 2019

Tags: Rajasthani voice data tts text to speech

Odia Voice Data Male - ILTTS

Voice data collected for building HTS-based statistical speech synthesis for the Odia language under the project developing text-to-speech (TTS) s..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 5.6MB | type: 7z

Added on : 26 Aug 2019

Tags: Odia Odiya voice data male voice tts text to speech

Odia Voice Data Female - ILTTS

Voice data collected for building HTS-based statistical speech synthesis for the Odia language under the project developing text-to-speech (TTS) s..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 27.6MB | type: 7z

Added on : 26 Aug 2019

Tags: Odia Odiya text corpus tts voice data

Manipuri Voice Data Male - ILTTS

Voice data collected for building HTS-based statistical speech synthesis for the Manipuri language under the project developing text-to-speech (TT..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 24.6MB | type: 7z

Added on : 26 Aug 2019

Tags: Manipuri voice data tts text to speech

Manipuri Voice Data Female - ILTTS

Voice data collected for building HTS-based statistical speech synthesis for the Manipuri language under the project developing text-to-speech (TT..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 20.3MB | type: 7z

Added on : 26 Aug 2019

Tags: manipuri tts text to speech female voice data

Malayalam Voice Data Male - ILTTS

Voice data collected for building HTS-based statistical speech synthesis for the Malayalam language under the project developing text-to-speech (T..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 29.3MB | type: 7z

Added on : 23 Aug 2019

Tags: Malayalam voice data male voice tts text to speech

Bengali Speech Corpus ILSRD

Under the Indian Languages Speech Resources Development for Speech Applications project initiated by MeitY, Govt. of India, Speech Consortium..

Available Under License:
Commercial Research

Sample Download | size: 54.6MB | type: 7z

Added on : 23 Aug 2019

Tags: Speech Corpus Bengali

Malayalam Voice Data Female - ILTTS

Voice data collected for building HTS-based statistical speech synthesis for the Malayalam language under the project developing text-to-speech (T..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 37.1MB | type: 7z

Added on : 22 Aug 2019

Tags: Malayalam voice data female voice tts text to speech

Assamese Voice Data Male - ILTTS

Voice data collected for building HTS-based statistical speech synthesis for the Assamese language under the project developing text-to-speech (TT..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 22.2MB | type: 7z

Added on : 13 Aug 2019

Tags: Assamese tts text to speech voice data male

Assamese Voice Data Female - ILTTS

Voice data collected for building HTS-based statistical speech synthesis for the Assamese language under the project developing text-to-speech (TT..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 22.3MB | type: 7z

Added on : 13 Aug 2019

Tags: Assamese tts text to speech voice data female

Kannada Voice Data Male - ILTTS

Voice data collected for building HTS-based statistical speech synthesis for the Kannada language under the project developing text-to-speech (TTS..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 46MB | type: 7z

Added on : 13 Aug 2019

Tags: Kannada male tts text to speech voice data

Kannada Voice Data Female - ILTTS

Voice data collected for building HTS-based statistical speech synthesis for the Kannada language under the project developing text-to-speech (TTS..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 71.2MB | type: 7z

Added on : 13 Aug 2019

Tags: Kannada tts Text to Speech Voice data

Bodo Voice Data Female - ILTTS

Voice data collected for building HTS-based statistical speech synthesis for the Bodo language under the project developing text-to-speech (TTS..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 65.1MB | type: 7z

Added on : 07 Aug 2019

Tags: Bodo Boro TTS text to speech voice data

Bengali Voice Data Male - ILTTS

Voice data collected for building HTS-based statistical speech synthesis for the Bengali language under the project developing text-to-speech (TTS..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 34.1MB | type: 7z

Added on : 07 Aug 2019

Tags: Bengali Bangla TTS Text to Speech Bengali Voice

Bengali Voice Data Female - ILTTS

Voice data collected for building HTS-based statistical speech synthesis for the Bengali language under the project developing text-to-speech (TTS..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 28.7MB | type: 7z

Added on : 07 Aug 2019

Tags: Bangla Bengali TTS text to speech Bengali voice

Marathi Voice Data Male - ILTTS

Voice data collected for building HTS-based statistical speech synthesis for the Marathi language under the project developing text-to-speech (TTS..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 33.7MB | type: 7z

Added on : 07 Aug 2019

Tags: Marathi Male TTS Text to Speech Marathi TTS

Marathi Voice Data Female - ILTTS

Voice data collected for building HTS-based statistical speech synthesis for the Marathi language under the project developing text-to-speech (TTS..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 41.9MB | type: 7z

Added on : 06 Aug 2019

Tags: Marathi TTS Text to Speech Marathi TTS

Hindi Voice Data Female - ILTTS

Voice data collected for building HTS-based statistical speech synthesis for the Hindi language under the project developing text-to-speech (TTS) ..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 72.5MB | type: 7z

Added on : 06 Aug 2019

Tags: Hindi TTS text to speech Hindi voice

Hindi Voice Data Male - ILTTS

Voice data collected for building HTS-based statistical speech synthesis for the Hindi language under the project developing text-to-speech (TTS) ..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 66.8MB | type: 7z

Added on : 05 Aug 2019

Tags: Hindi Voice Data Male Voice TTS text-to-speech

Gujarati Voice Data Male - ILTTS

Voice data collected for building HTS-based statistical speech synthesis for the Gujarati language under the project developing text-to-speech (TT..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 56.7MB | type: 7z

Added on : 02 Aug 2019

Tags: Gujarati Voice Data text to speech Gujarati TTS male

Gujarati Voice Data Female - ILTTS

Voice data collected for building HTS-based statistical speech synthesis for the Gujarati language under the project developing text-to-speech (TT..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 62.5MB | type: 7z

Added on : 02 Aug 2019

Tags: Gujarati Gujarati Voice Data TTS Female Voice text to speech

Indian English Speech Corpus ILSRD

Under the Indian Languages Speech Resources Development for Speech Applications project initiated by MeitY, Govt. of India, Speech Consortium led ..

Available Under License:
Commercial Research

Sample Download | size: 33.3MB | type: 7z

Added on : 16 Jul 2019

Tags: Speech Corpus Indian English English

Hindi Speech Corpus ILSRD

Under the Indian Languages Speech Resources Development for Speech Applications project initiated by MeitY, Govt. of India, Speech Consortium led ..

Available Under License:
Commercial Research

Sample Download | size: 39.9MB | type: 7z

Added on : 16 Jul 2019

Tags: Hindi Speech Corpus