Speech Corpus

Assamese Speech Data-ASR

Assamese Speech Data-ASR

This corpus contains the 57975 audio files of Assamese language of approx. 1000 native speakers. This corpus also  contains word and its correspo..

Available Under License:
CC BY-SA 2.0  

Sample Download | size: 1MB | type: zip

Added on : 03 Dec 2020

Assamese Voice Data Female - ILTTS

Assamese Voice Data Female - ILTTS

It is a voice data collected for building HTS based statistical speech synthesis for Assamese language under the project developing text-to-speech (TT..

Available Under License:
CC BY-SA 2.0  

Sample Download | size: 22.3MB | type: 7z

Added on : 13 Aug 2019

Assamese Voice Data Male - ILTTS

Assamese Voice Data Male - ILTTS

It is a voice data collected for building HTS based statistical speech synthesis for Assamese language under the project developing text-to-speech (TT..

Available Under License:
CC BY-SA 2.0  

Sample Download | size: 22.2MB | type: 7z

Added on : 13 Aug 2019

Bengali Speech Corpus ILSRD

Bengali Speech Corpus ILSRD

Under the Indian Languages Speech Resources Development for Speech Applications project initiated by the MeitY, Govt. of India, Speech Consortium..

Available Under License:
Commercial   Research  

Sample Download | size: 54.6MB | type: 7z

Added on : 23 Aug 2019

Bengali Speech Data – ASR

Bengali Speech Data – ASR

This corpus contains the more than 43134 audio files of Bengali language of approx. 1000 native speakers. This corpus also contains word and its corre..

Available Under License:
CC BY-SA 2.0  

Sample Download | size: 981.8KB | type: zip

Added on : 12 Jan 2021

Bengali Voice Data Female - ILTTS

Bengali Voice Data Female - ILTTS

It is a voice data collected for building HTS based statistical speech synthesis for Bengali language under the project developing text-to-speech (TTS..

Available Under License:
CC BY-SA 2.0  

Sample Download | size: 28.7MB | type: 7z

Added on : 07 Aug 2019

Bengali Voice Data Male - ILTTS

Bengali Voice Data Male - ILTTS

It is a voice data collected for building HTS based statistical speech synthesis for Bengali language under the project developing text-to-speech (TTS..

Available Under License:
CC BY-SA 2.0  

Sample Download | size: 34.1MB | type: 7z

Added on : 07 Aug 2019

BIHARI SPEECH DATA - ASR

BIHARI SPEECH DATA - ASR

This corpus contains the 54866 audio files of Bihari language of approx. 1000 native speakers. This corpus also  contains word and its correspond..

Available Under License:
CC BY-SA 2.0  

Sample Download | size: 1.4MB | type: zip

Added on : 21 Jan 2021

Bodo Voice Data Female - ILTTS

Bodo Voice Data Female - ILTTS

It is a voice data collected for building HTS based statistical speech synthesis for Bengali language under the project developing text-to-speech (TTS..

Available Under License:
CC BY-SA 2.0  

Sample Download | size: 65.1MB | type: 7z

Added on : 07 Aug 2019

Gujarati Speech Data - ASR

Gujarati Speech Data - ASR

This corpus contains the more than 46503 audio files of Gujarati language of  approx. 1000 native speakers. This corpus also contains word and it..

Available Under License:
CC BY-SA 2.0  

Sample Download | size: 930.8KB | type: zip

Added on : 03 Dec 2020

Gujarati Voice Data Female - ILTTS

Gujarati Voice Data Female - ILTTS

It is a voice data collected for building HTS based statistical speech synthesis for Gujarati language under the project developing text-to-speech (TT..

Available Under License:
CC BY-SA 2.0  

Sample Download | size: 62.5MB | type: 7z

Added on : 02 Aug 2019

Gujarati Voice Data Male - ILTTS

Gujarati Voice Data Male - ILTTS

It is a voice data collected for building HTS based statistical speech synthesis for Gujarati language under the project developing text-to-speech (TT..

Available Under License:
CC BY-SA 2.0  

Sample Download | size: 56.7MB | type: 7z

Added on : 02 Aug 2019

HINDI (JHARKHAND) Speech Data – ASR

HINDI (JHARKHAND) Speech Data – ASR

This corpus contains the more than 36694 audio files of HINDI (JHARKHAND)  language of approx. 1000 native speakers. This corpus also contains wo..

Available Under License:
CC BY-SA 2.0  

Sample Download | size: 2MB | type: zip

Added on : 03 Dec 2020

Hindi ASR Challenge Data (ASR Speech Data) - NLTMP

Hindi ASR Challenge Data (ASR Speech Data) - NLTMP

The data set comprises of Hindi read speech data along with the corresponding transcriptions. The text data was crawled from newspapers, and then volu..

Available Under License:
Research  

Sample Download | size: 66MB | type: zip

Added on : 10 Jun 2021

Hindi Speech Corpus ILSRD

Hindi Speech Corpus ILSRD

Under the Indian Languages Speech Resources Development for Speech Applications project initiated by the MeitY, Govt. of India, Speech Consortium led ..

Available Under License:
Commercial   Research  

Sample Download | size: 39.9MB | type: 7z

Added on : 16 Jul 2019

Showing 1 to 15 of 41 (3 Pages)
Disclaimer: The information provided on this page has been procured through different sources. Please write back to us at nplt_support[at]cdac[dot]in in case you would like to suggest an update.