• Indian English ASR Challenge Data (ASR Speech Data released under 3rd Challenge) - NLTMP
Indian English ASR Challenge Data (ASR Speech Data released under 3rd Challenge) - NLTMP
  • Contributor: ASR Consortia
  • Product Code: NLTMP-ASR-3CHALLENGE-ENG-004

Available Under License: Research  

Added on : 26 Jul 2021

The data set comprises of English read and conversational speech data along with the corresponding transcriptions. This speech data was collected by Speech Lab IITM and several startups. The text data was crawled from newspapers, and then volunteers were asked to read them. The following data sets are released for this challenge:


Train set - 179.5 hours

Development set - 5.4 hours   

Evaluation set - 5.4  hours 

Speech Data Attributes
Language Indian Accent English
Transcription Yes, Available
Duration 190.3 hours
Speaker Gender Both Male & Female

Write a review

Please login or register to review

Tags: Indian English, ASR Challenge Data, ASR Speech Data, NLTM Pilot, Speech Corpus, Speech, Corpus

Disclaimer: The information provided on this page has been procured through different sources. Please write back to us at nplt_support[at]cdac[dot]in in case you would like to suggest an update.