• End-to-End Automatic Speech Recognition
End-to-End Automatic Speech Recognition
Added on : 27 Oct 2020

End-to-End Indian English Automatic Speech Recognition (ASR) systems have been developed for different domains like news, stories, articles and NPTEL lecture transcription domains like Humanities, Electrical Engineering, Electrical and Communication Engineering, Computer Science Engineering and Mechanical Engineering. A Speech activity detector is developed for distinguishing speech and silence. The segment of speech thus detected is used by the ASR systems to generate the transcriptions automatically. This automatically generated transcription is used in generating Subtitles / Notes.  


ASR has been provided as an offline service. Users have to upload wav files (not more than 200MB) at this link: https://www.iitm.ac.in/speech/NPTEL/audio/ The link to download the ASR output will be shared in the same website. The average turnaround time is 24 hours.

Write a review

Please login or register to review
Disclaimer: The information provided on this page has been procured through different sources. Please write back to us at nplt_support[at]cdac[dot]in in case you would like to suggest an update.