Search

NLTM Pilot TTS Data for Indian Languages — Hindi, Punjabi, Tamil, and Indian English.

TTS data for Indian languages — Hindi, Punjabi, Tamil, and Indian English. Text and corresponding speech data record in studio environment....

Contributor: TTS Consortia

Tags: TTS Data,Speech Data, Hindi TTS Data, Punjabi TTS Data, Tamil TTS Data, Indian English TTS Data, IITM

Indian English ASR Challenge Data (ASR Speech Data released under 3rd Challenge) - NLTMP

The data set comprises of English read and conversational speech data along with the corresponding transcriptions. This speech data was collected by Speech Lab IITM and several startups. The text data...

Contributor: ASR Consortia

Tags: Indian English, ASR Challenge Data, ASR Speech Data, NLTM Pilot, Speech Corpus, Speech, Corpus

Hindi ASR Challenge Data (ASR Speech Data released under 3rd Challenge) - NLTMP

The data set comprises of Hindi read and conversational speech data along with the corresponding transcriptions. This speech data was collected by Speech Lab IITM and several startups. The text data w...

Contributor: ASR Consortia

Tags: Hindi, ASR Challenge Data, ASR Speech Data, NLTM Pilot, Speech Corpus, Speech, Corpus

English-Hindi ,Tamil-Telugu Parallel Data Developed Under PSA Pilot

English-Hindi , Tamil-Telugu Parallel Data Developed Under PSA Pilot on SSMT, lead by IIIT-Hyderabad...

Contributor: NLTM IIIT-Hyderabad

Tags: English-Hindi , Tamil-Telugu , Parallel Data, IIIT-Hyderabad,NLTM Pilot

Hindi -Telugu Domain Dictionary by IIIT-H

Hindi and Telugu Domain Dictionary developed under ILMT Hindi-Telugu Pilot by IIIT-Hyderabad (Part1). The Domain of Dictionary is Chemistry and Law. ...

Contributor: NLTM IIIT-Hyderabad

Tags: Hindi , Telugu, Dictionary, Hindi and Telugu Domain Dictionary

Hindi ASR Challenge Data (ASR Speech Data released under 1st Challenge) - NLTMP

The data set comprises of Hindi read speech data along with the corresponding transcriptions. The text data was crawled from newspapers, and then volunteers were asked to read them. It covers genres l...

Contributor: ASR Consortia

Tags: Hindi, ASR Challenge Data, ASR, Speech Data, NLTM Pilot

Hindi–Telugu Parallel Text Corpus IIIT-Hyd

Hindi – Telugu Parallel Text corpus developed Under NLTM Pilot by IIIT-Hyderabad. The domain of corpus is Chemistry, Law, News & General, Health-Care, Education, Open Education...

Contributor: NLTM IIIT-Hyderabad

Tags: NLTM Pilot, Hindi, Telugu, Hindi–Telugu, Parallel, Text Corpus

Hindi Annotated Text Corpus - IIIT Hyderabad

Hindi Annotated corpus developed Under NLTM Pilot by IIIT-Hyderabad (Part1). Domains of the Corpus are Chemistry, Law, News & General,HealthCare, Education Others, open education books....

Contributor: NLTM IIIT-Hyderabad

Tags: NLTM Pilot, Hindi, Telugu, Hindi–Telugu, Annotated, Text Corpus , IIIT-Hyderabad

e-Aksharayan – Hindi OCR

e-Aksharayan is a Desktop software for converting scanned printed Indian Language documents into a fully editable text format in Unicode encoding. Works on Windows 7,8, and 10. Input and output speci...

Contributor: OCR Consortia

Tags: e-Aksharayan, Hindi OCR, Hindi, OCR

Indian English Raw Speech Corpus - Kannada Variant

Contributor: CIIL Mysore

Tags: Indian English, Raw Speech Corpus, Kannada Variant, Speech Corpus

Indian English Raw Speech Corpus - Bengali Variant

Contributor: CIIL Mysore

Tags: Indian English, Raw Speech Corpus, Bengali Variant, Speech Corpus

Multilingual Raw Speech Corpus

Dataset Description 97:43:54 Hours | 62.2 GB speech data | 1916 Speakers | 1,916 Audio segment...

Contributor: CIIL Mysore

Tags: Multilingual, Raw Speech Corpus, Speech Corpus

Gujarati Raw Speech Corpus(Mono Recordings)

Contributor: CIIL Mysore

Tags: Gujarati, Raw Speech Corpus, Mono Recordings, Speech Corpus

Indian English ASR Challenge Data (ASR Speech Data) - NLTM Pilot

The data set comprises of Indian English read speech and lecture speech data along with the corresponding transcriptions. The read speech covers genres like politics sports, entertainment, etc. It was...

Contributor: ASR Consortia

Tags: Indian English, ASR Challenge Data, ASR Speech Data, NLTM Pilot, Speech Corpus, Speech, Corpus

HINDI Speech Data – ASR

This corpus contains the more than 194714 audio files of HINDI language of approx. 1000 native speakers. This corpus also contains word and its corresponding phonetic representation and transcrip...

Contributor: ASR Consortia

Tags: ASR, HINDI, Speech Data

Products meeting the search criteria