Resources

Here, the term Resources refers to a set of speech or language data and descriptions in machine-readable form, used for building, improving, or evaluating natural language and speech algorithms or systems.

NLTM Pilot TTS Data for Indian Languages — Hindi, Punjabi, Tamil, and Indian English.

TTS data for Indian languages: Hindi, Punjabi, Tamil, and Indian English. Text and corresponding speech data recorded in a studio environment...

Available Under License:
CC BY-SA 2.0

Sample Download | size: 423.2MB | type: zip

Added on : 16 Aug 2021

Tags: TTS Data, Speech Data, Hindi TTS Data, Punjabi TTS Data, Tamil TTS Data, Indian English TTS Data, IITM

Indian English ASR Challenge Data (ASR Speech Data released under 3rd Challenge) - NLTMP

The data set comprises English read and conversational speech data along with the corresponding transcriptions. This speech data was collected by S..

Available Under License:
Research

Added on : 26 Jul 2021

Tags: Indian English ASR Challenge Data ASR Speech Data NLTM Pilot Speech Corpus Speech Corpus

Hindi ASR Challenge Data (ASR Speech Data released under 3rd Challenge) - NLTMP

The data set comprises Hindi read and conversational speech data along with the corresponding transcriptions. This speech data was collected by Spe..

Available Under License:
Research

Added on : 26 Jul 2021

Tags: Hindi ASR Challenge Data ASR Speech Data NLTM Pilot Speech Corpus Speech Corpus

English-Hindi, Tamil-Telugu Parallel Data Developed Under PSA Pilot

English-Hindi, Tamil-Telugu Parallel Data Developed Under PSA Pilot on SSMT, led by IIIT-Hyderabad..

Available Under License:
CC BY-NC-SA 4.0

Sample Download | size: 978B | type: zip

Added on : 23 Jul 2021

Tags: English-Hindi Tamil-Telugu Parallel Data IIIT-Hyderabad NLTM Pilot

Hindi-Telugu Domain Dictionary by IIIT-H

Hindi and Telugu Domain Dictionary developed under ILMT Hindi-Telugu Pilot by IIIT-Hyderabad (Part 1). The domain of the dictionary is Chemistry and ..

Available Under License:
CC BY-NC-SA 4.0

Sample Download | size: 566B | type: zip

Added on : 20 Jun 2021

Tags: Hindi Telugu Dictionary Hindi and Telugu Domain Dictionary

Hindi ASR Challenge Data (ASR Speech Data released under 1st Challenge) - NLTMP

The data set comprises Hindi read speech data along with the corresponding transcriptions. The text data was crawled from newspapers, and then volu..

Available Under License:
Research

Sample Download | size: 66MB | type: zip

Added on : 10 Jun 2021

Tags: Hindi ASR Challenge Data, ASR Speech Data, NLTM Pilot

Hindi–Telugu Parallel Text Corpus IIIT-Hyd

Hindi–Telugu Parallel Text Corpus developed under NLTM Pilot by IIIT-Hyderabad. The domains of the corpus are Chemistry, Law, News & General, ..

Available Under License:
CC BY-NC-SA 4.0

Sample Download | size: 29.8KB | type: zip

Added on : 17 Mar 2021

Tags: NLTM Pilot Hindi Telugu Hindi–Telugu Parallel Text Corpus

Hindi Annotated Text Corpus - IIIT Hyderabad

Hindi Annotated Corpus developed under NLTM Pilot by IIIT-Hyderabad (Part 1). Domains of the corpus are Chemistry, Law, News & General, HealthCare, ..

Available Under License:
CC BY-NC-SA 4.0

Sample Download | size: 10.6KB | type: zip

Added on : 17 Mar 2021

Tags: NLTM Pilot Hindi Telugu Hindi–Telugu Annotated Text Corpus IIIT-Hyderabad

Gujarati Wordnet

Under the Indo-Wordnet Consortium project, led by IIT Bombay, Gujarati Wordnet's synsets (synonym sets) have been developed. For each synset a POS categ..

Available Under License:
Commercial Research

Sample Download | size: 65.6KB | type: rar

Added on : 17 Jul 2019

Tags: Gujarati Wordnet Synset