Other Repositories

List of Indian Languages linguistic resources and tools developed by other institutions apart from NPLT umbrella. 

Refine Search

Indian English Raw Speech Corpus - Kannada Variant

Indian English Raw Speech Corpus - Kannada Variant

Dataset Description23:43:04 Hours | 15.3 GB | 56 Speakers| 14,455 Audio Segments | 48 kHz | 16 bit wav. English language is a blend of Anglo-Saxo..

Sample Download | size: 2.8MB | type: zip

Added on : 27 Aug 2021

Indian English Raw Speech Corpus - Bengali Variant

Indian English Raw Speech Corpus - Bengali Variant

Dataset Description 25:47:11 Hours | 15.5 GB | 53 Speakers| 16,044 Audio Segments | 48 kHz | 16 bit wav.English language is a blend of Anglo-Saxo..

Sample Download | size: 1.7MB | type: zip

Added on : 27 Aug 2021

Multilingual Raw Speech Corpus

Multilingual Raw Speech Corpus

Dataset Description 97:43:54 Hours | 62.2 GB speech data | 1916 Speakers ..

Sample Download | size: 387.1KB | type: pdf

Added on : 27 Aug 2021

Tamil Raw Speech Corpus

Tamil Raw Speech Corpus

Dataset Description139:11:41 Hours | 86 GB speech data | 452 Speakers | 60,287 Audio segments | 48 kHz | 16 bit wav. Tamil is one of the longes..

Sample Download | size: 2.8MB | type: zip

Added on : 27 Aug 2021

Odia Raw Speech Corpus

Odia Raw Speech Corpus

Dataset Description 138:06:18 hours |  89 GB | 474 Speakers | 73,418 Audio segments | 48 kHz | 16 bit wav.Odia is an Indo-Aryan ..

Sample Download | size: 1.4MB | type: zip

Added on : 27 Aug 2021

Kashmiri Raw Speech Corpus

Kashmiri Raw Speech Corpus

Dataset Description 28:10:07 Hours | 18 GB speech data | 150 Speakers | 16,380 Audio segments | 48 kHz | 16 bit wa..

Sample Download | size: 1.6MB | type: zip

Added on : 26 Aug 2021

Gujarati Raw Speech Corpus(Mono Recordings)

Gujarati Raw Speech Corpus(Mono Recordings)

Dataset Description 64:44:02 Hours | 7.1 GB | 233 Speakers| 26,223 Audio Segments | 16 kHz | 16 bit wav. Gujarati is one of ..

Sample Download | size: 380.7KB | type: zip

Added on : 26 Aug 2021

Gujarati Raw Speech Corpus

Gujarati Raw Speech Corpus

Dataset Description57:17:08 Hours | 37 GB | 204 Speakers| 25,712 Audio Segments | 48 kHz | 16 bit wav. Gujarati is one of the ma..

Sample Download | size: 2.3MB | type: zip

Added on : 26 Aug 2021

Dogri Raw Speech Corpus

Dogri Raw Speech Corpus

Dataset Description 17:10:26 Hours | 11 GB speech data | 61 Speakers | 12,036 Audio segments | 48 kHz | 16..

Sample Download | size: 2MB | type: zip

Added on : 26 Aug 2021

Assamese Raw Speech Corpus

Assamese Raw Speech Corpus

Dataset Description  54:21:12 Hours | 32.5 GB | 304 Speakers | 37,570 Audio Segments | 48 kHz | 16 bit wav.&n..

Sample Download | size: 1.3MB | type: zip

Added on : 26 Aug 2021

Sanskrit Sandhi Generator (संस्कृत-संधि-प्रक्रिया)

Sanskrit Sandhi Generator (संस्कृत-संधि-प्रक्रिया)

This application has been developed as a result of the dataset prepared by Sachin Kumar and Diwakar Mani (research students of the center under th..

Added on : 16 Sep 2019

Sanskrit Sandhi Recognizer and Analyzer

Sanskrit Sandhi Recognizer and Analyzer

Sandhi-Splitter is a computional tool which shows all possible splittings of a given Sanskrit string. The Sanskrit sandhi splitter (CONSONANT SANDHI)..

Added on : 16 Sep 2019

Sanskrit Morphological Analyzer

Sanskrit Morphological Analyzer

The "Sanskrit Morphological Analyzer" is a collection of modules developed as a result of Computational Sanskrit R&D at Special Center of Sanskrit..

Added on : 16 Aug 2019

Ayurveda Search

Ayurveda Search

Ayurveda is one of the oldest science of medicine and health form India dating back to the age of Vedas. The tradition of Ayurveda has had a unbroken ..

Added on : 14 Aug 2019

Bhagvadgita ebook and Search

Bhagvadgita ebook and Search

The "Online Indexing of Srimadbhagavadgita" was completed as part of "Multimedia E- Book of Srimadbhagavadgita: With Special Reference to Chapter 1" M..

Added on : 14 Aug 2019

Showing 1 to 15 of 48 (4 Pages)
Disclaimer: The information provided on this page has been procured through different sources. Please write back to us at nplt_support[at]cdac[dot]in in case you would like to suggest an update.