Other Repositories

List of Indian Languages linguistic resources and tools developed by other institutions apart from NPLT umbrella. 

Refine Search

A Gold Standard Dogri Raw Text Corpus

A Gold Standard Dogri Raw Text Corpus

Dogri, is an Indo-Aryan Language spoken by about five million people in India and Pakistan, Particularly in the Jammu.Dogri Text Corpus encoded in a m..

Sample Download | size: 45.8KB | type: zip

Added on : 26 Jul 2019

A Gold Standard Bodo Raw Text Corpus

A Gold Standard Bodo Raw Text Corpus

Unicode Standard Bodo text Corpus of 29, 15,544 words | 80Titles |Data and Metadata in XML format | 5 text domainsBodo is a major tribal language..

Sample Download | size: 21.4KB | type: zip

Added on : 26 Jul 2019

A Gold Standard Bengali Raw Text Corpus

A Gold Standard Bengali Raw Text Corpus

Bengali is the official language of West Bengal and Tripura. It belongs to the Indo-Aryan language family.Bengali Text Corpus encoded in a machine r..

Sample Download | size: 56.5KB | type: zip

Added on : 26 Jul 2019

Showing 46 to 48 of 48 (4 Pages)
Disclaimer: The information provided on this page has been procured through different sources. Please write back to us at nplt_support[at]cdac[dot]in in case you would like to suggest an update.