Hindi Monolingual Data Set

Available Under License: CC BY-NC-SA 4.0  

Added on: 14 Dec 2020

This Hindi monolingual data set contains 473605 sentences with a total word count of 7092870. It has been released under the CC BY-NC-SA 4.0 license by Panlingua Language Processing LLP, New Delhi, India.

Text Corpus Attributes
No. of Sentences: 4,73,605
Word-Count: 70,92,870
Encoding: Unicode / UTF-8
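
The attributes above can be sanity-checked after download with a short script. The following is a minimal sketch, not the publisher's method: it assumes the corpus is a plain UTF-8 text file with one sentence per line and that words are separated by whitespace. Neither assumption is confirmed on this page, and the file name `hindi_monolingual.txt` is hypothetical. If the publisher counted words differently, the totals may not match exactly.

```python
def corpus_stats(lines):
    """Return (sentence_count, word_count) for an iterable of text lines.

    Assumption: each non-empty line is one sentence, and words are
    whitespace-separated tokens (punctuation stays attached to words).
    """
    sentences = 0
    words = 0
    for line in lines:
        text = line.strip()
        if not text:
            continue  # skip blank lines
        sentences += 1
        words += len(text.split())
    return sentences, words


def corpus_stats_from_file(path):
    """Read a UTF-8 file line by line and compute the same statistics."""
    with open(path, encoding="utf-8") as handle:
        return corpus_stats(handle)


if __name__ == "__main__":
    # Small in-memory sample; replace with
    # corpus_stats_from_file("hindi_monolingual.txt") for the real file
    # (hypothetical file name).
    sample = ["यह एक वाक्य है।", "", "दूसरा वाक्य।"]
    print(corpus_stats(sample))
```

Reading the file line by line keeps memory use low, which is useful for a corpus of this size.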

Tags: Hindi, Monolingual, Text Corpus

Disclaimer: The information provided on this page has been procured through different sources. Please write back to us at nplt_support[at]cdac[dot]in in case you would like to suggest an update.