• English-Gujarati Tourism Set - II Parallel Text corpus-EILMT
English-Gujarati Tourism Set - II Parallel Text corpus-EILMT

Available Under License: Commercial   Research  

Sample Download | size: 23.4KB | type: zip
Added on : 23 Jul 2020

English-Gujarati Parallel Tourism Text corpus is developed in Unicode under English to Indian Language Machine Translation (EILMT) consortium. The core vocabulary of this corpus consist of various names, destinations, visiting places, vocabularies from art & architecture, culture and civilization. By and large, the corpus contains basically descriptions and information pertaining to tourist destinations and related matters of tourist interests. This corpus is created in excel format and size of the corpus is 11962 sentences.

Text Corpus Attributes
Language English to Gujarati
Parallel or Monolingual Parallel
Annotation Not annotated
No. of Sentences 11962 Sentences
Word-Count 226162 word
File Format XLS file
Encoding UTF-8
File Size 1.56 MB

Write a review

Please login or register to review

Tags: English-Gujarati, Parallel, Tourism, Text corpus

Disclaimer: The information provided on this page has been procured through different sources. Please write back to us at nplt_support[at]cdac[dot]in in case you would like to suggest an update.