• English-Odia Tourism Set - II Parallel Text corpus-EILMT
English-Odia Tourism Set - II Parallel Text corpus-EILMT

Available Under License: Commercial   Research  

Sample Download | size: 25.9KB | type: zip
Added on : 04 Aug 2020

English-Odia Parallel Tourism Text corpus is developed in Unicode, under English to Indian Language Machine Translation (EILMT) consortium. The core vocabulary of this corpus consist of various names, destinations, visiting places, vocabularies from art & architecture, culture and civilization. By and large, the corpus contains basically descriptions and information pertaining to tourist destinations and related matters of tourist interests. This corpus is created in XML & XLS formats and size of the corpus is approx. 12,000 sentences.

Text Corpus Attributes
Language English - Odia
Parallel or Monolingual Parallel
Annotation Not Annotated
No. of Sentences 12000
Word-Count 226458
File Format XML & XLS formats
Encoding UTF-8
File Size 4.25 MB

Write a review

Please login or register to review

Tags: English-Odia, Parallel, Tourism, Text corpus

Disclaimer: The information provided on this page has been procured through different sources. Please write back to us at nplt_support[at]cdac[dot]in in case you would like to suggest an update.