• English-Urdu Tourism Set - I Parallel Text corpus-EILMT
English-Urdu Tourism Set - I Parallel Text corpus-EILMT

Available Under License: Commercial   Research  

Sample Download | size: 23.2KB | type: zip
Added on : 20 Aug 2020

English-Urdu Parallel Tourism Text corpus is developed in Unicode under English to Indian Language Machine Translation (EILMT) consortium. The core vocabulary of this corpus consist of various names, destinations, visiting places, vocabularies from art & architecture, culture and civilization. By and large, the corpus contains basically descriptions and information pertaining to tourist destinations and related matters of tourist interests. This corpus is created in XLS formats and size of the corpus is approx 15198 sentences.

Text Corpus Attributes
Language English - Urdu
Parallel or Monolingual Parallel
Annotation Not Annotated
No. of Sentences 15198 Sentences
Word-Count 339500 words
File Format XLS file
Encoding UTF-8
File Size 2.15 MB

Write a review

Please login or register to review

Tags: English-Urdu, Parallel, Tourism, Text corpus

Disclaimer: The information provided on this page has been procured through different sources. Please write back to us at nplt_support[at]cdac[dot]in in case you would like to suggest an update.