English-Gujarati Tourism Set - II Parallel Text corpus-EILMT

Contributor: EILMT Consortia
Product Code: EILMT-ENG-GUJ-TEXT-0717

Available Under License: Commercial Research

Sample Download | size: 23.4KB | type: zip

Added on : 23 Jul 2020

English-Gujarati Parallel Tourism Text corpus is developed in Unicode under English to Indian Language Machine Translation (EILMT) consortium. The core vocabulary of this corpus consist of various names, destinations, visiting places, vocabularies from art & architecture, culture and civilization. By and large, the corpus contains basically descriptions and information pertaining to tourist destinations and related matters of tourist interests. This corpus is created in excel format and size of the corpus is 11962 sentences.

Text Corpus Attributes
Language	English to Gujarati
Parallel or Monolingual	Parallel
Annotation	Not annotated
No. of Sentences	11962 Sentences
Word-Count	226162 word
File Format	XLS file
Encoding	UTF-8
File Size	1.56 MB

Tags: English-Gujarati, Parallel, Tourism, Text corpus

Write a review