• Telugu Raw Speech Corpus
Telugu Raw Speech Corpus
  • Contributor: CIIL Mysore
  • Product Code: CIIL-TEL-RAW-Speech-130
Sample Download | size: 1.6MB | type: zip
Added on : 29 Jul 2019

22:43:59 hours of 15 Gigabytes speech data | 80 Speakers | 10510 Audio segments | 48 khz | 16 bit wav

Approximately 15 minutes speech (per speaker) has taken from 24 female and 56 male native speakers of different age groups. Each speaker recorded these datasets which are randomly selected from a master dataset.

Corpus Details:

    • Total speakers 80 (24 Female and 56 Male.)
    • Speech in .wav format; Metadata .txt format
    • Contemporary Text (News)-77 Audio Segments - 8:28:19 hours
    • Creative Text 77 Audio Segments - 7:10:35 hours
    • Sentence - 1828– Audio Segments 1:39:00 hours                          
    • Date142 - Audio Segments - 0:14:49 hours   
    • Command and Control Words– 2170 Audio Segments 1:43:49 hours
    • Person Name– 1438 Audio Segments  - 1:09:31 hours
    • Place Name- 707 Audio Segments - 0:33:24 hours
    • Most Frequent Word-Part– 2162 Audio Segments - 1:33:31 hours
    • Most Frequent Word-FullSet - 1909 Audio Segments0:41:23 hours
Speech Data Attributes
Annotation Raw Speech Corpus
Language Telugu
Duration 22:43:59
Speaker Type Native
File Size 15 GB
No. of Audio Segment 10510
Speaker Gender Male and Female

Write a review

Please login or register to review

Tags: Telugu, Raw Speech Corpus

Disclaimer: The information provided on this page has been procured through different sources. Please write back to us at nplt_support[at]cdac[dot]in in case you would like to suggest an update.