A Methodology for Creating a Segment Inventory for Greek Time Domain Speech Synthesis

ΕΡΓΑ

Ερευνητική περιοχή:

Είδος:

Άρθρο σε περιοδικό

Έτος:	2005

Συγγραφείς:	Ευίτα-Σταυρούλα Φωτεινέα; Γιώργος Ταμπουρατζής
Περιοδικό:	International Journal of Speech Technology
Τόμος:	8
Αριθμός:	2
Σελίδες:	161-172



Περίληψη:	This article focuses on the systematic design of a segment database which has been used to support a time-domain speech synthesis system for the Greek language. Thus, a methodology is presented for the generation of a corpus containing all possible instances of the segments for the specific language. Issues such as the phonetic coverage, the sentence selection and iterative evaluation techniques employing custom-built tools, are examined. Emphasis is placed on the comparison of the process-derived corpus to naturally-occurring corpora with respect to their suitability for use in time-domain speech synthesis. The proposed methodology generates a corpus characterised by a near-minimal size and which provides a complete coverage of the Greek language. Furthermore, within this corpus, the distribution of segmental units is similar to that of natural corpora, allowing for the extraction of multiple units in the case of the most frequently-occurring segments. The corpus creation algorithm incorporates mechanisms that enable the fine-tuning of the segment database’s language-dependent characteristics and thus assists in the generation of high-quality text-to-speech synthesis.
[Bibtex]