New Method for Delexicalization and its Application to Prosodic Tagging for Text-to-Speech Synthesis


Autoria(s): Vainio, Martti; Suni, Antti Santeri; Raitio, Tuomo; Nurminen, Jani; Järvikivi, Juhani; Alku, Paavo
Contribuinte(s)

University of Helsinki, Institute of Behavioural Sciences

University of Helsinki, Institute of Behavioural Sciences

University of Helsinki, Institute of Behavioural Sciences

Data(s)

2009

Resumo

This paper describes a new flexible delexicalization method based on glottal excited parametric speech synthesis scheme. The system utilizes inverse filtered glottal flow and all-pole modelling of the vocal tract. The method provides a possibil- ity to retain and manipulate all relevant prosodic features of any kind of speech. Most importantly, the features include voice quality, which has not been properly modeled in earlier delex- icalization methods. The functionality of the new method was tested in a prosodic tagging experiment aimed at providing word prominence data for a text-to-speech synthesis system. The ex- periment confirmed the usefulness of the method and further corroborated earlier evidence that linguistic factors influence the perception of prosodic prominence.

Formato

4

Identificador

http://hdl.handle.net/10138/24712

Idioma(s)

eng

Relação

Interspeech 2009 Proceedings of the 10th Annual Conference of the International Speech Communication Association, Brighton, UK, 6-10 Sept 2009

Interspeech

Fonte

Vainio , M , Suni , A S , Raitio , T , Nurminen , J , Järvikivi , J & Alku , P 2009 , ' New Method for Delexicalization and its Application to Prosodic Tagging for Text-to-Speech Synthesis ' in Interspeech 2009 : Proceedings of the 10th Annual Conference of the International Speech Communication Association, Brighton, UK, 6-10 Sept 2009 , Interspeech .

Palavras-Chave #616 Other humanities
Tipo

A4 Article in conference publication (refereed)

info:eu-repo/semantics/conferencePaper

http://purl.org/eprint/status/NonPeerReviewed