Evaluation of a transplantation algorithm for expressive speech synthesis


Autoria(s): Lorenzo Trueba, Jaime; Barra Chicote, Roberto; Yamagishi, J.; Watts, Oliver; Montero Martínez, Juan Manuel
Data(s)

2013

Resumo

When designing human-machine interfaces it is important to consider not only the bare bones functionality but also the ease of use and accessibility it provides. When talking about voice-based inter- faces, it has been proven that imbuing expressiveness into the synthetic voices increases signi?cantly its perceived naturalness, which in the end is very helpful when building user friendly interfaces. This paper proposes an adaptation based expressiveness transplantation system capable of copying the emotions of a source speaker into any desired target speaker with just a few minutes of read speech and without requiring the record- ing of additional expressive data. This system was evaluated through a perceptual test for 3 speakers showing up to an average of 52% emotion recognition rates relative to the natural voice recognition rates, while at the same time keeping good scores in similarity and naturality.

Formato

application/pdf

Identificador

http://oa.upm.es/26490/

Idioma(s)

eng

Publicador

E.T.S.I. Telecomunicación (UPM)

Relação

http://oa.upm.es/26490/1/INVE_MEM_2013_163879.pdf

info:eu-repo/semantics/altIdentifier/doi/null

Direitos

http://creativecommons.org/licenses/by-nc-nd/3.0/es/

info:eu-repo/semantics/openAccess

Fonte

IV Congreso Español de Informática (CEDI 2013). Workshop en Tecnologías Accesibles | IV Congreso Español de Informática (CEDI 2013). Workshop en Tecnologías Accesibles | 17/09/2013 - 20/09/2013 | Madrid, Spain

Palavras-Chave #Telecomunicaciones
Tipo

info:eu-repo/semantics/conferenceObject

Ponencia en Congreso o Jornada

PeerReviewed