On the development of an automatic voice pleasantness classification and intensity estimation system


Autoria(s): Coelho, Luís; Braga, Daniela; Sales-Dias, Miguel; Garcia-Mateo, Carmen
Data(s)

22/01/2014

22/01/2014

2013

Resumo

In the last few years, the number of systems and devices that use voice based interaction has grown significantly. For a continued use of these systems, the interface must be reliable and pleasant in order to provide an optimal user experience. However there are currently very few studies that try to evaluate how pleasant is a voice from a perceptual point of view when the final application is a speech based interface. In this paper we present an objective definition for voice pleasantness based on the composition of a representative feature subset and a new automatic voice pleasantness classification and intensity estimation system. Our study is based on a database composed by European Portuguese female voices but the methodology can be extended to male voices or to other languages. In the objective performance evaluation the system achieved a 9.1% error rate for voice pleasantness classification and a 15.7% error rate for voice pleasantness intensity estimation.

Work partially supported by ERDF funds, the Spanish Government (TEC2009-14094-C04-04), and Xunta de Galicia (CN2011/019, 2009/062)

Identificador

Pinto-Coelho, L., Braga, D., Sales-Dias, M. & Garcia-Mateo, C. (2013) On the development of an automatic voice pleasantness classification and intensity estimation system. Computer Speech & Language, 27 (1), 75-88. doi: 10.1016/j.csl.2012.01.006

0885-2308

DOI 10.1016/j.csl.2012.01.006

http://hdl.handle.net/10400.22/3436

Idioma(s)

eng

Publicador

Elsevier

Relação

Computer Speech and Language

http://www.sciencedirect.com/science/article/pii/S0885230812000083

Direitos

openAccess

Palavras-Chave #Voice pleasantness #Subtle emotions #Perceptual speech analysis #Text-to-Speech synthesis
Tipo

article