Kalman tracking linear predictor for vowel intelligibility enhancement on european portuguese HMM based speech synthesis


Autoria(s): Coelho, Luís; Braga, Daniela; Garcia-Mateo, Carmen
Data(s)

05/02/2016

05/02/2016

2010

Resumo

The recent developments on Hidden Markov Models (HMM) based speech synthesis showed that this is a promising technology fully capable of competing with other established techniques. However some issues still lack a solution. Several authors report an over-smoothing phenomenon on both time and frequencies which decreases naturalness and sometimes intelligibility. In this work we present a new vowel intelligibility enhancement algorithm that uses a discrete Kalman filter (DKF) for tracking frame based parameters. The inter-frame correlations are modelled by an autoregressive structure which provides an underlying time frame dependency and can improve time-frequency resolution. The system’s performance has been evaluated using objective and subjective tests and the proposed methodology has led to improved results.

Identificador

Coelho, L., Braga, D., & Garcia-Mateo, C. (2010). Kalman tracking linear predictor for vowel intelligibility enhancement on european portuguese HMM based speech synthesis. 2010 IEEE International Conference on Acoustics, Speech and Signal Processing. Institute of Electrical & Electronics Engineers (IEEE). http://doi.org/10.1109/icassp.2010.5495168

978-1-4244-4296-6

1520-6149

http://hdl.handle.net/10400.22/7636

10.1109/ICASSP.2010.5495168

Idioma(s)

eng

Direitos

openAccess

Palavras-Chave #Kalman filtering #Speech intelligibility
Tipo

conferenceObject