Dictionary-Learning-Based Post-Filter for HMM-Based Speech Synthesis


Autoria(s): Narayanamurthy, Praneeth Kurpad; Seelamantula, Chandra Sekhar
Data(s)

2015

Resumo

Oversmoothing of speech parameter trajectories is one of the causes for quality degradation of HMM-based speech synthesis. Various methods have been proposed to overcome this effect, the most recent ones being global variance (GV) and modulation-spectrum-based post-filter (MSPF). However, there is still a significant quality gap between natural and synthesized speech. In this paper, we propose a two-fold post-filtering technique to alleviate to a certain extent the oversmoothing of spectral and excitation parameter trajectories of HMM-based speech synthesis. For the spectral parameters, we propose a sparse coding-based post-filter to match the trajectories of synthetic speech to that of natural speech, and for the excitation trajectory, we introduce a perceptually motivated post-filter. Experimental evaluations show quality improvement compared with existing methods.

Formato

application/pdf

Identificador

http://eprints.iisc.ernet.in/53336/1/IEEE_Reg_Con_2015.pdf

Narayanamurthy, Praneeth Kurpad and Seelamantula, Chandra Sekhar (2015) Dictionary-Learning-Based Post-Filter for HMM-Based Speech Synthesis. In: IEEE Region 10 Conference (TENCON), NOV 01-04, 2015, Macau, PEOPLES R CHINA.

Publicador

IEEE

Relação

http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7373091

http://eprints.iisc.ernet.in/53336/

Palavras-Chave #Electrical Engineering
Tipo

Conference Proceedings

NonPeerReviewed