Dictionary-Learning-Based Post-Filter for HMM-Based Speech Synthesis
Data(s) |
2015
|
---|---|
Resumo |
Oversmoothing of speech parameter trajectories is one of the causes for quality degradation of HMM-based speech synthesis. Various methods have been proposed to overcome this effect, the most recent ones being global variance (GV) and modulation-spectrum-based post-filter (MSPF). However, there is still a significant quality gap between natural and synthesized speech. In this paper, we propose a two-fold post-filtering technique to alleviate to a certain extent the oversmoothing of spectral and excitation parameter trajectories of HMM-based speech synthesis. For the spectral parameters, we propose a sparse coding-based post-filter to match the trajectories of synthetic speech to that of natural speech, and for the excitation trajectory, we introduce a perceptually motivated post-filter. Experimental evaluations show quality improvement compared with existing methods. |
Formato |
application/pdf |
Identificador |
http://eprints.iisc.ernet.in/53336/1/IEEE_Reg_Con_2015.pdf Narayanamurthy, Praneeth Kurpad and Seelamantula, Chandra Sekhar (2015) Dictionary-Learning-Based Post-Filter for HMM-Based Speech Synthesis. In: IEEE Region 10 Conference (TENCON), NOV 01-04, 2015, Macau, PEOPLES R CHINA. |
Publicador |
IEEE |
Relação |
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7373091 http://eprints.iisc.ernet.in/53336/ |
Palavras-Chave | #Electrical Engineering |
Tipo |
Conference Proceedings NonPeerReviewed |