Biblioteca Digital

**Autoria(s):** Narayanamurthy, Praneeth Kurpad; Seelamantula, Chandra Sekhar
Data(s)	2015
Resumo	Oversmoothing of speech parameter trajectories is one of the causes for quality degradation of HMM-based speech synthesis. Various methods have been proposed to overcome this effect, the most recent ones being global variance (GV) and modulation-spectrum-based post-filter (MSPF). However, there is still a significant quality gap between natural and synthesized speech. In this paper, we propose a two-fold post-filtering technique to alleviate to a certain extent the oversmoothing of spectral and excitation parameter trajectories of HMM-based speech synthesis. For the spectral parameters, we propose a sparse coding-based post-filter to match the trajectories of synthetic speech to that of natural speech, and for the excitation trajectory, we introduce a perceptually motivated post-filter. Experimental evaluations show quality improvement compared with existing methods.
Formato	application/pdf
Identificador	http://eprints.iisc.ernet.in/53336/1/IEEE_Reg_Con_2015.pdf Narayanamurthy, Praneeth Kurpad and Seelamantula, Chandra Sekhar (2015) Dictionary-Learning-Based Post-Filter for HMM-Based Speech Synthesis. In: IEEE Region 10 Conference (TENCON), NOV 01-04, 2015, Macau, PEOPLES R CHINA.
Publicador	IEEE
Relação	http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7373091 http://eprints.iisc.ernet.in/53336/
Palavras-Chave	#Electrical Engineering
Tipo	Conference Proceedings NonPeerReviewed

Acesso ao item digital