Efficient post-processing techniques for speech enhancement
Data(s) |
2011
|
---|---|
Resumo |
We address the problem of speech enhancement in real-world noisy scenarios. We propose to solve the problem in two stages, the first comprising a generalized spectral subtraction technique, followed by a sequence of perceptually-motivated post-processing algorithms. The role of the post-processing algorithms is to compensate for the effects of noise as well as to suppress any artifacts created by the first-stage processing. The key post-processing mechanisms are aimed at suppressing musical noise and to enhance the formant structure of voiced speech as well as to denoise the linear-prediction residual. The parameter values in the techniques are fixed optimally by experimentally evaluating the enhancement performance as a function of the parameters. We used the Carnegie-Mellon university Arctic database for our experiments. We considered three real-world noise types: fan noise, car noise, and motorbike noise. The enhancement performance was evaluated by conducting listening experiments on 12 subjects. The listeners reported a clear improvement (MOS improvement of 0.5 on an average) over the noisy signal in the perceived quality (increase in the mean-opinion score (MOS)) for positive signal-to-noise-ratios (SNRs). For negative SNRs, however, the improvement was found to be marginal. |
Formato |
application/pdf |
Identificador |
http://eprints.iisc.ernet.in/46227/1/Nati_Conf_Comm_1_2011.pdf Ramakrishnan, Vyass and Pawan Kumar, G and Seelamantula, Chandra Sekhar and Shetty, Karthik (2011) Efficient post-processing techniques for speech enhancement. In: 2011 National Conference on Communications (NCC), 28-30 Jan. 2011, Bangalore. |
Publicador |
IEEE |
Relação |
http://dx.doi.org/10.1109/NCC.2011.5734780 http://eprints.iisc.ernet.in/46227/ |
Palavras-Chave | #Electrical Engineering |
Tipo |
Conference Proceedings PeerReviewed |