Efficient post-processing techniques for speech enhancement


Autoria(s): Ramakrishnan, Vyass; Pawan Kumar, G; Seelamantula, Chandra Sekhar; Shetty, Karthik
Data(s)

2011

Resumo

We address the problem of speech enhancement in real-world noisy scenarios. We propose to solve the problem in two stages, the first comprising a generalized spectral subtraction technique, followed by a sequence of perceptually-motivated post-processing algorithms. The role of the post-processing algorithms is to compensate for the effects of noise as well as to suppress any artifacts created by the first-stage processing. The key post-processing mechanisms are aimed at suppressing musical noise and to enhance the formant structure of voiced speech as well as to denoise the linear-prediction residual. The parameter values in the techniques are fixed optimally by experimentally evaluating the enhancement performance as a function of the parameters. We used the Carnegie-Mellon university Arctic database for our experiments. We considered three real-world noise types: fan noise, car noise, and motorbike noise. The enhancement performance was evaluated by conducting listening experiments on 12 subjects. The listeners reported a clear improvement (MOS improvement of 0.5 on an average) over the noisy signal in the perceived quality (increase in the mean-opinion score (MOS)) for positive signal-to-noise-ratios (SNRs). For negative SNRs, however, the improvement was found to be marginal.

Formato

application/pdf

Identificador

http://eprints.iisc.ernet.in/46227/1/Nati_Conf_Comm_1_2011.pdf

Ramakrishnan, Vyass and Pawan Kumar, G and Seelamantula, Chandra Sekhar and Shetty, Karthik (2011) Efficient post-processing techniques for speech enhancement. In: 2011 National Conference on Communications (NCC), 28-30 Jan. 2011, Bangalore.

Publicador

IEEE

Relação

http://dx.doi.org/10.1109/NCC.2011.5734780

http://eprints.iisc.ernet.in/46227/

Palavras-Chave #Electrical Engineering
Tipo

Conference Proceedings

PeerReviewed