Speech Enhancement from Additive Noise and Channel Distortion - a Corpus-Based Approach


Autoria(s): Ming, Ji; Crookes, Daniel
Data(s)

2014

Resumo

This paper presents a new approach to single-channel speech enhancement involving both noise and channel distortion (i.e., convolutional noise). The approach is based on finding longest matching segments (LMS) from a corpus of clean, wideband speech. The approach adds three novel developments to our previous LMS research. First, we address the problem of channel distortion as well as additive noise. Second, we present an improved method for modeling noise. Third, we present an iterative algorithm for improved speech estimates. In experiments using speech recognition as a test with the Aurora 4 database, the use of our enhancement approach as a preprocessor for feature extraction significantly improved the performance of a baseline recognition system. In another comparison against conventional enhancement algorithms, both the PESQ and the segmental SNR ratings of the LMS algorithm were superior to the other methods for noisy speech enhancement. Index Terms: corpus-based speech model, longest matching segment, speech enhancement, speech recognition

Identificador

http://pure.qub.ac.uk/portal/en/publications/speech-enhancement-from-additive-noise-and-channel-distortion--a-corpusbased-approach(7295ac23-a83c-4b49-b1c8-b586056edda6).html

Idioma(s)

eng

Direitos

info:eu-repo/semantics/closedAccess

Fonte

Ming , J & Crookes , D 2014 , Speech Enhancement from Additive Noise and Channel Distortion - a Corpus-Based Approach . in Interspeech 2014 . pp. 2710-2714 .

Tipo

contributionToPeriodical