Robust Whisper Activity Detection Using Long-Term Log Energy Variation of Sub-Band Signal


Autoria(s): Meenakshi, Nisha G; Ghosh, Prasanta Kumar
Data(s)

2015

Resumo

The goal in the whisper activity detection (WAD) is to find the whispered speech segments in a given noisy recording of whispered speech. Since whispering lacks the periodic glottal excitation, it resembles an unvoiced speech. This noise-like nature of the whispered speech makes WAD a more challenging task compared to a typical voice activity detection (VAD) problem. In this paper, we propose a feature based on the long term variation of the logarithm of the short-time sub-band signal energy for WAD. We also propose an automatic sub-band selection algorithm to maximally discriminate noisy whisper from noise. Experiments with eight noise types in four different signal-to-noise ratio (SNR) conditions show that, for most of the noises, the performance of the proposed WAD scheme is significantly better than that of the existing VAD schemes and whisper detection schemes when used for WAD.

Formato

application/pdf

Identificador

http://eprints.iisc.ernet.in/51906/1/IEEE_Sig_Pro_Let_22-11_2015.pdf

Meenakshi, Nisha G and Ghosh, Prasanta Kumar (2015) Robust Whisper Activity Detection Using Long-Term Log Energy Variation of Sub-Band Signal. In: IEEE SIGNAL PROCESSING LETTERS, 22 (11). pp. 1859-1863.

Publicador

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Relação

http://dx.doi.org/10.1109/LSP.2015.2439514

http://eprints.iisc.ernet.in/51906/

Palavras-Chave #Electrical Engineering
Tipo

Journal Article

PeerReviewed