Noise robust voice activity detection using features extracted from the time-domain autocorrelation function


Autoria(s): Ghaemmaghami, Houman; Baker, Brendan J.; Vogt, Robert J.; Sridharan, Sridha
Data(s)

2010

Resumo

This paper presents a method of voice activity detection (VAD) for high noise scenarios, using a noise robust voiced speech detection feature. The developed method is based on the fusion of two systems. The first system utilises the maximum peak of the normalised time-domain autocorrelation function (MaxPeak). The second zone system uses a novel combination of cross-correlation and zero-crossing rate of the normalised autocorrelation to approximate a measure of signal pitch and periodicity (CrossCorr) that is hypothesised to be noise robust. The score outputs by the two systems are then merged using weighted sum fusion to create the proposed autocorrelation zero-crossing rate (AZR) VAD. Accuracy of AZR was compared to state of the art and standardised VAD methods and was shown to outperform the best performing system with an average relative improvement of 24.8% in half-total error rate (HTER) on the QUT-NOISE-TIMIT database created using real recordings from high-noise environments.

Formato

application/pdf

Identificador

http://eprints.qut.edu.au/40656/

Relação

http://eprints.qut.edu.au/40656/1/2011006688_H_Ghaemmaghami_ePrints.pdf

http://www.interspeech2010.org/

Ghaemmaghami, Houman, Baker, Brendan J., Vogt, Robert J., & Sridharan, Sridha (2010) Noise robust voice activity detection using features extracted from the time-domain autocorrelation function. In Proceedings of Interspeech 2010, Makuhari Messe International Convention Complex, Makuhari, Japan.

http://purl.org/au-research/grants/ARC/LP0991238

Direitos

Copyright 2010 [please consult the authors]

Fonte

Faculty of Built Environment and Engineering; Information Security Institute; School of Engineering Systems

Palavras-Chave #090609 Signal Processing #Voice Activity Detection #High Noise Autocorrelation #Zero-crossing Rate #Time-domain Analysis
Tipo

Conference Paper