The QUT-NOISE-SRE protocol for the evaluation of noisy speaker recognition


Author(s): Dean, David B.; Kanagasundaram, Ahilan; Ghaemmaghami, Houman; Rahman, Md Hafizur; Sridharan, Sridha
Date(s)

10/09/2015

Abstract

The QUT-NOISE-SRE protocol is designed to mix the large QUT-NOISE database, consisting of over 10 hours of background noise, collected across 10 unique locations covering 5 common noise scenarios, with commonly used speaker recognition datasets such as Switchboard, Mixer and the speaker recognition evaluation (SRE) datasets provided by NIST. By allowing common, clean, speech corpora to be mixed with a wide variety of noise conditions, environmental reverberant responses, and signal-to-noise ratios, this protocol provides a solid basis for the development, evaluation and benchmarking of robust speaker recognition algorithms, and is freely available to download alongside the QUT-NOISE database. In this work, we use the QUT-NOISE-SRE protocol to evaluate a state-of-the-art PLDA i-vector speaker recognition system, demonstrating the importance of designing voice-activity-detection front-ends specifically for speaker recognition, rather than aiming for perfect coherence with the true speech/non-speech boundaries.
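
The core operation the abstract describes, mixing clean speech with background noise at a chosen signal-to-noise ratio, can be illustrated with a minimal sketch. This is not the QUT-NOISE-SRE protocol's own code: the function names `mean_power` and `mix_at_snr` are hypothetical, the signals are assumed to be plain lists of samples at the same sampling rate, noise is simply looped when it is shorter than the speech, and reverberation is not modelled.

```python
import math


def mean_power(signal):
    """Average power (mean of squared samples) of a list of samples."""
    return sum(x * x for x in signal) / len(signal)


def mix_at_snr(speech, noise, snr_db):
    """Add noise to speech so the speech-to-noise power ratio is snr_db.

    Illustrative assumption: noise is looped (not randomly offset) to
    cover the full length of the speech signal.
    """
    if not speech or not noise:
        raise ValueError("speech and noise must be non-empty")

    # Repeat the noise as needed so it spans the whole utterance.
    noise_seg = [noise[i % len(noise)] for i in range(len(speech))]

    p_speech = mean_power(speech)
    p_noise = mean_power(noise_seg)
    if p_speech == 0 or p_noise == 0:
        raise ValueError("speech and noise must have non-zero power")

    # SNR_dB = 10 * log10(p_speech / (gain^2 * p_noise)), solved for gain.
    gain = math.sqrt(p_speech / (p_noise * 10 ** (snr_db / 10)))

    return [s + gain * n for s, n in zip(speech, noise_seg)]
```

Calling `mix_at_snr(speech, noise, 5.0)` returns a signal of the same length as `speech` in which the scaled noise sits 5 dB below the speech in average power; computing power over the whole utterance, rather than over speech-active regions only, is a simplification made here.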

Format

application/pdf

Identifier

http://eprints.qut.edu.au/85240/

Publisher

International Speech Communication Association

Relation

http://eprints.qut.edu.au/85240/1/The%20QUT-NOISE-SRE%20Protocol%20for%20the%20Evaluation%20of%20Noisy%20Speaker%20Recognition.pdf

http://www.isca-speech.org/archive/interspeech_2015/i15_3456.html

Dean, David B., Kanagasundaram, Ahilan, Ghaemmaghami, Houman, Rahman, Md Hafizur, & Sridharan, Sridha (2015) The QUT-NOISE-SRE protocol for the evaluation of noisy speaker recognition. In Proceedings of the 16th Annual Conference of the International Speech Communication Association, Interspeech 2015, International Speech Communication Association, Dresden, Germany, pp. 3456-3460.

http://purl.org/au-research/grants/ARC/LP130100110

Rights

Copyright 2015 [Please consult the author]

Source

School of Electrical Engineering & Computer Science; Faculty of Science and Technology; Institute for Creative Industries and Innovation; Information Security Institute

Keywords #090609 Signal Processing #noisy speaker verification #speech databases #evaluation protocols

Type

Conference Paper