JFA based Speaker Recognition using Delta-Phase and MFCC features


Author(s): Kanagasundaram, Ahilan; Dean, David; Sridharan, Sridha
Date(s)

04/12/2012

Abstract

This paper investigates the use of mel-frequency delta-phase (MFDP) features in comparison to, and in fusion with, traditional mel-frequency cepstral coefficient (MFCC) features within joint factor analysis (JFA) speaker verification. MFCC features, commonly used in speaker recognition systems, are derived purely from the magnitude spectrum, with the phase spectrum completely discarded. In this paper, we investigate whether features derived from the phase spectrum can provide additional speaker-discriminant information to the traditional MFCC approach in a JFA-based speaker verification system. Results are presented which provide a comparison of MFCC-only, MFDP-only and score fusion of the two approaches within a JFA speaker verification approach. Based upon the results presented using the NIST 2008 Speaker Recognition Evaluation (SRE) dataset, we believe that, while MFDP features alone cannot compete with MFCC features, MFDP can provide complementary information that results in improved speaker verification performance when both approaches are combined in score fusion, particularly in the case of shorter utterances.

Format

application/pdf

Identifier

http://eprints.qut.edu.au/55511/

Relation

http://eprints.qut.edu.au/55511/1/SST_2012_paper.pdf

http://clas.mq.edu.au/sst2012/

Kanagasundaram, Ahilan, Dean, David, & Sridharan, Sridha (2012) JFA based Speaker Recognition using Delta-Phase and MFCC features. In SST 2012 14th Australasian International Conference on Speech Science and Technology, Macquarie University, Sydney, Australia.

Rights

Copyright 2012 Please consult the authors

Source

School of Electrical Engineering & Computer Science; Information Security Institute; Science & Engineering Faculty

Keywords

#Speaker verification #MFCC features #JFA #Delta-phase

Type

Conference Paper