JFA based Speaker Recognition using Delta-Phase and MFCC features
Data(s) |
04/12/2012
|
---|---|
Resumo |
This paper investigates the use of mel-frequency deltaphase (MFDP) features in comparison to, and in fusion with, traditional mel-frequency cepstral coefficient (MFCC) features within joint factor analysis (JFA) speaker verification. MFCC features, commonly used in speaker recognition systems, are derived purely from the magnitude spectrum, with the phase spectrum completely discarded. In this paper, we investigate if features derived from the phase spectrum can provide additional speaker discriminant information to the traditional MFCC approach in a JFA based speaker verification system. Results are presented which provide a comparison of MFCC-only, MFDPonly and score fusion of the two approaches within a JFA speaker verification approach. Based upon the results presented using the NIST 2008 Speaker Recognition Evaluation (SRE) dataset, we believe that, while MFDP features alone cannot compete with MFCC features, MFDP can provide complementary information that result in improved speaker verification performance when both approaches are combined in score fusion, particularly in the case of shorter utterances. |
Formato |
application/pdf |
Identificador | |
Relação |
http://eprints.qut.edu.au/55511/1/SST_2012_paper.pdf http://clas.mq.edu.au/sst2012/ Kanagasundaram, Ahilan, Dean, David, & Sridharan, Sridha (2012) JFA based Speaker Recognition using Delta-Phase and MFCC features. In SST 2012 14th Australasian International Conference on Speech Science and Technology, Macquarie University, Sydney, Australia. |
Direitos |
Copyright 2012 Please consult the authors |
Fonte |
School of Electrical Engineering & Computer Science; Information Security Institute; Science & Engineering Faculty |
Palavras-Chave | #Speaker verification #MFCC features #JFA #Delta-phase |
Tipo |
Conference Paper |