Biblioteca Digital

Speaker identification using higher order spectral phase features and their effectiveness vis-a-vis Mel-Cepstral features

**Autoria(s):** Chandran, Vinod; Ning, Daryl; Sridharan, Subramanian
Contribuinte(s)	Zhang, D Jain, A
Data(s)	2004
Resumo	The effectiveness of higher-order spectral (HOS) phase features in speaker recognition is investigated by comparison with Mel Cepstral features on the same speech data. HOS phase features retain phase information from the Fourier spectrum unlikeMel–frequency Cepstral coefficients (MFCC). Gaussian mixture models are constructed from Mel– Cepstral features and HOS features, respectively, for the same data from various speakers in the Switchboard telephone Speech Corpus. Feature clusters, model parameters and classification performance are analyzed. HOS phase features on their own provide a correct identification rate of about 97% on the chosen subset of the corpus. This is the same level of accuracy as provided by MFCCs. Cluster plots and model parameters are compared to show that HOS phase features can provide complementary information to better discriminate between speakers.
Identificador	http://eprints.qut.edu.au/24302/
Publicador	Springer
Relação	DOI:10.1007/978-3-540-25948-0_84 Chandran, Vinod, Ning, Daryl, & Sridharan, Subramanian (2004) Speaker identification using higher order spectral phase features and their effectiveness vis-a-vis Mel-Cepstral features. In Zhang, D & Jain, A (Eds.) Biometric Authentication First International Conference (ICBA 2004) Proceedings, 15-17 July 2004, Hong Kong, China.
Fonte	Faculty of Built Environment and Engineering
Palavras-Chave	#080109 Pattern Recognition and Data Mining
Tipo	Conference Paper

Acesso ao item digital