Speaker linking using complete-linkage clustering


Autoria(s): Ghaemmaghami, Houman; Dean, David; Sridharan, Sridha
Data(s)

03/12/2012

Resumo

Speaker diarization determines instances of the same speaker within a recording. Extending this task to a collection of recordings for linking together segments spoken by a unique speaker requires speaker linking. In this paper we propose a speaker linking system using linkage clustering and state-of-the-art speaker recognition techniques. We evaluate our approach against two baseline linking systems using agglomerative cluster merging (AC) and agglomerative clustering with model retraining (ACR). We demonstrate that our linking method, using complete-linkage clustering, provides a relative improvement of 20% and 29% in attribution error rate (AER), over the AC and ACR systems, respectively.

Identificador

http://eprints.qut.edu.au/57369/

Relação

http://clas.mq.edu.au/sst2012/

Ghaemmaghami, Houman, Dean, David, & Sridharan, Sridha (2012) Speaker linking using complete-linkage clustering. In SST 2012 14th Australasian International Conference on Speech Science and Technology, Macquarie University, Sydney, Australia .

http://purl.org/au-research/grants/ARC/LP0991238

Direitos

Copyright 2012 ASSTA

Fonte

School of Electrical Engineering & Computer Science; Science & Engineering Faculty

Palavras-Chave #080100 ARTIFICIAL INTELLIGENCE AND IMAGE PROCESSING #090000 ENGINEERING #speaker diarization #speaker linking #speaker attribution #complete-linkage #joint factor analysis #cross-likelihood ratio #agglomerative clustering
Tipo

Conference Paper