Robust automatic speaker linking and attribution


Autoria(s): Ghaemmaghami, Houman
Data(s)

2013

Resumo

This research makes a major contribution which enables efficient searching and indexing of large archives of spoken audio based on speaker identity. It introduces a novel technique dubbed as “speaker attribution” which is the task of automatically determining ‘who spoke when?’ in recordings and then automatically linking the unique speaker identities within each recording across multiple recordings. The outcome of the research will also have significant impact in improving the performance of automatic speech recognition systems through the extracted speaker identities.

Formato

application/pdf

Identificador

http://eprints.qut.edu.au/60832/

Publicador

Queensland University of Technology

Relação

http://eprints.qut.edu.au/60832/4/Houman_Ghaemmaghami_Thesis.pdf

Ghaemmaghami, Houman (2013) Robust automatic speaker linking and attribution. PhD thesis, Queensland University of Technology.

Fonte

School of Electrical Engineering & Computer Science; Institute for Future Environments; Science & Engineering Faculty

Palavras-Chave #speaker attribution #speaker linking #speaker diarization #complete linkage clustering #cross likelihood ratio #joint factor analysis #agglomerative clustering #cross show diarization
Tipo

Thesis