Topic dependent language modelling for spoken term detection


Autoria(s): Kalantari, Shahram; Dean, David; Sridharan, Sridha
Data(s)

2014

Resumo

This paper investigates the effect of topic dependent language models (TDLM) on phonetic spoken term detection (STD) using dynamic match lattice spotting (DMLS). Phonetic STD consists of two steps: indexing and search. The accuracy of indexing audio segments into phone sequences using phone recognition methods directly affects the accuracy of the final STD system. If the topic of a document in known, recognizing the spoken words and indexing them to an intermediate representation is an easier task and consequently, detecting a search word in it will be more accurate and robust. In this paper, we propose the use of TDLMs in the indexing stage to improve the accuracy of STD in situations where the topic of the audio document is known in advance. It is shown that using TDLMs instead of the traditional general language model (GLM) improves STD performance according to figure of merit (FOM) criteria.

Formato

application/pdf

Identificador

http://eprints.qut.edu.au/75760/

Relação

http://eprints.qut.edu.au/75760/1/Eusipco.pdf

Kalantari, Shahram, Dean, David, & Sridharan, Sridha (2014) Topic dependent language modelling for spoken term detection. In Europian Signal Processing Conference (EUSIPCO 2014), 1-5 September 2014, Lisbon, Portugal. (Unpublished)

Direitos

Copyright 2014 The Author(s)

Fonte

School of Electrical Engineering & Computer Science; Science & Engineering Faculty; Smart Services CRC

Palavras-Chave #080100 ARTIFICIAL INTELLIGENCE AND IMAGE PROCESSING #089900 OTHER INFORMATION AND COMPUTING SCIENCES #Spoken term detection #Language modelling #Audio indexing
Tipo

Conference Paper