Multi Pattern Dynamic Time Warping For Automatic Speech Recognition


Author(s): Nair, Nishanth Ulhas; Sreenivas, TV
Date(s)

2008

Abstract

We address the problem of jointly using multiple noisy speech patterns for automatic speech recognition (ASR), given that they come from the same class. If the user utters a word K times, the ASR system should use the information content in all K patterns of the word simultaneously and improve its recognition accuracy over single-pattern speech recognition. To address this problem, we recently proposed a Multi Pattern Dynamic Time Warping (MPDTW) algorithm that aligns the K patterns by finding the least distortion path between them. A Constrained Multi Pattern Viterbi algorithm was used on this aligned path for isolated word recognition (IWR). In this paper, we explore the possibility of using only the MPDTW algorithm for IWR. We also study the properties of the MPDTW algorithm. We show that using only 2 noisy test patterns (10 percent burst noise at -5 dB SNR) reduces the noisy speech recognition error rate by 37.66 percent compared with single-pattern recognition using the Dynamic Time Warping algorithm.
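
For orientation, the single-pattern baseline named in the abstract can be sketched as follows. This is a minimal illustration of classic two-pattern Dynamic Time Warping used for template-based isolated word recognition, not the authors' MPDTW algorithm. The function names `dtw_distance` and `recognize`, the Euclidean local distortion, the three allowed path steps, and the toy one-dimensional features are all assumptions made for this example.

```python
import math


def dtw_distance(a, b):
    """Least accumulated distortion between two sequences of feature vectors."""
    n, m = len(a), len(b)
    # cost[i][j] holds the best accumulated distortion aligning a[:i] with b[:j].
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            # Local distortion: Euclidean distance between frames (assumed here).
            d = math.dist(a[i - 1], b[j - 1])
            # Allowed predecessors: vertical, horizontal, and diagonal steps.
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


def recognize(test, templates):
    """Return the word whose reference template is closest to the test pattern."""
    return min(templates, key=lambda word: dtw_distance(test, templates[word]))


# Toy reference templates and one time-stretched test utterance (hypothetical data).
templates = {
    "yes": [(0.0,), (1.0,), (2.0,)],
    "no": [(5.0,), (5.0,), (4.0,)],
}
test = [(0.0,), (0.0,), (1.0,), (2.0,)]
best = recognize(test, templates)
```

MPDTW, as the abstract describes it, generalizes this idea by aligning K patterns jointly along one least distortion path rather than comparing one test pattern with one template at a time.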

Format

application/pdf

Identifier

http://eprints.iisc.ernet.in/21154/1/11111111.pdf

Nair, Nishanth Ulhas and Sreenivas, TV (2008) Multi Pattern Dynamic Time Warping For Automatic Speech Recognition. In: IEEE Region 10 Conference (TENCON 2008), Nov 19-21, 2008, Hyderabad, India, pp. 2435-2440.

Publisher

IEEE

Relation

http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=4766617&tag=1

http://eprints.iisc.ernet.in/21154/

Keywords

#Electrical Communication Engineering

Type

Conference Paper

PeerReviewed