Language identification using parallel sub-word recognition


Autoria(s): Jayram, AKVS; Ramasubramanian, V; Sreenivas, TV
Data(s)

10/04/2003

Resumo

Parallel sub-word recognition (PSWR) is a new model that has been proposed for language identification (LID) which does not need elaborate phonetic labeling of the speech data in a foreign language. The new approach performs a front-end tokenization in terms of sub-word units which are designed by automatic segmentation, segment clustering and segment HMM modeling. We develop PSWR based LID in a framework similar to the parallel phone recognition (PPR) approach in the literature. This includes a front-end tokenizer and a back-end language model, for each language to be identified. Considering various combinations of the statistical evaluation scores, it is found that PSWR can perform as well as PPR, even with broad acoustic sub-word tokenization, thus making it an efficient alternative to the PPR system.

Formato

application/pdf

Identificador

http://eprints.iisc.ernet.in/43785/1/LANGUAGE.pdf

Jayram, AKVS and Ramasubramanian, V and Sreenivas, TV (2003) Language identification using parallel sub-word recognition. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, APR 06-10, 2003, New York.

Publicador

IEEE

Relação

http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=1198709&tag=1

http://eprints.iisc.ernet.in/43785/

Palavras-Chave #Electrical Communication Engineering
Tipo

Conference Paper

PeerReviewed