Language identification using parallel sub-word recognition
Data(s) |
10/04/2003
|
---|---|
Resumo |
Parallel sub-word recognition (PSWR) is a new model that has been proposed for language identification (LID) which does not need elaborate phonetic labeling of the speech data in a foreign language. The new approach performs a front-end tokenization in terms of sub-word units which are designed by automatic segmentation, segment clustering and segment HMM modeling. We develop PSWR based LID in a framework similar to the parallel phone recognition (PPR) approach in the literature. This includes a front-end tokenizer and a back-end language model, for each language to be identified. Considering various combinations of the statistical evaluation scores, it is found that PSWR can perform as well as PPR, even with broad acoustic sub-word tokenization, thus making it an efficient alternative to the PPR system. |
Formato |
application/pdf |
Identificador |
http://eprints.iisc.ernet.in/43785/1/LANGUAGE.pdf Jayram, AKVS and Ramasubramanian, V and Sreenivas, TV (2003) Language identification using parallel sub-word recognition. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, APR 06-10, 2003, New York. |
Publicador |
IEEE |
Relação |
http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=1198709&tag=1 http://eprints.iisc.ernet.in/43785/ |
Palavras-Chave | #Electrical Communication Engineering |
Tipo |
Conference Paper PeerReviewed |