Speech recognition in adverse environments using lip information


Author(s): Thambiratnam, D.; Wark, T.; Sridharan, S.; Chandran, V.
Date(s)

1997

Abstract

The performance of automatic speech recognition systems deteriorates in the presence of noise. One known solution is to incorporate video information into an existing acoustic speech recognition system. We investigate the performance of the individual acoustic and visual sub-systems and then examine different ways in which the two systems may be integrated. The system is to be implemented in real time on a Texas Instruments TMS320C80 DSP.

Format

application/pdf

Identifier

http://eprints.qut.edu.au/45587/

Relation

http://eprints.qut.edu.au/45587/1/45587a.pdf

DOI:10.1109/TENCON.1997.647279

Thambiratnam, D., Wark, T., Sridharan, S., & Chandran, V. (1997) Speech recognition in adverse environments using lip information. In TENCON '97. IEEE Region 10 Annual Conference. Speech and Image Technologies for Computing and Telecommunications, Proceedings of IEEE, 02-04 Dec 1997, Brisbane, Australia.

Rights

(c) 1997 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works for resale or redistribution to servers or lists, or reuse of any copyrighted components of this work in other works.

Source

Faculty of Built Environment and Engineering; School of Engineering Systems

Keywords

#acoustic noise #image recognition #speech recognition #video signal processing #Texas Instruments TMS320C80 DSP #acoustic speech recognition system #acoustic sub-system #adverse environments #automatic speech recognition systems #lip information #noise #performance #video information #visual sub-system

Type

Conference Paper