Lip detection for audio-visual speech recognition in-car environment


Autoria(s): Navarathna, Rajitha; Lucey, Patrick J.; Dean, David B.; Fookes, Clinton B.; Sridharan, Sridha
Contribuinte(s)

Boashash, Boualem

Hamila, Ridha

Salleh, Sheikh Hussain Shaikh

Bakar, syed Abd Rahman Abu

Data(s)

13/05/2010

Resumo

Acoustically, car cabins are extremely noisy and as a consequence audio-only, in-car voice recognition systems perform poorly. As the visual modality is immune to acoustic noise, using the visual lip information from the driver is seen as a viable strategy in circumventing this problem by using audio visual automatic speech recognition (AVASR). However, implementing AVASR requires a system being able to accurately locate and track the drivers face and lip area in real-time. In this paper we present such an approach using the Viola-Jones algorithm. Using the AVICAR [1] in-car database, we show that the Viola- Jones approach is a suitable method of locating and tracking the driver’s lips despite the visual variability of illumination and head pose for audio-visual speech recognition system.

Formato

application/pdf

Identificador

http://eprints.qut.edu.au/32879/

Publicador

IEEE

Relação

http://eprints.qut.edu.au/32879/1/c32879.pdf

DOI:10.1109/ISSPA.2010.5605429

Navarathna, Rajitha, Lucey, Patrick J., Dean, David B., Fookes, Clinton B., & Sridharan, Sridha (2010) Lip detection for audio-visual speech recognition in-car environment. In Boashash, Boualem, Hamila, Ridha, Salleh, Sheikh Hussain Shaikh, & Bakar, syed Abd Rahman Abu (Eds.) Proceedings of 10th International Conference on Information Science, Signal Processing and their Applications, IEEE, Renaissance Hotel, Kuala Lumpur, pp. 598-601.

Direitos

Copyright 2010 IEEE.

Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

Fonte

Faculty of Built Environment and Engineering; Information Security Institute; School of Engineering Systems

Palavras-Chave #080106 Image Processing #AVASR #AVICAR Database #Viola-Jones Algorithm
Tipo

Conference Paper