Biblioteca Digital

Visual front-end wars : Viola-Jones face detector vs Fourier Lucas-Kanade

**Autoria(s):** Kalantari, Shahram; Navarathna, Rajitha; Dean, David B.; Sridharan, Sridha
Contribuinte(s)	Denis , Burnham Jonas , Beskow
Data(s)	2013
Resumo	The performance of visual speech recognition (VSR) systems are significantly influenced by the accuracy of the visual front-end. The current state-of-the-art VSR systems use off-the-shelf face detectors such as Viola- Jones (VJ) which has limited reliability for changes in illumination and head poses. For a VSR system to perform well under these conditions, an accurate visual front end is required. This is an important problem to be solved in many practical implementations of audio visual speech recognition systems, for example in automotive environments for an efficient human-vehicle computer interface. In this paper, we re-examine the current state-of-the-art VSR by comparing off-the-shelf face detectors with the recently developed Fourier Lucas-Kanade (FLK) image alignment technique. A variety of image alignment and visual speech recognition experiments are performed on a clean dataset as well as with a challenging automotive audio-visual speech dataset. Our results indicate that the FLK image alignment technique can significantly outperform off-the shelf face detectors, but requires frequent fine-tuning.
Formato	application/pdf
Identificador	http://eprints.qut.edu.au/62749/
Relação	http://eprints.qut.edu.au/62749/1/IS12_paper_template.pdf Kalantari, Shahram, Navarathna, Rajitha, Dean, David B., & Sridharan, Sridha (2013) Visual front-end wars : Viola-Jones face detector vs Fourier Lucas-Kanade. In Denis , Burnham & Jonas , Beskow (Eds.) International Conference on Auditory Visual Speech Processing 2013, 29 August - 1 September 2013, Ternélia resort Le Pré du Lac, Annecy, France.
Direitos	Copyright 2013 [please consult the author]
Fonte	School of Electrical Engineering & Computer Science; Science & Engineering Faculty
Palavras-Chave	#080000 INFORMATION AND COMPUTING SCIENCES #Visual Front-ends #Viola-Jones #Fourier Lucas-Kanade #Visual Speech Recognition
Tipo	Conference Paper

Acesso ao item digital