An Investigation Into Features For Multi-View Lipreading


Author(s): Pass, Adrian; Zhang, Jianguo; Stewart, Darryl
Date(s)

01/09/2010

Abstract

This paper presents the first results showing the effect of speaker head pose angle on automatic lip-reading performance over a wide range of closely spaced angles. We analyse the effect of head pose on the features themselves and show that selecting coefficients with minimum variance with respect to pose angle improves recognition performance when the training and test pose angles differ. Experiments are conducted using the initial phase of a unique multi-view audio-visual database designed specifically for research and development of pose-invariant lip-reading systems. First, we show that it is the higher-order horizontal spatial frequency components that become most detrimental as the pose deviates. Second, we assess the performance of different feature selection masks across a range of pose angles, including a new mask based on Minimum Cross-Pose Variance coefficients. We report a relative improvement of 50% in Word Error Rate when using our selection mask rather than a common energy-based selection during profile-view lip-reading.
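
The contrast between an energy-based selection mask and a minimum cross-pose variance mask can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not define either mask precisely, so the definitions here are assumptions. Energy is taken as the mean squared coefficient value over all poses and samples, and cross-pose variance as the variance of each coefficient's per-pose mean. The function names, the nested-list layout features[pose][sample][coefficient], and the toy values are all hypothetical.

```python
from statistics import fmean, pvariance


def top_k_mask(scores, k, largest):
    """Return a boolean mask that keeps the k highest (or lowest) scores."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=largest)
    keep = set(order[:k])
    return [i in keep for i in range(len(scores))]


def energy_mask(features, k):
    """Keep the k coefficients with the highest mean energy (assumed definition)."""
    n_coeffs = len(features[0][0])
    energy = [
        fmean(vec[c] ** 2 for pose in features for vec in pose)
        for c in range(n_coeffs)
    ]
    return top_k_mask(energy, k, largest=True)


def min_cross_pose_variance_mask(features, k):
    """Keep the k coefficients whose per-pose mean varies least across poses
    (assumed definition of minimum cross-pose variance)."""
    n_coeffs = len(features[0][0])
    variance = []
    for c in range(n_coeffs):
        pose_means = [fmean(vec[c] for vec in pose) for pose in features]
        variance.append(pvariance(pose_means))
    return top_k_mask(variance, k, largest=False)


# Hypothetical toy data: 3 poses, 2 samples per pose, 3 coefficients.
# Coefficient 0 carries the most energy but drifts strongly with pose.
toy = [
    [[10.0, 1.0, 0.5], [10.0, 1.2, 0.4]],  # pose 0
    [[5.0, 1.1, 0.5], [5.0, 1.0, 0.6]],    # pose 1
    [[0.0, 1.0, 0.4], [0.0, 1.1, 0.5]],    # pose 2
]

print(energy_mask(toy, 2))                   # [True, True, False]
print(min_cross_pose_variance_mask(toy, 2))  # [False, True, True]
```

In this toy case the energy criterion keeps the pose-sensitive coefficient 0, while the variance criterion discards it in favour of the two coefficients that stay stable across poses.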

Format

application/pdf

Identifier

http://pure.qub.ac.uk/portal/en/publications/an-investigation-into-features-for-multiview-lipreading(671f3074-a04d-4dbd-822f-9d3924bcb281).html

http://dx.doi.org/10.1109/ICIP.2010.5650963

http://pure.qub.ac.uk/ws/files/3713080/AN%20INVESTIGATION%20INTO%20FEATURES%20FOR%20MULTI-VIEW%20LIPREADING.pdf

Language(s)

eng

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

Rights

info:eu-repo/semantics/openAccess

Source

Pass, A., Zhang, J. & Stewart, D. 2010, An Investigation Into Features For Multi-View Lipreading. In 2010 17th IEEE International Conference on Image Processing (ICIP). Institute of Electrical and Electronics Engineers (IEEE), pp. 2417-2420, 2010 IEEE 17th International Conference on Image Processing, Hong Kong, Hong Kong, 26-29 September. DOI: 10.1109/ICIP.2010.5650963

Keywords

#/dk/atira/pure/subjectarea/asjc/1700/1707 #Computer Vision and Pattern Recognition #/dk/atira/pure/subjectarea/asjc/1700/1711 #Signal Processing #/dk/atira/pure/subjectarea/asjc/1700/1712 #Software

Type

contributionToPeriodical