Multi-scale keypoints in V1 and face detection


Author(s): Rodrigues, J. M. F.; du Buf, J. M. H.
Date(s)

13/02/2009

13/02/2009

2005

Format

application/pdf

Identifier

1st International Symposium Brain, Vision and Artificial Intelligence (BV&AI 2005). - Naples, 19-21 October 2005. - LNCS 3704. - p. 205-214

AUT: JRO00913; DUB00865;

http://hdl.handle.net/10400.1/36

Language(s)

eng

Publisher

Naples

Relation

http://www.bib.ualg.pt/artigos/DocentesEST/RODMulKey.pdf

Rights

openAccess

Keywords

#Computer vision #Visual cortex

Type

article

Abstract

End-stopped cells in cortical area V1, which combine outputs of complex cells tuned to different orientations, serve to detect line and edge crossings (junctions) and points with a large curvature. In this paper we study the importance of the multi-scale keypoint representation, i.e. retinotopic keypoint maps which are tuned to different spatial frequencies (scale or Level-of-Detail). We show that this representation provides important information for Focus-of-Attention (FoA) and object detection. In particular, we show that hierarchically-structured saliency maps for FoA can be obtained, and that combinations over scales in conjunction with spatial symmetries can lead to face detection through grouping operators that deal with keypoints at the eyes, nose and mouth, especially when non-classical receptive field inhibition is employed. Although a face detector can be based on feedforward and feedback loops within area V1, such an operator must be embedded into dorsal and ventral data streams to and from higher areas for obtaining translation-, rotation- and scale-invariant face (object) detection.
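
The idea of combining keypoint maps over scales into a saliency map can be illustrated with a minimal sketch. This is not the authors' model, whose details the abstract does not give. It assumes binary keypoint maps of equal size, one per scale, and lets each keypoint vote for a square neighbourhood whose half-width grows with scale. The names `saliency_map` and `peaks` and the `radii` parameter are hypothetical and introduced only for illustration.

```python
def saliency_map(keypoint_maps, radii):
    """Accumulate binary keypoint maps over scales into one saliency map.

    keypoint_maps: list of equally sized 2D lists, non-zero where a
                   keypoint was detected at that scale.
    radii:         one non-negative int per map (assumption: the half-width
                   of the square region a keypoint at that scale votes for).
    """
    height = len(keypoint_maps[0])
    width = len(keypoint_maps[0][0])
    saliency = [[0] * width for _ in range(height)]
    for kmap, radius in zip(keypoint_maps, radii):
        for y in range(height):
            for x in range(width):
                if not kmap[y][x]:
                    continue
                # Each keypoint adds one vote to every pixel in its region,
                # clipped to the image borders.
                for yy in range(max(0, y - radius), min(height, y + radius + 1)):
                    for xx in range(max(0, x - radius), min(width, x + radius + 1)):
                        saliency[yy][xx] += 1
    return saliency


def peaks(saliency):
    """Return the (row, column) positions holding the maximum saliency value."""
    best = max(max(row) for row in saliency)
    return [(y, x)
            for y, row in enumerate(saliency)
            for x, value in enumerate(row)
            if value == best]
```

In this sketch, positions where keypoints persist over several scales collect the most votes, which is one simple way to rank candidate regions for Focus-of-Attention. A fuller model would also need the spatial-symmetry grouping and inhibition mentioned in the abstract.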