On robust face recognition via sparse encoding : the good, the bad, and the ugly


Autoria(s): Wong, Yongkang; Harandi, Mehrtash T.; Sanderson, Conrad
Data(s)

07/03/2013

Resumo

In the field of face recognition, Sparse Representation (SR) has received considerable attention during the past few years. Most of the relevant literature focuses on holistic descriptors in closed-set identification applications. The underlying assumption in SR-based methods is that each class in the gallery has sufficient samples and the query lies on the subspace spanned by the gallery of the same class. Unfortunately, such assumption is easily violated in the more challenging face verification scenario, where an algorithm is required to determine if two faces (where one or both have not been seen before) belong to the same person. In this paper, we first discuss why previous attempts with SR might not be applicable to verification problems. We then propose an alternative approach to face verification via SR. Specifically, we propose to use explicit SR encoding on local image patches rather than the entire face. The obtained sparse signals are pooled via averaging to form multiple region descriptors, which are then concatenated to form an overall face descriptor. Due to the deliberate loss spatial relations within each region (caused by averaging), the resulting descriptor is robust to misalignment & various image deformations. Within the proposed framework, we evaluate several SR encoding techniques: l1-minimisation, Sparse Autoencoder Neural Network (SANN), and an implicit probabilistic technique based on Gaussian Mixture Models. Thorough experiments on AR, FERET, exYaleB, BANCA and ChokePoint datasets show that the proposed local SR approach obtains considerably better and more robust performance than several previous state-of-the-art holistic SR methods, in both verification and closed-set identification problems. The experiments also show that l1-minimisation based encoding has a considerably higher computational than the other techniques, but leads to higher recognition rates.

Formato

application/pdf

Identificador

http://eprints.qut.edu.au/57987/

Publicador

Cornell University Library / arXiv

Relação

http://eprints.qut.edu.au/57987/1/wong_face_recognition_sparse_arxiv_1303_1624.pdf

http://arxiv.org/abs/1303.1624

Wong, Yongkang, Harandi, Mehrtash T., & Sanderson, Conrad (2013) On robust face recognition via sparse encoding : the good, the bad, and the ugly. Cornell University Library / arXiv, Ithaca, NY.

Direitos

The copyright is with the authors: Yongkang Wong, Mehrtash Harandi, and Conrad Sanderson.

The authors hereby grant a non-exclusive license to distribute the article in print and electronic forms, provided that the article is not changed. The authors do not provide any warranty as to the contents of the article. See also http://arxiv.org/licenses/nonexclusive-distrib/1.0/

Fonte

Science & Engineering Faculty

Palavras-Chave #010200 APPLIED MATHEMATICS #010401 Applied Statistics #010406 Stochastic Analysis and Modelling #080100 ARTIFICIAL INTELLIGENCE AND IMAGE PROCESSING #080104 Computer Vision #080106 Image Processing #080109 Pattern Recognition and Data Mining #080602 Computer-Human Interaction #170205 Neurocognitive Patterns and Neural Networks
Tipo

Report