Human-computer interaction based on visual hand-gesture recognition using volumetric spatiograms of local binary patterns


Autoria(s): Maqueda, Ana I.; Blanco Adán, Carlos Roberto del; García Santos, Narciso; Jaureguizar Núñez, Fernando
Data(s)

01/12/2015

31/12/1969

Resumo

A more natural, intuitive, user-friendly, and less intrusive Human–Computer interface for controlling an application by executing hand gestures is presented. For this purpose, a robust vision-based hand-gesture recognition system has been developed, and a new database has been created to test it. The system is divided into three stages: detection, tracking, and recognition. The detection stage searches in every frame of a video sequence potential hand poses using a binary Support Vector Machine classifier and Local Binary Patterns as feature vectors. These detections are employed as input of a tracker to generate a spatio-temporal trajectory of hand poses. Finally, the recognition stage segments a spatio-temporal volume of data using the obtained trajectories, and compute a video descriptor called Volumetric Spatiograms of Local Binary Patterns (VS-LBP), which is delivered to a bank of SVM classifiers to perform the gesture recognition. The VS-LBP is a novel video descriptor that constitutes one of the most important contributions of the paper, which is able to provide much richer spatio-temporal information than other existing approaches in the state of the art with a manageable computational cost. Excellent results have been obtained outperforming other approaches of the state of the art.

Formato

application/pdf

Identificador

http://oa.upm.es/40737/

Idioma(s)

eng

Publicador

E.T.S.I. Telecomunicación (UPM)

Relação

http://oa.upm.es/40737/1/INVE_MEM_2015_220809.pdf

http://www.sciencedirect.com/science/article/pii/S1077314215001629

TEC2010-20412

TEC2013-48453

info:eu-repo/semantics/altIdentifier/doi/http://dx.doi.org/10.1016/j.cviu.2015.07.009

Direitos

http://creativecommons.org/licenses/by-nc-nd/3.0/es/

info:eu-repo/semantics/embargoedAccess

Fonte

Computer Vision and Image Understanding, ISSN 1077-3142, 2015-12, Vol. 141

Palavras-Chave #Telecomunicaciones
Tipo

info:eu-repo/semantics/article

Artículo

PeerReviewed