Human action recognition based on 3D SIFT and LDA model


Autoria(s): Liu, Ping; Wang, Jin; She, Mary; Liu, Honghai
Contribuinte(s)

[Unknown]

Data(s)

01/01/2011

Resumo

How to recognize human action from videos captured by modern cameras efficiently and effectively is a challenge in real applications. Traditional methods which need professional analysts are facing a bottleneck because of their shortcomings. To cope with the disadvantage, methods based on computer vision techniques, without or with only a few human interventions, have been proposed to analyse human actions in videos automatically. This paper provides a method combining the three dimensional Scale Invariant Feature Transform (SIFT) detector and the Latent Dirichlet Allocation (LDA) model for human motion analysis. To represent videos effectively and robustly, we extract the 3D SIFT descriptor around each interest point, which is sampled densely from 3D Space-time video volumes. After obtaining the representation of each video frame, the LDA model is adopted to discover the underlying structure-the categorization of human actions in the collection of videos. Public available standard datasets are used to test our method. The concluding part discusses the research challenges and future directions.

Identificador

http://hdl.handle.net/10536/DRO/DU:30042251

Idioma(s)

eng

Publicador

IEEE

Relação

http://dro.deakin.edu.au/eserv/DU:30042251/wang-humanaction-2011.pdf

http://hdl.handle.net/10.1109/RIISS.2011.5945790

Direitos

2011, IEEE

Palavras-Chave #human action recognition #3D SIFT #Latent Dirichlet Allocation
Tipo

Conference Paper