Kernel analysis on Grassmann manifolds for action recognition


Autoria(s): Harandi, Mehrtash T.; Sanderson, Conrad; Shirazi, Sareh; Lovell, Brian C.
Data(s)

2013

Resumo

Modelling video sequences by subspaces has recently shown promise for recognising human actions. Subspaces are able to accommodate the effects of various image variations and can capture the dynamic properties of actions. Subspaces form a non-Euclidean and curved Riemannian manifold known as a Grassmann manifold. Inference on manifold spaces usually is achieved by embedding the manifolds in higher dimensional Euclidean spaces. In this paper, we instead propose to embed the Grassmann manifolds into reproducing kernel Hilbert spaces and then tackle the problem of discriminant analysis on such manifolds. To achieve efficient machinery, we propose graph-based local discriminant analysis that utilises within-class and between-class similarity graphs to characterise intra-class compactness and inter-class separability, respectively. Experiments on KTH, UCF Sports, and Ballet datasets show that the proposed approach obtains marked improvements in discrimination accuracy in comparison to several state-of-the-art methods, such as the kernel version of affine hull image-set distance, tensor canonical correlation analysis, spatial-temporal words and hierarchy of discriminative space-time neighbourhood features.

Formato

application/pdf

Identificador

http://eprints.qut.edu.au/57363/

Publicador

Elsevier

Relação

http://eprints.qut.edu.au/57363/1/harandi_grassmann_action_recognition_prl_2013.pdf

DOI:10.1016/j.patrec.2013.01.008

Harandi, Mehrtash T., Sanderson, Conrad, Shirazi, Sareh, & Lovell, Brian C. (2013) Kernel analysis on Grassmann manifolds for action recognition. Pattern Recognition Letters, 34(15), pp. 1906-1915.

Direitos

Copyright 2013 Elsevier.

This is the author’s version of a work that was accepted for publication in Pattern Recognition Letters. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in Pattern Recognition Letters, [VOL 34, ISSUE 15, (2013)] DOI: 10.1016/j.patrec.2013.01.008

Fonte

ARC Centre of Excellence for Robotic Vision; School of Electrical Engineering & Computer Science; Science & Engineering Faculty

Palavras-Chave #010200 APPLIED MATHEMATICS #010301 Numerical Analysis #080104 Computer Vision #080106 Image Processing #080109 Pattern Recognition and Data Mining #090609 Signal Processing
Tipo

Journal Article