Learning Temporal Alignment Uncertainty for Efficient Event Detection


Autoria(s): Abbasnejad, Iman; Sridharan, Sridha; Denman, Simon; Fookes, Clinton B.
Data(s)

2015

Resumo

In this paper we tackle the problem of efficient video event detection. We argue that linear detection functions should be preferred in this regard due to their scalability and efficiency during estimation and evaluation. A popular approach in this regard is to represent a sequence using a bag of words (BOW) representation due to its: (i) fixed dimensionality irrespective of the sequence length, and (ii) its ability to compactly model the statistics in the sequence. A drawback to the BOW representation, however, is the intrinsic destruction of the temporal ordering information. In this paper we propose a new representation that leverages the uncertainty in relative temporal alignments between pairs of sequences while not destroying temporal ordering. Our representation, like BOW, is of a fixed dimensionality making it easily integrated with a linear detection function. Extensive experiments on CK+, 6DMG, and UvA-NEMO databases show significant performance improvements across both isolated and continuous event detection tasks.

Identificador

http://eprints.qut.edu.au/90802/

Relação

http://arxiv.org/pdf/1509.01343.pdf

Abbasnejad, Iman, Sridharan, Sridha, Denman, Simon, & Fookes, Clinton B. (2015) Learning Temporal Alignment Uncertainty for Efficient Event Detection. In International Conference on Digital Image Computing: Techniques and Applications (DICTA), 23-25 November 2015, Adelaide Town Hall, Adelaide, South Australia, Australia.

http://purl.org/au-research/grants/ARC/DP140100793

Direitos

The authors.

Fonte

School of Electrical Engineering & Computer Science; Science & Engineering Faculty

Palavras-Chave #080106 Image Processing #080109 Pattern Recognition and Data Mining #Event Detection #Dynamic Time Warping #Temporal Alignment #Bag of Words
Tipo

Conference Paper