Real-time video event detection in crowded scenes using MPEG derived features : a multiple instance learning approach


Autoria(s): Xu, Jingxin; Denman, Simon; Reddy, Vikas; Fookes, Clinton B.; Sridharan, Sridha
Data(s)

15/07/2014

Resumo

This paper presents an investigation into event detection in crowded scenes, where the event of interest co-occurs with other activities and only binary labels at the clip level are available. The proposed approach incorporates a fast feature descriptor from the MPEG domain, and a novel multiple instance learning (MIL) algorithm using sparse approximation and random sensing. MPEG motion vectors are used to build particle trajectories that represent the motion of objects in uniform video clips, and the MPEG DCT coefficients are used to compute a foreground map to remove background particles. Trajectories are transformed into the Fourier domain, and the Fourier representations are quantized into visual words using the K-Means algorithm. The proposed MIL algorithm models the scene as a linear combination of independent events, where each event is a distribution of visual words. Experimental results show that the proposed approaches achieve promising results for event detection compared to the state-of-the-art.

Formato

application/pdf

Identificador

http://eprints.qut.edu.au/65301/

Publicador

Elsevier

Relação

http://eprints.qut.edu.au/65301/10/65301.pdf

DOI:10.1016/j.patrec.2013.11.019

Xu, Jingxin, Denman, Simon, Reddy, Vikas, Fookes, Clinton B., & Sridharan, Sridha (2014) Real-time video event detection in crowded scenes using MPEG derived features : a multiple instance learning approach. Pattern Recognition Letters, 44, pp. 113-125.

http://purl.org/au-research/grants/ARC/LP0990135

Direitos

Copyright 2013 Elsevier

This is the author’s version of a work that was accepted for publication in Pattern Recognition Letters. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in Pattern Recognition Letters, [VOL 44, (2014)] DOI: 10.1016/j.patrec.2013.11.019

Fonte

School of Electrical Engineering & Computer Science; Faculty of Science and Technology

Palavras-Chave #080104 Computer Vision
Tipo

Journal Article