Scene invariant multi camera crowd counting
Date(s) |
2014
|
---|---|
Abstract |
Automated crowd counting has become an active field of computer vision research in recent years. Existing approaches are scene-specific, as they are designed to operate in the single camera viewpoint that was used to train the system. Real world camera networks often span multiple viewpoints within a facility, including many regions of overlap. This paper proposes a novel scene invariant crowd counting algorithm that is designed to operate across multiple cameras. The approach uses camera calibration to normalise features between viewpoints and to compensate for regions of overlap. This compensation is performed by constructing an 'overlap map' which provides a measure of how much an object at one location is visible within other viewpoints. An investigation into the suitability of various feature types and regression models for scene invariant crowd counting is also conducted. The features investigated include object size, shape, edges and keypoints. The regression models evaluated include neural networks, K-nearest neighbours, linear regression and Gaussian process regression. Our experiments demonstrate that accurate crowd counting was achieved across seven benchmark datasets, with optimal performance observed when all features were used together with Gaussian process regression. The combination of scene invariance and multi camera crowd counting is evaluated by training the system on footage obtained from the QUT camera network and testing it on three cameras from the PETS 2009 database. Highly accurate crowd counting was observed with a mean relative error of less than 10%. Our approach enables a pre-trained system to be deployed on a new environment without any additional training, bringing the field one step closer toward a 'plug and play' system. |
Format |
application/pdf |
Identifier | |
Publisher |
Elsevier |
Relation |
http://eprints.qut.edu.au/63270/1/Scene_Invariant_Multi_Camera_Crowd_Counting.pdf DOI:10.1016/j.patrec.2013.10.002 Ryan, David, Denman, Simon, Fookes, Clinton B., & Sridharan, Sridha (2014) Scene invariant multi camera crowd counting. Pattern Recognition Letters, 44, pp. 98-112. |
Rights |
Copyright 2013 Elsevier. This is the author’s version of a work that was accepted for publication in Pattern Recognition Letters. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms, may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in Pattern Recognition Letters, 44, 2014. DOI: 10.1016/j.patrec.2013.10.002 |
Source |
School of Electrical Engineering & Computer Science; Information Security Institute; Science & Engineering Faculty |
Keywords | #080104 Computer Vision #080106 Image Processing #crowd counting #multi camera #scene invariant #computer vision |
Type |
Journal Article |
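
The abstract above reports a mean relative error of less than 10% but does not define the metric. As a minimal sketch, assuming the common definition (the per-frame absolute counting error divided by the true count, averaged over frames), it could be computed as follows. The function name `mean_relative_error` and the sample counts are hypothetical and are not taken from the paper.

```python
from typing import Sequence


def mean_relative_error(estimated: Sequence[float], actual: Sequence[float]) -> float:
    """Average of |estimate - truth| / truth over frames with a non-zero true count.

    Assumed definition for illustration; the paper may define the metric differently.
    """
    if len(estimated) != len(actual):
        raise ValueError("estimated and actual must have the same length")
    # Frames with a true count of zero are skipped to avoid division by zero.
    pairs = [(e, a) for e, a in zip(estimated, actual) if a > 0]
    if not pairs:
        raise ValueError("no frames with a non-zero true count")
    return sum(abs(e - a) / a for e, a in pairs) / len(pairs)


# Hypothetical per-frame crowd counts, for illustration only.
estimates = [9.0, 22.0, 30.0]
truth = [10.0, 20.0, 30.0]
mre = mean_relative_error(estimates, truth)
print(f"Mean relative error: {mre:.1%}")
```

Under this definition, a value below 0.10 corresponds to the "less than 10%" figure quoted in the abstract.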