71 resultados para 280208 Computer Vision


Relevância:

80.00% 80.00%

Publicador:

Resumo:

The Point Distribution Model (PDM) has been successfully used in modelling shape variations in groups of static images. It has also been effectively adapted to temporal image sets and used to track moving bodies such as hands and walking persons. However standard models do not consider the temporal characteristics of the data and are purely models of shape. This research proposes an extension to the PDM which explicitly considers the temporal sequencing of the images in the motion. The modified model can then be built from temporal quantities such as linear velocity and acceleration which are derived from the images. The new model formulation also enables movements to be tracked and classified according to their distinguishing temporal characteristics. This has been tested against distinct sets of arm movements under varying sets of experimental conditions.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

In this paper, we present a distributed surveillance system that uses multiple cheap static cameras to track multiple people in indoor environments. The system has a set of Camera Processing Modules and a Central Module to coordinate the tracking tasks among the cameras. Since each object in the scene can be tracked by a number of cameras, the problem is how to choose the most appropriate camera for each object. We propose a novel algorithm to allocate objects to cameras using the object-to-camera distance while taking into account occlusion. The algorithm attempts to assign objects in the overlapping fields of view to the nearest camera which can see the object without occlusion. Experimental results show that the system can coordinate cameras to track people properly and can deal well with occlusion.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

This paper presents techniques for analysing human behaviour via video surveillance. In known scenes under surveillance, common paths of movement between entry and exit points are obtained and classified. These are used, together with a priori velocity data, to serve as a model of normal traffic flow in the scene. Surveillance sequences are then processed to extract and track the movement of people in the scene, which is compared with the models to enable detection of abnormal movement

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The Point Distribution Model (PDM) has proven effective in modelling variations in shape in sets of images, including those in which motion is involved such as body and hand tracking. This paper proposes an extension to the PDM through a re-parameterisation of the model which uses factors such as the angular velocity and distance travelled for sets of points on a moving shape. This then enables non-linear quantities such as acceleration and the average velocity of the body to be expressed in a linear model by the PDM. Results are shown for objects with known acceleration and deceleration components, these being a simulated pendulum modelled using simple harmonic motion and video sequences of a real pendulum in motion.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

This paper forms a continuation of our work focused on exploiting film grammar for the task of automated film understanding. We examine film rhythm, a powerful narrative concept used to endow structure and form to the film compositionally and to enhance its lyrical quality experientially. Of the many, often complex, cinematic devices contributing to film rhythm, this paper investigates the rhythmic elements that are present in edited sequences of shots, and presents a novel computational model to detect shot structural rhythm as either metric, accelerated, decelerated, or free. Details of the algorithm for the extraction of these editing rhythm classes are presented, along with experimental results on real movie data. Following this we study the usefulness of combining the rhythmic patterns induced through both motion and editing in film. We show that, whilst detailed content identification via rhythm types alone is not possible by virtue of the fact that film is not codified to this level in terms of rhythmic elements, analysis of the combined motion/shot rhythm can allow us to determine that the content has changed and hypothesize as to why this is so. We present 3 such categories of change and demonstrate their efficacy for capturing useful film elements (e.g., scene change precipitated by plot event), by providing data support from 5 motion pictures.

Relevância:

80.00% 80.00%

Publicador:

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The paper presents the Visual Mouse (VM), a novel and simple system for interaction with displays via hand gestures. Our method includes detecting bare hands using the fast SIFT (Scale-Invariant Feature Transform) algorithm saving long training time of the Adaboost algorithm, tracking hands based on the CAMShift algorithm, recognizing hand gestures in cluttered background via Principle Components Analysis (PCA) without extracting clear-cut hand contour, and defining simple and robustly interpretable vocabularies of hand gestures, which are subsequently used to control a computer mouse. The system provides a fast and simple interaction experience without the need for more expensive hardware and software.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Sparse representation has been introduced to address many recognition problems in computer vision. In this paper, we propose a new framework for object categorization based on sparse representation of local features. Unlike most of previous sparse coding based methods in object classification that only use sparse coding to extract high-level features, the proposed method incorporates sparse representation and classification into a unified framework. Therefore, it does not need a further classifier. Experimental results show that the proposed method achieved better or comparable accuracy than the well known bag-of-features representation with various classifiers.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Finding the skeleton of a 3D mesh is an essential task for many applications such as mesh animation, tracking, and 3D registeration. In recent years, new technologies in computer vision such as Microsoft Kinect have proven that a mesh skeleton can be useful such as in the case of human machine interactions. To calculate the 3D mesh skeleton, the mesh properties such as topology and its components relations are utilized. In this paper, we propose the usage of a novel algorithm that can efficiently calculate a vertex antipodal point. A vertex antipodal point is the diametrically opposite point that belongs to the same mesh. The set of centers of the connecting lines between each vertex and its antipodal point represents the 3D mesh desired skeleton. Post processing is completed for smoothing and fitting centers into optimized skeleton parts. The algorithm is tested on different classes of 3D objects and produced efficient results that are comparable with the literature. The algorithm has the advantages of producing high quality skeletons as it preserves details. This is suitable for applications where the mesh skeleton mapping is required to be kept as much as possible.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

With urbanization and vehicle availability, there exist many traffic problems including congestion, environmental impact and safety. In order to address these problems, we propose a video driven traffic modelling system in this paper. The system can simulate real-world traffic activities in a computer, based on traffic data recorded in videos. Video processing is employed to estimate metrics such as traffic volumes. These metrics are used to update the traffic system model, which is then simulated using the Paramics™ traffic simulation platform. Video driven traffic modelling has widespread potential application in traffic systems, due to the convenience and reduced costs of model development and maintenance. Experiments are conducted in this paper to demonstrate the effectiveness of the proposed system.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

We propose Video Driven Traffic Modelling (VDTM) for accurate simulation of real-world traffic behaviours with detailed information and low-cost model development and maintenance. Computer vision techniques are employed to estimate traffic parameters. These parameters are used to build and update a traffic system model. The model is simulated using the Paramics traffic simulation platform. Based on the simulation techniques, effects of traffic interventions can be evaluated in order to achieve better decision makings for traffic management authorities. In this paper, traffic parameters such as vehicle types, times of starting trips and corresponding origin-destinations are extracted from a video. A road network is manually defined according to the traffic composition in the video, and individual vehicles associated with extracted properties are modelled and simulated within the defined road network using Paramics. VDTM has widespread potential applications in supporting traffic decision-makings. To demonstrate the effectiveness, we apply it in optimizing a traffic signal control system, which adaptively adjusts green times of signals at an intersection to reduce traffic congestion.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Automatic face recognition (AFR) is an area with immense practical potential which includes a wide range of commercial and law enforcement applications, and it continues to be one of the most active research areas of computer vision. Even after over three decades of intense research, the state-of-the-art in AFR continues to improve, benefiting from advances in a range of different fields including image processing, pattern recognition, computer graphics and physiology. However, systems based on visible spectrum images continue to face challenges in the presence of illumination, pose and expression changes, as well as facial disguises, all of which can significantly decrease their accuracy. Amongst various approaches which have been proposed in an attempt to overcome these limitations, the use of infrared (IR) imaging has emerged as a particularly promising research direction. This paper presents a comprehensive and timely review of the literature on this subject.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Linear subspace representations of appearance variation are pervasive in computer vision. In this paper we address the problem of robustly matching them (computing the similarity between them) when they correspond to sets of images of different (possibly greatly so) scales. We show that the naïve solution of projecting the low-scale subspace into the high-scale image space is inadequate, especially at large scale discrepancies. A successful approach is proposed instead. It consists of (i) an interpolated projection of the low-scale subspace into the high-scale space, which is followed by (ii) a rotation of this initial estimate within the bounds of the imposed “downsampling constraint”. The optimal rotation is found in the closed-form which best aligns the high-scale reconstruction of the low-scale subspace with the reference it is compared to. The proposed method is evaluated on the problem of matching sets of face appearances under varying illumination. In comparison to the naïve matching, our algorithm is shown to greatly increase the separation of between-class and within-class similarities, as well as produce far more meaningful modes of common appearance on which the match score is based.