984 resultados para Computer vision - Mathematics


Relevância:

80.00% 80.00%

Publicador:

Relevância:

80.00% 80.00%

Publicador:

Relevância:

80.00% 80.00%

Publicador:

Resumo:

We examine the construction of new filters for computing local energy, and compare these filters with the Gabor filters and the three-point-filter of Venkatesh [l]. Further, we demonstrate that the effect of convolution with complex Gabor filters is to band-pass (with some differentiating effect) and compute the local energy of the result. The magnitude of the resulting local energy is then used to detect features [2], [3] (step features, texture etc.), and the phase is used to classify the detected features [l], [4] or provide disparity information for stereo [5] and motion work [6], [7]. Each of these types of information can be obtained at multiple resolutions, enabling the use of course to fine strategies for computing disparity, and allowing the discrimination of image textures on the basis of which parts of the Fourier domain they dominate [8], [9].

Relevância:

80.00% 80.00%

Publicador:

Resumo:

In this paper, we investigate the use of a wavelet transform-based analysis of audio tracks accompanying videos for the problem of automatic program genre detection. We compare the classification performance based on wavelet-based audio features to that using conventional features derived from Fourier and time analysis for the task of discriminating TV programs such as news, commercials, music shows, concerts, motor racing games, and animated cartoons. Three different classifiers namely the Decision Trees, SVMs, and k-Nearest Neighbours are studied to analyse the reliability of the performance of our wavelet features based approach. Further, we investigate the issue of an appropriate duration of an audio clip to be analyzed for this automatic genre determination. Our experimental results show that features derived from the wavelet transform of the audio signal can very well separate the six video genres studied. It is also found that there is no significant difference in performance with varying audio clip durations across the classifiers.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Many tasks in computer vision can be expressed as graph problems. This allows the task to be solved using a well studied algorithm, however many of these algorithms are of exponential complexity. This is a disadvantage when considered in the context of searching a database of images or videos for similarity. Work by Mesaner and Bunke (1995) has suggested a new class of graph matching algorithms which uses a priori knowledge about a database of models to reduce the time taken during online classification. This paper presents a new algorithm which extends the earlier work to detection of the largest common subgraph.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The Point Distribution Model (PDM) has been successfully used in modelling shape variations in groups of static images. It has also been effectively adapted to temporal image sets and used to track moving bodies such as hands and walking persons. However standard models do not consider the temporal characteristics of the data and are purely models of shape. This research proposes an extension to the PDM which explicitly considers the temporal sequencing of the images in the motion. The modified model can then be built from temporal quantities such as linear velocity and acceleration which are derived from the images. The new model formulation also enables movements to be tracked and classified according to their distinguishing temporal characteristics. This has been tested against distinct sets of arm movements under varying sets of experimental conditions.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

In this paper, we present a distributed surveillance system that uses multiple cheap static cameras to track multiple people in indoor environments. The system has a set of Camera Processing Modules and a Central Module to coordinate the tracking tasks among the cameras. Since each object in the scene can be tracked by a number of cameras, the problem is how to choose the most appropriate camera for each object. We propose a novel algorithm to allocate objects to cameras using the object-to-camera distance while taking into account occlusion. The algorithm attempts to assign objects in the overlapping fields of view to the nearest camera which can see the object without occlusion. Experimental results show that the system can coordinate cameras to track people properly and can deal well with occlusion.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

This paper presents techniques for analysing human behaviour via video surveillance. In known scenes under surveillance, common paths of movement between entry and exit points are obtained and classified. These are used, together with a priori velocity data, to serve as a model of normal traffic flow in the scene. Surveillance sequences are then processed to extract and track the movement of people in the scene, which is compared with the models to enable detection of abnormal movement

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The Point Distribution Model (PDM) has proven effective in modelling variations in shape in sets of images, including those in which motion is involved such as body and hand tracking. This paper proposes an extension to the PDM through a re-parameterisation of the model which uses factors such as the angular velocity and distance travelled for sets of points on a moving shape. This then enables non-linear quantities such as acceleration and the average velocity of the body to be expressed in a linear model by the PDM. Results are shown for objects with known acceleration and deceleration components, these being a simulated pendulum modelled using simple harmonic motion and video sequences of a real pendulum in motion.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

This paper forms a continuation of our work focused on exploiting film grammar for the task of automated film understanding. We examine film rhythm, a powerful narrative concept used to endow structure and form to the film compositionally and to enhance its lyrical quality experientially. Of the many, often complex, cinematic devices contributing to film rhythm, this paper investigates the rhythmic elements that are present in edited sequences of shots, and presents a novel computational model to detect shot structural rhythm as either metric, accelerated, decelerated, or free. Details of the algorithm for the extraction of these editing rhythm classes are presented, along with experimental results on real movie data. Following this we study the usefulness of combining the rhythmic patterns induced through both motion and editing in film. We show that, whilst detailed content identification via rhythm types alone is not possible by virtue of the fact that film is not codified to this level in terms of rhythmic elements, analysis of the combined motion/shot rhythm can allow us to determine that the content has changed and hypothesize as to why this is so. We present 3 such categories of change and demonstrate their efficacy for capturing useful film elements (e.g., scene change precipitated by plot event), by providing data support from 5 motion pictures.