982 resultados para camera motion
Resumo:
We present a multimodal detection and tracking algorithm for sensors composed of a camera mounted between two microphones. Target localization is performed on color-based change detection in the video modality and on time difference of arrival (TDOA) estimation between the two microphones in the audio modality. The TDOA is computed by multiband generalized cross correlation (GCC) analysis. The estimated directions of arrival are then postprocessed using a Riccati Kalman filter. The visual and audio estimates are finally integrated, at the likelihood level, into a particle filter (PF) that uses a zero-order motion model, and a weighted probabilistic data association (WPDA) scheme. We demonstrate that the Kalman filtering (KF) improves the accuracy of the audio source localization and that the WPDA helps to enhance the tracking performance of sensor fusion in reverberant scenarios. The combination of multiband GCC, KF, and WPDA within the particle filtering framework improves the performance of the algorithm in noisy scenarios. We also show how the proposed audiovisual tracker summarizes the observed scene by generating metadata that can be transmitted to other network nodes instead of transmitting the raw images and can be used for very low bit rate communication. Moreover, the generated metadata can also be used to detect and monitor events of interest.
Resumo:
From perspective of structure synthesis, certain special geometric constraints, such as joint axes intersecting at one point or perpendicular to each other, are necessary in realizing the end-effector motion of kinematically decoupled parallel manipulators (PMs) along individual motion axes. These requirements are difficult to achieve in the actual system due to assembly errors and manufacturing tolerances. Those errors that violate the geometric constraint requirements are termed “constraint errors”. The constraint errors usually are more troublesome than other manipulator errors because the decoupled motion characteristics of the manipulator may no longer exist and the decoupled kinematic models will be rendered useless due to these constraint errors. Therefore, identification and prevention of these constraint errors in initial design and manufacturing stage are of great significance. In this article, three basic types of constraint errors are identified, and an approach to evaluate the effects of constraint errors on decoupling characteristics of PMs is proposed. This approach is illustrated by a 6-DOF PM with decoupled translation and rotation. The results show that the proposed evaluation method is effective to guide design and assembly.
Resumo:
Single cell recording studies have resulted in a detailed understanding of motion-sensitive neurons in non-human primate visual cortex. However, it is not known to what extent response properties of motion-sensitive neurons in the non-human primate brain mirror response characteristics of motion-sensitive neurons in the human brain. Using a motion adaptation paradigm, the direction aftereffect, we show that changes in the activity of human motion-sensitive neurons to moving dot patterns that differ in dot density bear a strong resemblance to data from macaque monkey. We also show a division-like inhibition between neural populations tuned to opposite directions, which also mirrors neural-inhibitory behaviour in macaque. These findings strongly suggest that motion-sensitive neurons in human and non-human primates share common response and inhibitory characteristics.
Resumo:
A new reconfigurable subpixel interpolation architecture for multistandard (e.g., MPEG-2, MPEG-4, H.264, and AVS) video motion estimation (ME) is presented. This exploits the mixed use of parallel and serial-input FIR filters to achieve high throughput rate and efficient silicon utilization. Silicon design studies show that this can be implemented using 34.8 × 10 3 gates with area and performance that compares very favorably with specific fixed solutions, e.g., for the H.264 standard alone. This can support SDTV and HDTV applications when implemented in 0.18 µm CMOS technology, with further performance enhancements achievable at 0.13 µm and below. © 2009 IEEE.
Resumo:
A new kind of photographic representation, called movement-image is proposed and discussed to record the visual experience of the journey through urban highways. It consists of performing long exposure photographic shots while the track is traversed, thus registering a time-panorama which includes landscape signs and inner spaces of the ways involved. This proposal is linked to the limitations of representing these expressways, if they are understood as structures of instrumental origin, where the resulting experience comes from moving at high speed through the territory. In al almost all cases the aesthetic approach or urban integration with the city and landscape are excluded. In this sense, although such structures may be an opportunity to collect, build and colonize the urban landscape, the lack of adequate representation of the phenomenon causes a difficulty in its understanding and transformation. The options for representation using photography is assumed, knowing its own particular tradition in the use of long exposures, for the expression of the mobile, and the multiple visual attention, divided or weakened.
Resumo:
The importance and use of text extraction from camera based coloured scene images is rapidly increasing with time. Text within a camera grabbed image can contain a huge amount of meta data about that scene. Such meta data can be useful for identification, indexing and retrieval purposes. While the segmentation and recognition of text from document images is quite successful, detection of coloured scene text is a new challenge for all camera based images. Common problems for text extraction from camera based images are the lack of prior knowledge of any kind of text features such as colour, font, size and orientation as well as the location of the probable text regions. In this paper, we document the development of a fully automatic and extremely robust text segmentation technique that can be used for any type of camera grabbed frame be it single image or video. A new algorithm is proposed which can overcome the current problems of text segmentation. The algorithm exploits text appearance in terms of colour and spatial distribution. When the new text extraction technique was tested on a variety of camera based images it was found to out perform existing techniques (or something similar). The proposed technique also overcomes any problems that can arise due to an unconstraint complex background. The novelty in the works arises from the fact that this is the first time that colour and spatial information are used simultaneously for the purpose of text extraction.
Resumo:
Edgard Vare` se’s Poe` me e´ lectronique can be viewed as a bridge between early twentieth-century modernism and electroacoustic music. This connection to early modernism is most clearly seen in its use of musical juxtaposition, a favoured technique of early modernist composers, especially those active in Paris. Juxtaposition and non-motion are considered here, particularly in relationship to Smalley’s exposition of spectromorphology (Smalley 1986), which in its preoccupation with motion omits any significant consideration of non-motion. Juxtaposition and non-motion have an important history within twentieth-century music, and as an early classic of electroacoustic music, Poe` me e´ lectronique is a particularly striking example of a composition that is rich in juxtapositions similar to those found in passages of early modernist music. Examining Poe` me e´ lectronique through the lens of juxtaposition and non-motion reveals how the organisation of its juxtaposed sounds encourages the experience of sound structure suspended time.