4 results for Wavelet Transforms
in Digital Peer Publishing
Abstract:
For decades, Distance Transforms have proven useful in many image processing applications, and more recently they have started to be used in computer graphics environments. The goal of this paper is to propose a new technique based on Distance Transforms for detecting mesh elements that are close to the object's external contour (from a given point of view), and for using this information to weight the approximation error that is tolerated during the mesh simplification process. The results are evaluated in two ways: visually, and with an objective metric that measures the geometric difference between two polygonal meshes.
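The idea of weighting the tolerated error by distance to the contour can be sketched as follows. This is only a minimal illustration under stated assumptions, not the paper's method: the silhouette is assumed to be available as a binary mask rendered from the chosen viewpoint, the distance transform uses the city-block metric, and `tolerated_error` with its parameters `base_error`, `gain` and `max_distance` is a hypothetical weighting function invented for this example.

```python
from collections import deque


def distance_to_contour(mask):
    """Return the city-block distance from every pixel to the nearest background pixel.

    mask is a list of rows holding 1 (object silhouette) or 0 (background).
    Assumes the mask has at least one background pixel; otherwise the
    entries are left as None.
    """
    height, width = len(mask), len(mask[0])
    dist = [[None] * width for _ in range(height)]
    queue = deque()
    # Every background pixel is a source with distance 0.
    for y in range(height):
        for x in range(width):
            if not mask[y][x]:
                dist[y][x] = 0
                queue.append((y, x))
    # Multi-source breadth-first search spreads distances into the silhouette.
    while queue:
        y, x = queue.popleft()
        for dy, dx in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            ny, nx = y + dy, x + dx
            if 0 <= ny < height and 0 <= nx < width and dist[ny][nx] is None:
                dist[ny][nx] = dist[y][x] + 1
                queue.append((ny, nx))
    return dist


def tolerated_error(distance, base_error=0.01, gain=0.5, max_distance=10):
    """Hypothetical weighting: elements near the contour get a small tolerance,
    elements deep inside the silhouette get a larger one (clamped)."""
    return base_error * (1.0 + gain * min(distance, max_distance))


# A 3x3 silhouette centred in a 5x5 image.
mask = [
    [0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 1, 1, 1, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 0, 0, 0],
]
dist = distance_to_contour(mask)
```

In a simplification loop, a mesh element that projects to pixel `(y, x)` would be collapsed only if its approximation error stays below `tolerated_error(dist[y][x])`, so detail along the silhouette is preserved longer than detail in the interior.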
Abstract:
Audio-visual documents obtained from German TV news are classified according to the IPTC topic categorization scheme. To this end, standard text classification techniques are adapted to speech, video, and non-speech audio. For each of the three modalities, word analogues are generated: sequences of syllables for speech, “video words” based on low-level color features (color moments, color correlogram and color wavelet), and “audio words” based on low-level spectral features (spectral envelope and spectral flatness) for non-speech audio. Such audio and video words provide a means to represent the different modalities in a uniform way. Audio-visual documents are represented by the frequencies of their word analogues, following the standard bag-of-words approach. Support vector machines are used for supervised classification in a 1 vs. n setting. Classification based on speech outperforms all other single modalities. Combining speech with non-speech audio improves classification. Classification is further improved by supplementing speech and non-speech audio with video words. Optimal F-scores range between 62% and 94%, corresponding to 50% to 84% above chance. The optimal combination of modalities depends on the category to be recognized. The construction of audio and video words from low-level features provides a good basis for the integration of speech, non-speech audio and video.
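The bag-of-words representation and the F-score evaluation described above can be illustrated with a short sketch. It makes several assumptions that the abstract does not state: the token names (`syl:`, `vid:`, `aud:` prefixes) and category labels are made up, frequencies are normalised to relative frequencies, and the F-score is taken to be the balanced F1. The support vector machine itself is left out; only the feature construction, the 1 vs. n relabelling and the metric are shown.

```python
from collections import Counter


def bag_of_words(tokens, vocabulary):
    """Relative frequency of each vocabulary entry in one document.

    tokens may mix word analogues from several modalities, which is what
    lets speech, audio and video share a single feature vector.
    """
    counts = Counter(tokens)
    total = sum(counts[term] for term in vocabulary) or 1  # avoid division by zero
    return [counts[term] / total for term in vocabulary]


def one_vs_rest_labels(labels, target):
    """Binary labels for a 1 vs. n setting: 1 for the target category, 0 otherwise."""
    return [1 if label == target else 0 for label in labels]


def f1_score(y_true, y_pred):
    """Balanced F-score (harmonic mean of precision and recall) for binary labels."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical document mixing syllable, video-word and audio-word tokens.
vocabulary = ["syl:ta", "syl:ges", "vid:17", "aud:3"]
document = ["syl:ta", "syl:ges", "syl:ta", "vid:17"]
features = bag_of_words(document, vocabulary)

# Hypothetical category labels turned into a binary problem for "politics".
binary = one_vs_rest_labels(["politics", "sport", "politics"], "politics")
```

A vector like `features` would be the input to one binary classifier per category, and `f1_score` would be computed per category on held-out documents.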
Abstract:
In contemporary societies there are different ways to perceive the relation between identity and alterity and to describe the difference between “us” and “them”, residents and foreigners. Anthropologist Sandra Wallman maintains that in multi-cultural urban spaces the frontiers of diversity are not only burdensome markers of identity; they can also represent new chances to define “identity” and “alterity”. These frontiers can, in fact, work like interfaces through which a relationship with the other is built, time and again, in a creative way. From this point of view, the concept of boundary can offer many opportunities to creatively define the relation with the other and to open up new options for cognitive and physical movement. On the other hand, in many cases there are plenty of mechanisms of exclusion that transform a purely empirical distinction between “us” and “them” into an ontological contrast, as when the immigrant is subjected to hostility through discriminatory language. Even though these forms of racism are undoubtedly objectionable from a theoretical point of view, they are nevertheless socially “real”, in the sense that they are perpetually reaffirmed and strengthened in public opinion. They are in fact implicit “truths”, realities that are considered objective, common opinions that are part of day-to-day existence. That is why an anthropological perspective that includes the study of “common sense” should be adopted in present-day studies on migration, as pointed out by American anthropologist Michael Herzfeld. My primary goal is to analyze, with such a critical approach, some preconditions of racism and exclusion in contemporary multi-cultural urban spaces. In addition, this essay investigates positive strategies of comparing, interchanging, and negotiating alterity in social work.
I suggest that this approach can offer positive solutions for coping with “diversity” and for working out policies that recognize a common identity and, at the same time, do not disregard the relevance of political and economic power.
Abstract:
Visual fixation is employed by humans and some animals to keep a specific 3D location at the center of the visual gaze. Inspired by this phenomenon in nature, this paper explores the idea of transferring this mechanism to the context of video stabilization for a handheld video camera. A novel approach is presented that stabilizes a video by fixating on automatically extracted 3D target points. This approach differs from existing automatic solutions, which stabilize the video by smoothing. To determine the 3D target points, the recorded scene is analyzed with a state-of-the-art structure-from-motion algorithm, which estimates camera motion and reconstructs a 3D point cloud of the static scene objects. Special algorithms are presented that search for either virtual or real 3D target points which back-project close to the center of the image for as long a period of time as possible. The stabilization algorithm then transforms the original images of the sequence so that these 3D target points are kept exactly in the center of the image, which, in the case of real 3D target points, produces a perfectly stable result at the image center. Furthermore, different methods of additional user interaction are investigated. It is shown that the stabilization process can easily be controlled and that it can be combined with state-of-the-art tracking techniques in order to obtain a powerful image stabilization tool. The approach is evaluated on a variety of videos taken with a handheld camera in natural scenes.
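The fixation step can be sketched with a pinhole camera model. This is a simplified illustration, not the paper's algorithm: it assumes the structure-from-motion stage supplies, for each frame, a rotation matrix and translation vector mapping world to camera coordinates and a focal length in pixels, it places the principal point at the image center, and it corrects each frame with a pure 2D translation, whereas the abstract does not say which image transformation is used. All function names are hypothetical.

```python
def to_camera(point_world, rotation, translation):
    """Map a 3D world point into camera coordinates: x_cam = R * x_world + t.

    rotation is a 3x3 matrix given as three row lists, translation a 3-vector.
    """
    return tuple(
        sum(row[i] * point_world[i] for i in range(3)) + t
        for row, t in zip(rotation, translation)
    )


def project(point_cam, focal, width, height):
    """Pinhole projection with the principal point at the image center."""
    x, y, z = point_cam
    if z <= 0:
        raise ValueError("target point is behind the camera")
    return (focal * x / z + width / 2.0, focal * y / z + height / 2.0)


def centering_shift(target_world, rotation, translation, focal, width, height):
    """2D shift (dx, dy) in pixels that moves the projected target to the image center."""
    u, v = project(to_camera(target_world, rotation, translation), focal, width, height)
    return (width / 2.0 - u, height / 2.0 - v)


# Hypothetical data: one fixed 3D target seen in two frames of a shaky camera.
identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
target = (0.1, -0.05, 2.0)
frame_translations = [(0.0, 0.0, 0.0), (0.05, 0.0, 0.0)]
shifts = [
    centering_shift(target, identity, t, focal=800.0, width=640, height=480)
    for t in frame_translations
]
```

Applying each entry of `shifts` to its frame keeps the target at the image center across the sequence; a full implementation would also have to deal with the borders exposed by the shift.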