Biblioteca Digital

34 resultados para Image Based Visual Servoing

Automatic texture segmentation for texture-based image retrieval

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Texture-segmentation is the crucial initial step for texture-based image retrieval. Texture is the main difficulty faced to a segmentation method. Many image segmentation algorithms either can’t handle texture properly or can’t obtain texture features directly during segmentation which can be used for retrieval purpose. This paper describes an automatic texture segmentation algorithm based on a set of features derived from wavelet domain, which are effective in texture description for retrieval purpose. Simulation results show that the proposed algorithm can efficiently capture the textured regions in arbitrary images, with the features of each region extracted as well. The features of each textured region can be directly used to index image database with applications as texture-based image retrieval.

Web-based image retrieval with a case study

Relevância:

40.00% 40.00%

Publicador:

Perceptual quality metrics applied to still image compression

Relevância:

30.00% 30.00%

Publicador:

Resumo:

We present a review of perceptual image quality metrics and their application to still image compression. The review describes how image quality metrics can be used to guide an image compression scheme and outlines the advantages, disadvantages and limitations of a number of quality metrics. We examine a broad range of metrics ranging from simple mathematical measures to those which incorporate full perceptual models. We highlight some variation in the models for luminance adaptation and the contrast sensitivity function and discuss what appears to be a lack of a general consensus regarding the models which best describe contrast masking and error summation. We identify how the various perceptual components have been incorporated in quality metrics, and identify a number of psychophysical testing techniques that can be used to validate the metrics. We conclude by illustrating some of the issues discussed throughout the paper with a simple demonstration. (C) 1998 Elsevier Science B.V. All rights reserved.

A wavelet visible difference predictor

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In this paper, we describe a model of the human visual system (HVS) based on the wavelet transform. This model is largely based on a previously proposed model, but has a number of modifications that make it more amenable to potential integration into a wavelet based image compression scheme. These modifications include the use of a separable wavelet transform instead of the cortex transform, the application of a wavelet contrast sensitivity function (CSP), and a simplified definition of subband contrast that allows us to predict noise visibility directly from wavelet coefficients. Initially, we outline the luminance, frequency, and masking sensitivities of the HVS and discuss how these can be incorporated into the wavelet transform. We then outline a number of limitations of the wavelet transform as a model of the HVS, namely the lack of translational invariance and poor orientation sensitivity. In order to investigate the efficacy of this wavelet based model, a wavelet visible difference predictor (WVDP) is described. The WVDP is then used to predict visible differences between an original and compressed (or noisy) image. Results are presented to emphasize the limitations of commonly used measures of image quality and to demonstrate the performance of the WVDP, The paper concludes with suggestions on bow the WVDP can be used to determine a visually optimal quantization strategy for wavelet coefficients and produce a quantitative measure of image quality.

Parallel processing and image analysis in the eyes of mantis shrimps

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The compound eyes of mantis shrimps, a group of tropical marine crustaceans, incorporate principles of serial and parallel processing of visual information that may be applicable to artificial imaging systems. Their eyes include numerous specializations for analysis of the spectral and polarizational properties of light, and include more photoreceptor classes for analysis of ultraviolet light, color, and polarization than occur in any other known visual system. This is possible because receptors in different regions of the eye are anatomically diverse and incorporate unusual structural features, such as spectral filters, not seen in other compound eyes. Unlike eyes of most other animals, eyes of mantis shrimps must move to acquire some types of visual information and to integrate color and polarization with spatial vision. Information leaving the retina appears to be processed into numerous parallel data streams leading into the central nervous system, greatly reducing the analytical requirements at higher levels. Many of these unusual features of mantis shrimp vision may inspire new sensor designs for machine vision

Identifying rate-limiting nodes in large-scale cortical networks for visuospatial processing: an illustration using fMRI

Relevância:

30.00% 30.00%

Publicador:

Resumo:

With the advent of functional neuroimaging techniques, in particular functional magnetic resonance imaging (fMRI), we have gained greater insight into the neural correlates of visuospatial function. However, it may not always be easy to identify the cerebral regions most specifically associated with performance on a given task. One approach is to examine the quantitative relationships between regional activation and behavioral performance measures. In the present study, we investigated the functional neuroanatomy of two different visuospatial processing tasks, judgement of line orientation and mental rotation. Twenty-four normal participants were scanned with fMRI using blocked periodic designs for experimental task presentation. Accuracy and reaction time (RT) to each trial of both activation and baseline conditions in each experiment was recorded. Both experiments activated dorsal and ventral visual cortical areas as well as dorsolateral prefrontal cortex. More regionally specific associations with task performance were identified by estimating the association between (sinusoidal) power of functional response and mean RT to the activation condition; a permutation test based on spatial statistics was used for inference. There was significant behavioral-physiological association in right ventral extrastriate cortex for the line orientation task and in bilateral (predominantly right) superior parietal lobule for the mental rotation task. Comparable associations were not found between power of response and RT to the baseline conditions of the tasks. These data suggest that one region in a neurocognitive network may be most strongly associated with behavioral performance and this may be regarded as the computationally least efficient or rate-limiting node of the network.

Visual biology of Hawaiian coral reef fishes. II. Colors of Hawaiian coral reef fish

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The colors of 51 species of Hawaiian reef fish have been measured using a spectrometer and therefore can be described in objective terms that are not influenced by the human visual experience. In common with other known reef fish populations, the colors of Hawaiian reef fish occupy spectral positions from 300-800nm; yellow or orange with blue, yellow with black, and black with white are the most frequently combined colors; and there is no link between possession of ultraviolet (UV) reflectance and UV visual sensitivity or the potential for UV visual sensitivity. In contrast to other reef systems, blue, yellow, and orange appear more frequently in Hawaiian reef fish. Based on spectral quality of reflections from fish skin, trends in fish colors can be seen that are indicative of both visually driven selective pressures and chemical or physical constraints on the design of colors. UV-reflecting colors can function as semiprivate communication signals. White or yellow with black form highly contrasting patterns that transmit well through clear water. Labroid fishes display uniquely complex colors but lack the ability to see the UV component that is common in their pigments. Step-shaped spectral curves are usually long-wavelength colors such as yellow or red, and colors with a peak-shaped spectral curves are green, blue, violet, and UV.

Characterisation of the three-dimensional structure of earthworm burrow systems using image analysis and mathematical morphology

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The aim of this work was to exemplify the specific contribution of both two- and three-dimensional (31)) X-ray computed tomography to characterise earthworm burrow systems. To achieve this purpose we used 3D mathematical morphology operators to characterise burrow systems resulting from the activity of an anecic (Aporrectodea noctunia), and an endogeic species (Allolobophora chlorotica), when both species were introduced either separately or together into artificial soil cores. Images of these soil cores were obtained using a medical X-ray tomography scanner. Three-dimensional reconstructions of burrow systems were obtained using a specifically developed segmentation algorithm. To study the differences between burrow systems, a set of classical tools of mathematical morphology (granulometries) were used. So-called granulometries based on different structuring elements clearly separated the different burrow systems. They enabled us to show that burrows made by the anecic species were fatter, longer, more vertical, more continuous but less sinuous than burrows of the endogeic species. The granulometry transform of the soil matrix showed that burrows made by A. nocturna were more evenly distributed than those of A. chlorotica. Although a good discrimination was possible when only one species was introduced into the soil cores, it was not possible to separate burrows of the two species from each other in cases where species were introduced into the same soil core. This limitation, partly due to the insufficient spatial resolution of the medical scanner, precluded the use of the morphological operators to study putative interactions between the two species.

Why fear snakes and spiders? Evidence for a learning-based account of fear-relevant stimulus-processing

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Fear-relevant stimuli, such as snakes, spiders and heights, preferentially capture attention as compared to nonfear-relevant stimuli. This is said to reflect an encapsulated mechanism whereby attention is captured by the simple perceptual features of stimuli that have evolutionary significance. Research, using pictures of snakes and spiders, has found some support for this account; however, participants may have had prior fear of snakes and spiders that influenced results. The current research compared responses of snake and spider experts who had little fear of snakes and spiders, and control participants across a series of affective priming and visual search tasks. Experts discriminated between dangerous and nondangerous snakes and spiders, and expert responses to pictures of nondangerous snakes and spiders differed from those of control participants. The current results dispute that stimulus fear relevance is based purely on perceptual features, and provides support for the role of learning and experience.

A comparison of computer-based methods for the determination of onset of muscle contraction using electromyography

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Little consensus exists in the literature regarding methods for determination of the onset of electromyographic (EMG) activity. The aim of this study was to compare the relative accuracy of a range of computer-based techniques with respect to EMG onset determined visually by an experienced examiner. Twenty-seven methods were compared which varied in terms of EMG processing (low pass filtering at 10, 50 and 500 Hz), threshold value (1, 2 and 3 SD beyond mean of baseline activity) and the number of samples for which the mean must exceed the defined threshold (20, 50 and 100 ms). Three hundred randomly selected trials of a postural task were evaluated using each technique. The visual determination of EMG onset was found to be highly repeatable between days. Linear regression equations were calculated for the values selected by each computer method which indicated that the onset values selected by the majority of the parameter combinations deviated significantly from the visually derived onset values. Several methods accurately selected the time of onset of EMG activity and are recommended for future use. Copyright (C) 1996 Elsevier Science Ireland Ltd.

Eye-movements and visual imagery: A working memory approach to the treatment of post-traumatic stress disorder

Relevância:

30.00% 30.00%

Publicador:

Resumo:

It has been claimed that the symptoms of post-traumatic stress disorder (PTSD) can be ameliorated by eye-movement desensitization-reprocessing therapy (EMD-R), a procedure that involves the individual making saccadic eye-movements while imagining the traumatic event. We hypothesized that these eye-movements reduce the vividness of distressing images by disrupting the function of the visuospatial sketchpad (VSSP) of working memory, and that by doing so they reduce the intensity of the emotion associated with the image. This hypothesis was tested by asking non-PTSD participants to form images of neutral and negative pictures under dual task conditions. Their images were less vivid with concurrent eye-movements and with a concurrent spatial tapping task that did not involve eye-movements. In the first three experiments, these secondary tasks did not consistently affect participants' emotional responses to the images. However, Expt 4 used personal recollections as stimuli for the imagery task, and demonstrated a significant reduction in emotional response under the same dual task conditions. These results suggest that, if EMD-R works, it does so by reducing the vividness and emotiveness of traumatic images via the VSSP of working memory. Other visuospatial tasks may also be of therapeutic value.

Relating body mass index to figural stimuli: population-based normative data for Caucasians

Relevância:

30.00% 30.00%

Publicador:

Resumo:

OBJECTIVE: To establish body mass index (BMI) norms for standard figural stimuli using a large Caucasian population-based sample. In addition, we sought to determine the effectiveness of the figural stimuli to identify individuals as obese or thin. DESIGN: All Caucasian twins born in Virginia between 1915 and 1971 were identified by public birth record. In addition, 3347 individual twins responded to a letter published in the newsletter of the American Association of Retired Persons (AARP). All adult twins (aged 18 and over) from both of these sources and their family members were mailed a 16 page 'Health and Lifestyle' questionnaire. SUBJECTS: BMI and silhouette data were available on 16 728 females and 11 366 males ranging in age from 18- 100. MEASUREMENTS: Self-report information on height-weight, current body size, desired body size and a discrepancy score using standard figural stimuli. RESULTS: Gender- and age-specific norms are presented linking BMI to each of the figural stimuli. Additional norms for desired body size and discrepancy scores are also presented. Receiver operating curves (ROC) indicate that the figural stimuli are effective in classifying individuals as obese or thin. CONCLUSIONS: With the establishment of these norms, the silhouettes used in standard body image assessment can now be linked to BMI. Differences were observed between women and men in terms of desired body size and discrepancy scores, with women preferring smaller sizes. The figural stimuli are a robust technique for classifying individuals as obese or thin.

Using visual spatial search interface for WWW applications

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Most Internet search engines are keyword-based. They are not efficient for the queries where geographical location is important, such as finding hotels within an area or close to a place of interest. A natural interface for spatial searching is a map, which can be used not only to display locations of search results but also to assist forming search conditions. A map-based search engine requires a well-designed visual interface that is intuitive to use yet flexible and expressive enough to support various types of spatial queries as well as aspatial queries. Similar to hyperlinks for text and images in an HTML page, spatial objects in a map should support hyperlinks. Such an interface needs to be scalable with the size of the geographical regions and the number of websites it covers. In spite of handling typically a very large amount of spatial data, a map-based search interface should meet the expectation of fast response time for interactive applications. In this paper we discuss general requirements and the design for a new map-based web search interface, focusing on integration with the WWW and visual spatial query interface. A number of current and future research issues are discussed, and a prototype for the University of Queensland is presented. (C) 2001 Published by Elsevier Science Ltd.

A segmentation-based and partial-volume-compensated method for an accurate measurement of lateral ventricular volumes on T-1-weighted magnetic resonance images

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Lateral ventricular volumes based on segmented brain MR images can be significantly underestimated if partial volume effects are not considered. This is because a group of voxels in the neighborhood of lateral ventricles is often mis-classified as gray matter voxels due to partial volume effects. This group of voxels is actually a mixture of ventricular cerebro-spinal fluid and the white matter and therefore, a portion of it should be included as part of the lateral ventricular structure. In this note, we describe an automated method for the measurement of lateral ventricular volumes on segmented brain MR images. Image segmentation was carried in combination of intensity correction and thresholding. The method is featured with a procedure for addressing mis-classified voxels in the surrounding of lateral ventricles. A detailed analysis showed that lateral ventricular volumes could be underestimated by 10 to 30% depending upon the size of the lateral ventricular structure, if mis-classified voxels were not included. Validation of the method was done through comparison with the averaged manually traced volumes. Finally, the merit of the method is demonstrated in the evaluation of the rate of lateral ventricular enlargement. (C) 2001 Elsevier Science Inc. All rights reserved.

Monocular and binocular distance cues: insights from visual form agnosia I (of III)

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The human nervous system constructs a Euclidean representation of near (personal) space by combining multiple sources of information (cues). We investigated the cues used for the representation of personal space in a patient with visual form agnosia (DF). Our results indicated that DF relies predominantly on binocular vergence information when determining the distance of a target despite the presence of other (retinal) cues. Notably, DF was able to construct an Euclidean representation of personal space from vergence alone. This finding supports previous assertions that vergence provides the nervous system with veridical information for the construction of personal space. The results from the current study, together with those of others, suggest that: (i) the ventral stream is responsible for extracting depth and distance information from monocular retinal cues (i.e. from shading, texture, perspective) and (ii) the dorsal stream has access to binocular information (from horizontal image disparities and vergence). These results also indicate that DF was not able to use size information to gauge target distance, suggesting that intact temporal cortex is necessary for learned size to influence distance processing. Our findings further suggest that in neurologically intact humans, object information extracted in the ventral pathway is combined with the products of dorsal stream processing for guiding prehension. Finally, we studied the size-distance paradox in visual form agnosia in order to explore the cognitive use of size information. The results of this experiment were consistent with a previous suggestion that the paradox is a cognitive phenomenon.

«
1
2
3
»