922 resultados para IMML and Visual IMML
Resumo:
The main objective of this work is to present a way to emulate some functions of the mammalian visual system and a model to analyze subjective sensations and visual illusions
Resumo:
La medida de calidad de vídeo sigue siendo necesaria para definir los criterios que caracterizan una señal que cumpla los requisitos de visionado impuestos por el usuario. Las nuevas tecnologías, como el vídeo 3D estereoscópico o formatos más allá de la alta definición, imponen nuevos criterios que deben ser analizadas para obtener la mayor satisfacción posible del usuario. Entre los problemas detectados durante el desarrollo de esta tesis doctoral se han determinado fenómenos que afectan a distintas fases de la cadena de producción audiovisual y tipo de contenido variado. En primer lugar, el proceso de generación de contenidos debe encontrarse controlado mediante parámetros que eviten que se produzca el disconfort visual y, consecuentemente, fatiga visual, especialmente en lo relativo a contenidos de 3D estereoscópico, tanto de animación como de acción real. Por otro lado, la medida de calidad relativa a la fase de compresión de vídeo emplea métricas que en ocasiones no se encuentran adaptadas a la percepción del usuario. El empleo de modelos psicovisuales y diagramas de atención visual permitirían ponderar las áreas de la imagen de manera que se preste mayor importancia a los píxeles que el usuario enfocará con mayor probabilidad. Estos dos bloques se relacionan a través de la definición del término saliencia. Saliencia es la capacidad del sistema visual para caracterizar una imagen visualizada ponderando las áreas que más atractivas resultan al ojo humano. La saliencia en generación de contenidos estereoscópicos se refiere principalmente a la profundidad simulada mediante la ilusión óptica, medida en términos de distancia del objeto virtual al ojo humano. Sin embargo, en vídeo bidimensional, la saliencia no se basa en la profundidad, sino en otros elementos adicionales, como el movimiento, el nivel de detalle, la posición de los píxeles o la aparición de caras, que serán los factores básicos que compondrán el modelo de atención visual desarrollado. Con el objetivo de detectar las características de una secuencia de vídeo estereoscópico que, con mayor probabilidad, pueden generar disconfort visual, se consultó la extensa literatura relativa a este tema y se realizaron unas pruebas subjetivas preliminares con usuarios. De esta forma, se llegó a la conclusión de que se producía disconfort en los casos en que se producía un cambio abrupto en la distribución de profundidades simuladas de la imagen, aparte de otras degradaciones como la denominada “violación de ventana”. A través de nuevas pruebas subjetivas centradas en analizar estos efectos con diferentes distribuciones de profundidades, se trataron de concretar los parámetros que definían esta imagen. Los resultados de las pruebas demuestran que los cambios abruptos en imágenes se producen en entornos con movimientos y disparidades negativas elevadas que producen interferencias en los procesos de acomodación y vergencia del ojo humano, así como una necesidad en el aumento de los tiempos de enfoque del cristalino. En la mejora de las métricas de calidad a través de modelos que se adaptan al sistema visual humano, se realizaron también pruebas subjetivas que ayudaron a determinar la importancia de cada uno de los factores a la hora de enmascarar una determinada degradación. Los resultados demuestran una ligera mejora en los resultados obtenidos al aplicar máscaras de ponderación y atención visual, los cuales aproximan los parámetros de calidad objetiva a la respuesta del ojo humano. ABSTRACT Video quality assessment is still a necessary tool for defining the criteria to characterize a signal with the viewing requirements imposed by the final user. New technologies, such as 3D stereoscopic video and formats of HD and beyond HD oblige to develop new analysis of video features for obtaining the highest user’s satisfaction. Among the problems detected during the process of this doctoral thesis, it has been determined that some phenomena affect to different phases in the audiovisual production chain, apart from the type of content. On first instance, the generation of contents process should be enough controlled through parameters that avoid the occurrence of visual discomfort in observer’s eye, and consequently, visual fatigue. It is especially necessary controlling sequences of stereoscopic 3D, with both animation and live-action contents. On the other hand, video quality assessment, related to compression processes, should be improved because some objective metrics are adapted to user’s perception. The use of psychovisual models and visual attention diagrams allow the weighting of image regions of interest, giving more importance to the areas which the user will focus most probably. These two work fields are related together through the definition of the term saliency. Saliency is the capacity of human visual system for characterizing an image, highlighting the areas which result more attractive to the human eye. Saliency in generation of 3DTV contents refers mainly to the simulated depth of the optic illusion, i.e. the distance from the virtual object to the human eye. On the other hand, saliency is not based on virtual depth, but on other features, such as motion, level of detail, position of pixels in the frame or face detection, which are the basic features that are part of the developed visual attention model, as demonstrated with tests. Extensive literature involving visual comfort assessment was looked up, and the development of new preliminary subjective assessment with users was performed, in order to detect the features that increase the probability of discomfort to occur. With this methodology, the conclusions drawn confirmed that one common source of visual discomfort was when an abrupt change of disparity happened in video transitions, apart from other degradations, such as window violation. New quality assessment was performed to quantify the distribution of disparities over different sequences. The results confirmed that abrupt changes in negative parallax environment produce accommodation-vergence mismatches derived from the increasing time for human crystalline to focus the virtual objects. On the other side, for developing metrics that adapt to human visual system, additional subjective tests were developed to determine the importance of each factor, which masks a concrete distortion. Results demonstrated slight improvement after applying visual attention to objective metrics. This process of weighing pixels approximates the quality results to human eye’s response.
Resumo:
In two experiments, electric brain waves of 14 subjects were recorded under several different conditions to study the invariance of brain-wave representations of simple patches of colors and simple visual shapes and their names, the words blue, circle, etc. As in our earlier work, the analysis consisted of averaging over trials to create prototypes and test samples, to both of which Fourier transforms were applied, followed by filtering and an inverse transformation to the time domain. A least-squares criterion of fit between prototypes and test samples was used for classification. The most significant results were these. By averaging over different subjects, as well as trials, we created prototypes from brain waves evoked by simple visual images and test samples from brain waves evoked by auditory or visual words naming the visual images. We correctly recognized from 60% to 75% of the test-sample brain waves. The general conclusion is that simple shapes such as circles and single-color displays generate brain waves surprisingly similar to those generated by their verbal names. These results, taken together with extensive psychological studies of auditory and visual memory, strongly support the solution proposed for visual shapes, by Bishop Berkeley and David Hume in the 18th century, to the long-standing problem of how the mind represents simple abstract ideas.
Resumo:
Tactile sensors play an important role in robotics manipulation to perform dexterous and complex tasks. This paper presents a novel control framework to perform dexterous manipulation with multi-fingered robotic hands using feedback data from tactile and visual sensors. This control framework permits the definition of new visual controllers which allow the path tracking of the object motion taking into account both the dynamics model of the robot hand and the grasping force of the fingertips under a hybrid control scheme. In addition, the proposed general method employs optimal control to obtain the desired behaviour in the joint space of the fingers based on an indicated cost function which determines how the control effort is distributed over the joints of the robotic hand. Finally, authors show experimental verifications on a real robotic manipulation system for some of the controllers derived from the control framework.
Resumo:
Drawing from ethnographic, empirical, and historical/cultural perspectives, we examine the extent to which visual aspects of music contribute to the communication that takes place between performers and their listeners. First, we introduce a framework for understanding how media and genres shape aural and visual experiences of music. Second, we present case studies of two performances, and describe the relation between visual and aural aspects of performance. Third, we report empirical evidence that visual aspects of performance reliably influence perceptions of musical structure (pitch related features) and affective interpretations of music. Finally, we trace new and old media trajectories of aural and visual dimensions of music, and highlight how our conceptions, perceptions and appreciation of music are intertwined with technological innovation and media deployment strategies.
Resumo:
The purpose of this study was to test the effects of visual occlusion and fatigue on the motor performance of vertical skills in synchronized swimming. Experienced synchronized swimmers (n = 12) were randomly assigned to either an exercise or nonexercise (control) activity group. Subjective ratings of fatigue were obtained from the swimmers who then each performed four vertical skills under alternating conditions of vision and visual occlusion before and after either a swimming (designed to induce fatigue) or nonphysical activity. A main effect of activity (p < .03) was found for two measures of performance accuracy (lateral and anterior total distance traveled) but not for lateral and anterior maximum deviation from vertical, indicating that fatigue played a role in executing the skills. The data also indicate that the maintenance of a stationary position is a skill of greater difficulty than maintaining a true vertical. In contrast with previous research findings on synchronized swimmers, a significant effect of vision in all conditions was found, with performance decrements in the conditions of visual occlusion showing that vision provided important sensory input for the swimmers.
Resumo:
A number of neurodegenerative diseases caused by prions have been described recently. These include Creutzfeldt-Jakob disease (CJD) in humans, scrapie in sheep and BSE in cows. Patients with CJD may suffer a range of visual problems including eye movement deficits and visual hallucinations. In addition, it is possible that CJD may be acquired via corneal transplant and that prions may be transmitted by reusable contact lenses.
Resumo:
Parkinson's disease (PD) is a common disorder of middle-aged and elderly people, in which there is degeneration of the extra-pyramidal motor system. In some patients, the disease is associated with a range of visual signs and symptoms, including defects in visual acuity, colour vision, the blink reflex, pupil reactivity, saccadic and smooth pursuit movements and visual evoked potentials. In addition, there may be psychophysical changes, disturbances of complex visual functions such as visuospatial orientation and facial recognition, and chronic visual hallucinations. Some of the treatments associated with PD may have adverse ocular reactions. If visual problems are present, they can have an important effect on overall motor function, and quality of life of patients can be improved by accurate diagnosis and correction of such defects. Moreover, visual testing is useful in separating PD from other movement disorders with visual symptoms, such as dementia with Lewy bodies (DLB), multiple system atrophy (MSA) and progressive supranuclear palsy (PSP). Although not central to PD, visual signs and symptoms can be an important though obscure aspect of the disease and should not be overlooked.
Resumo:
The thesis investigated progression of the central 10° visual field with structural changes at the macula in a cross-section of patients with varying degrees of agerelated macular degeneration (AMD). The relationships between structure and function were investigated for both standard and short-wavelength automated perimetry (SWAP). Factors known to influence the measure of visual field progression were considered, including the accuracy of the refractive correction on SWAP thresholds and the learning effect. Techniques of assessing the structure to function relationships between fundus images and the visual field were developed with computer programming and evaluated for repeatability. Drusen quantification of fundus photographs and retro-mode scanning laser ophthalmoscopic images was performed. Visual field progression was related to structural changes derived from both manual and automated methods. Principal Findings: • Visual field sensitivity declined with advancing stage of AMD. SWAP showed greater sensitivity to progressive changes than standard perimetry. • Defects were confined to the central 5°. SWAP defects occurred at similar locations but were deeper and wider than corresponding standard perimetry defects. • The central field became less uniform as severity of AMD increased. SWAP visual field indices of focal loss were of more importance when detecting early change in AMD, than indices of diffuse loss. • The decline in visual field sensitivity over stage of severity of AMD was not uniform, whereas a linear relationship was found between the automated measure of drusen area and visual field parameters. • Perimetry exhibited a stronger relationship with drusen area than other measures of visual function. • Overcorrection of the refraction for the working distance in SWAP should be avoided in subjects with insufficient accommodative facility. • The perimetric learning effect in the 10° field did not differ significantly between normal subjects and AMD patients. • Subretinal deposits appeared more numerous in retro-mode imaging than in fundus photography.
Resumo:
PURPOSE: To determine the objective measures of visual function that are most relevant to subjective quality of vision and perceived reading ability in patients with acquired macular disease. METHODS: Twenty-eight patients with macular disease underwent a comprehensive assessment of visual function. The patients also completed a vision-related quality-of-life questionnaire that included a section of general questions about perceived visual performance and a section with specific questions on reading. RESULTS: Results of all tests of vision correlated highly with reported vision-related quality-of-life impairment. Low-contrast tests explained most of the variance in self-reported problems with reading. Text-reading speed correlated highly with overall concern about vision. CONCLUSIONS: Reading performance is strongly associated with vision-related quality of life. High-contrast distance acuity is not the only relevant measure of visual function in relation to the perceived visual performance of a patient with macular disease. The results suggest the importance of print contrast, even over print size, in reading performance in patients with acquired macular disease.
Resumo:
Progressive supranuclear palsy is a rare, degenerative brain disorder and the second most common syndrome in which the patient exhibits 'parkinsonism', that is, a variety of symptoms involving problems with movement. General symptoms include difficulties with gait and balance; the patient walking clumsily and often falling backwards. The syndrome can be difficult to diagnose and visual signs and symptoms can help to separate it from closely related movement disorders such as Parkinson's disease, multiple system atrophy, dementia with Lewy bodies and corticobasal degeneration. A combination of the presence of vertical supranuclear gaze palsy, fixation instability, lid retraction, blepharospasm and apraxia of eyelid opening and closing may be useful visual signs in the identification of progressive supranuclear palsy. As primary eye-care practitioners, optometrists should be able to identify the visual problems of patients with this disorder and be expected to work with patients and their carers to manage their visual welfare.
Resumo:
The work presented in this thesis is divided into two distinct sections. In the first, the functional neuroimaging technique of Magnetoencephalography (MEG) is described and a new technique is introduced for accurate combination of MEG and MRI co-ordinate systems. In the second part of this thesis, MEG and the analysis technique of SAM are used to investigate responses of the visual system in the context of functional specialisation within the visual cortex. In chapter one, the sources of MEG signals are described, followed by a brief description of the necessary instrumentation for accurate MEG recordings. This chapter is concluded by introducing the forward and inverse problems of MEG, techniques to solve the inverse problem, and a comparison of MEG with other neuroimaging techniques. Chapter two provides an important contribution to the field of research with MEG. Firstly, it is described how MEG and MRI co-ordinate systems are combined for localisation and visualisation of activated brain regions. A previously used co-registration methods is then described, and a new technique is introduced. In a series of experiments, it is demonstrated that using fixed fiducial points provides a considerable improvement in the accuracy and reliability of co-registration. Chapter three introduces the visual system starting from the retina and ending with the higher visual rates. The functions of the magnocellular and the parvocellular pathways are described and it is shown how the parallel visual pathways remain segregated throughout the visual system. The structural and functional organisation of the visual cortex is then described. Chapter four presents strong evidence in favour of the link between conscious experience and synchronised brain activity. The spatiotemporal responses of the visual cortex are measured in response to specific gratings. It is shown that stimuli that induce visual discomfort and visual illusions share their physical properties with those that induce highly synchronised gamma frequency oscillations in the primary visual cortex. Finally chapter five is concerned with localization of colour in the visual cortex. In this first ever use of Synthetic Aperture Magnetometry to investigate colour processing in the visual cortex, it is shown that in response to isoluminant chromatic gratings, the highest magnitude of cortical activity arise from area V2.
Resumo:
The orientations of lines and edges are important in defining the structure of the visual environment, and observers can detect differences in line orientation within the first few hundred milliseconds of scene viewing. The present work is a psychophysical investigation of the mechanisms of early visual orientation-processing. In experiments with briefly presented displays of line elements, observers indicated whether all the elements were uniformly oriented or whether a uniquely oriented target was present among uniformly oriented nontargets. The minimum difference between nontarget and target orientations that was required for effective target-detection (the orientation increment threshold) varied little with the number of elements and their spatial density, but the percentage of correct responses in detection of a large orientation-difference increased with increasing element density. The differing variations with element density of thresholds and percent-correct scores may indicate the operation of more than one mechanism in early visual orientation-processIng. Reducing element length caused threshold to increase with increasing number of elements, showing that the effectiveness of rapid, spatially parallel orientation-processing depends on element length. Orientational anisotropy in line-target detection has been reported previously: a coarse periodic variation and some finer variations in orientation increment threshold with nontarget orientation have been found. In the present work, the prominence of the coarse variation in relation to finer variations decreased with increasing effective viewing duration, as if the operation of coarse orientation-processing mechanisms precedes the operation of finer ones. Orientational anisotropy was prominent even when observers lay horizontally and viewed displays by looking upwards through a black cylinder that excluded all possible visual references for orientation. So, gravitational and visual cues are not essential to the definition of an orientational reference frame for early vision, and such a reference can be well defined by retinocentric neural coding, awareness of body-axis orientation, or both.
Resumo:
Difficulties in visual attention are increasingly being linked to dyslexia. To date, the majority of studies have inferred functionality of attention from response times to stimuli presented for an indefinite duration. However, in paradigms that use reaction times to investigate the ability to orient attention, a delayed reaction time could also indicate difficulties in signal enhancement or noise exclusion once oriented. Thus, in order to investigate attention modulation and visual crowding effects in dyslexia, this study measured stimulus discrimination accuracy to rapidly presented displays. Adults with dyslexia (AwD) and controls discriminated the orientation of a target in an array of different numbers of - and differently spaced - vertically orientated distractors. Results showed that AwD: were disproportionately impacted by (i) close spacing and (ii) increased numbers of stimuli, (iii) did use pre-cues to modulate attention, but (iv) used cues less successfully to counter effects of increasing numbers of distractors. A greater dependence on pre-cues, larger effects of crowding and the impact of increased numbers of distractors all correlated significantly with measures of literacy. These findings extend previous studies of visual crowding of letters in dyslexia to non-complex stimuli. Overall, AwD do not use cues less, but they do use cues less successfully. We conclude that visual attention is an important factor to consider in the aetiology of dyslexia. The results challenge existing theoretical accounts of visual attention deficits, which alone are unable to comprehensively explain the pattern of findings demonstrated here.
Resumo:
Alzheimer’s disease (AD) is an important neurodegenerative disorder causing visual problems in the elderly population. The pathology of AD includes the deposition in the brain of abnormal aggregates of ?-amyloid (A?) in the form of senile plaques (SP) and abnormally phosphorylated tau in the form of neurofibrillary tangles (NFT). A variety of visual problems have been reported in patients with AD including loss of visual acuity (VA), colour vision and visual fields; changes in pupillary responses to mydriatics, defects in fixation and in smooth and saccadic eye movements; changes in contrast sensitivity and in visual evoked potentials (VEP); and disturbances in complex visual tasks such as reading, visuospatial function, and in the naming and identification of objects. In addition, pathological changes have been observed to affect the eye, visual pathway, and visual cortex in AD. To better understand degeneration of the visual cortex in AD, the laminar distribution of the SP and NFT was studied in visual areas V1 and V2 in 18 cases of AD which varied in disease onset and duration. In area V1, the mean density of SP and NFT reached a maximum in lamina III and in laminae II and III respectively. In V2, mean SP density was maximal in laminae III and IV and NFT density in laminae II and III. The densities of SP in laminae I of V1 and NFT in lamina IV of V2 were negatively correlated with patient age. No significant correlations were observed in any cortical lamina between the density of NFT and disease onset or duration. However, in area V2, the densities of SP in lamina II and lamina V were negatively correlated with disease duration and disease onset respectively. In addition, there were several positive correlations between the densities of SP and NFT in V1 with those in area V2. The data suggest: (1) NFT pathology is greater in area V2 than V1, (2) laminae II/III of V1 and V2 are most affected by the pathology, (3) the formation of SP and NFT in V1 and V2 are interconnected, and (4) the pathology may spread between visual areas via the feed-forward short cortico-cortical connections.