972 results for visual representation
Abstract:
Autonomous landing is a challenging and important technology for both military and civilian applications of Unmanned Aerial Vehicles (UAVs). In this paper, we present a novel online adaptive visual tracking algorithm that allows UAVs to land autonomously on an arbitrary field (which can be used as the helipad) at real-time frame rates of more than twenty frames per second. The integration of a low-dimensional subspace representation method, an online incremental learning approach and a hierarchical tracking strategy allows the autolanding task to overcome the problems caused by challenging situations such as significant appearance change, varying surrounding illumination, partial helipad occlusion, rapid pose variation, onboard mechanical vibration (no video stabilization), low computational capacity and delayed information communication between the UAV and the Ground Control Station (GCS). The tracking performance of the presented algorithm is evaluated with aerial images from real autolanding flights using a manually labelled ground-truth database. The evaluation results show that the new algorithm tracks the helipad robustly and is accurate enough to close the vision-based control loop.
Abstract:
We demonstrate performance-related changes in cortical and cerebellar activity. The largest learning-dependent changes were observed in the anterior lateral cerebellum, where the extent and intensity of activation correlated inversely with psychophysical performance. After learning had occurred (a few minutes), the cerebellar activation almost disappeared; however, it was restored when the subjects were presented with a novel, untrained direction of motion for which psychophysical performance also reverted to chance level. Similar reductions in the extent and intensity of brain activations in relation to learning occurred in the superior colliculus, anterior cingulate, and parts of the extrastriate cortex. The motion direction-sensitive middle temporal visual complex was a notable exception, where there was an expansion of the cortical territory activated by the trained stimulus. Together, these results indicate that the learning and representation of visual motion discrimination are mediated by different, but probably interacting, neuronal subsystems.
Abstract:
Revealing the layout of cortical maps is important both for understanding the processes involved in their development and for uncovering the mechanisms underlying neural computation. The typical organization of orientation maps in the cat visual cortex is radial; complete orientation cycles are mapped around orientation singularities. In contrast, long linear zones of orientation representation have been detected in the primary visual cortex of the tree shrew. In this study, we searched for the existence of long linear sequences and wide linear zones within orientation preference maps of the cat visual cortex. Optical imaging based on intrinsic signals was used. Long linear sequences and wide linear zones of preferred orientation were occasionally detected along the border between areas 17 and 18, as well as within area 18. Adjacent zones of distinct radial and linear organizations were observed across area 18 of a single hemisphere. However, radial and linear organizations were not necessarily segregated; long (7.5 mm) linear sequences of preferred orientation were found embedded within a typical pinwheel-like organization of orientation. We conclude that, although the radial organization is dominant, perfectly linear organization may develop and perform the processing related to orientation in the cat visual cortex.
Abstract:
Working memory is the process of actively maintaining a representation of information for a brief period of time so that it is available for use. In monkeys, visual working memory involves the concerted activity of a distributed neural system, including posterior areas in visual cortex and anterior areas in prefrontal cortex. Within visual cortex, ventral stream areas are selectively involved in object vision, whereas dorsal stream areas are selectively involved in spatial vision. This domain specificity appears to extend forward into prefrontal cortex, with ventrolateral areas involved mainly in working memory for objects and dorsolateral areas involved mainly in working memory for spatial locations. The organization of this distributed neural system for working memory in monkeys appears to be conserved in humans, though some differences between the two species exist. In humans, as compared with monkeys, areas specialized for object vision in the ventral stream have a more inferior location in temporal cortex, whereas areas specialized for spatial vision in the dorsal stream have a more superior location in parietal cortex. Displacement of both sets of visual areas away from the posterior perisylvian cortex may be related to the emergence of language over the course of brain evolution. Whereas areas specialized for object working memory in humans and monkeys are similarly located in ventrolateral prefrontal cortex, those specialized for spatial working memory occupy a more superior and posterior location within dorsal prefrontal cortex in humans than in monkeys. As in posterior cortex, this displacement in frontal cortex also may be related to the emergence of new areas to serve distinctively human cognitive abilities.
Abstract:
Efficient and reliable classification of visual stimuli requires that their representations reside in a low-dimensional and, therefore, computationally manageable feature space. We investigated the ability of the human visual system to derive such representations from the sensory input, a highly nontrivial task given the million or so dimensions of the visual signal at its entry point to the cortex. In a series of experiments, subjects were presented with sets of parametrically defined shapes; the points in the common high-dimensional parameter space corresponding to the individual shapes formed regular planar (two-dimensional) patterns such as a triangle, a square, etc. We then used multidimensional scaling to arrange the shapes in planar configurations, dictated by their experimentally determined perceived similarities. The resulting configurations closely resembled the original arrangements of the stimuli in the parameter space. This achievement of the human visual system was replicated by a computational model derived from a theory of object representation in the brain, according to which similarities between objects, and not the geometry of each object, need to be faithfully represented.
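As a rough illustration of the multidimensional scaling step mentioned in the abstract above, the following sketch uses classical (Torgerson) MDS. This choice is an assumption: the abstract does not say which MDS variant was used. The stimuli are also hypothetical: four points forming a square embedded in a 10-dimensional parameter space, with Euclidean distance standing in for experimentally measured dissimilarity.

```python
import numpy as np


def classical_mds(dist, n_components=2):
    """Classical (Torgerson) MDS: find coordinates whose Euclidean
    distances approximate the given dissimilarity matrix."""
    n = dist.shape[0]
    centering = np.eye(n) - np.ones((n, n)) / n
    # Double-centre the squared dissimilarities to obtain a Gram matrix.
    gram = -0.5 * centering @ (dist ** 2) @ centering
    eigvals, eigvecs = np.linalg.eigh(gram)  # eigenvalues in ascending order
    top = np.argsort(eigvals)[::-1][:n_components]
    # Clip tiny negative eigenvalues that arise from rounding error.
    scale = np.sqrt(np.clip(eigvals[top], 0.0, None))
    return eigvecs[:, top] * scale


def pairwise_distances(points):
    """Euclidean distance between every pair of rows."""
    diff = points[:, None, :] - points[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))


# Hypothetical stimuli: a unit square placed on a random plane in 10-D.
rng = np.random.default_rng(0)
basis, _ = np.linalg.qr(rng.normal(size=(10, 2)))  # 10x2, orthonormal columns
square_2d = np.array([[0.0, 0.0], [1.0, 0.0], [1.0, 1.0], [0.0, 1.0]])
stimuli = square_2d @ basis.T  # shape (4, 10)

dissimilarity = pairwise_distances(stimuli)
recovered = classical_mds(dissimilarity)  # shape (4, 2)

# The recovered planar layout reproduces the original inter-point distances,
# so it is the same square up to rotation, reflection and translation.
print(np.round(pairwise_distances(recovered), 3))
```

With real perceptual data the dissimilarities would be noisy and not exactly Euclidean, so the recovered configuration would only approximate the parameter-space pattern.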
Abstract:
Functional magnetic resonance imaging (fMRI) was used to identify and map the representation of the visual field in seven areas of human cerebral cortex and to identify at least two additional visually responsive regions. The cortical locations of neurons responding to stimulation along the vertical or horizontal visual field meridia were charted on three-dimensional models of the cortex and on unfolded maps of the cortical surface. These maps were used to identify the borders among areas that would be topographically homologous to areas V1, V2, V3, VP, and parts of V3A and V4 of the macaque monkey. Visually responsive areas homologous to the middle temporal/medial superior temporal area complex and unidentified parietal visual areas were also observed. The topography of the visual areas identified thus far is consistent with the organization in macaque monkeys. However, these and other findings suggest that human and simian cortical organization may begin to differ in extrastriate cortex at, or beyond, V3A and V4.
Abstract:
Mental imagery and visual memory have been regarded as distinct components in the encoding of information, associated with different working memory processes. Experimental evidence shows, for example, that performance on memory tasks based on the generation of mental images (visual imagery) is disrupted by dynamic visual noise (DVN), whereas the same effect is not observed in visual memory tasks based on visual perception (visual memory). Although several lines of evidence show that imagery and visual memory tasks rely on different cognitive processes, this does not rule out the possibility that they also share processes, or that some experimental results pointing to differences between the two tasks stem from methodological differences between the paradigms used to study them. Our aim was to equate visual mental imagery and visual memory tasks by means of recognition tasks, using the spatial retro-cue paradigm. Sequences of Roman letters in visual form (visual memory task) and acoustic form (visual mental imagery task) were presented at four different spatial locations. The first and second experiments analysed the time course of retrieval for both the imagery process and the memory process. The third experiment compared the structure of the representations of the two components by presenting DVN during the generation and retrieval stages. Our results show no differences in the storage of visual information during the proposed period; however, DVN affects the efficiency of the retrieval process, that is, response time, with the visual mental image representation being more susceptible to noise. Nevertheless, the time course of retrieval differs between the two components, particularly for imagery, which requires more time to retrieve information than memory does.
The data corroborate the relevance of the retro-cue paradigm, which indicates that spatial attention is required for spatially organized representations, regardless of whether they are viewed or imagined.
Abstract:
The Vernacular Discourse of the "Arab Spring" is a project that bridges the divide between the East and the West by offering new readings of Arab subjectivities. By analyzing the "Arab Spring" through the lens of vernacular discourse, it challenges the Euro-Americo-centric legacies of Orientalism in Western academia and the new wave of extremism in the Arab world by offering alternative representations of Arab bodies and subjectivities. To offer this new reading of the "Arab Spring," it explores the foundations of critical rhetoric as a theory and a practice and argues for a turn towards a critical vernacular discourse. This turn is important because it urges the analysis of different artifacts produced by marginalized groups in order to understand their perspectives, which have largely been foreclosed in traditional cultural studies research. Building on embodied/performative critical rhetoric, the vernacular discourses of the Arab revolutionary body examines other forms of knowledge production that are not merely textual; more specifically, through data gathered in the Lhbib Bourguiba, Tunisia. This analysis of the political revolutionary body unveils the complexity underlying the discussion around issues of identity, agency and representation in the Middle East and North Africa, and calls for a critical study of these issues in the region beyond the binary approach that has been practiced and applied by academics and media analysts. Hence, by analyzing vernacular discourse, this research locates a method of examining and theorizing the dialectic between agency, citizenry, and subjectivity through the study of how power structures are recreated and challenged through the use of the vernacular in revolutionary movements, as well as how marginalized groups construct their own subjectivities through the use of vernacular discourse.
Therefore, highlighting the political prominence of evaluating the Arab Spring as a vernacular discourse is important in creating new ways of understanding communication in postcolonial/neocolonial settings.
Abstract:
Traditional visual servoing systems do not address the tracking of moving objects. When these systems are employed to track a moving object, visual features can leave the image, depending on the object's velocity, causing the tracking task to fail. This occurs especially when the object and the robot are both stopped and the object then starts to move. In this work, we have employed a retina camera based on Address Event Representation (AER) in order to use events as input to the visual servoing system. The events emitted by the camera indicate movement at a pixel. Event-based visual information is processed only at the moment it occurs, reducing the response time of visual servoing systems when they are used to track moving objects.
Abstract:
Spatio-temporal maps of the occipital cortex of macaque monkeys were analyzed using optical imaging of intrinsic signals. The images obtained during localized visual stimulation (IS) were compared with the images obtained on presentation of a blank screen (IB). We first investigated spontaneous variations of the intrinsic signals by analyzing the 100 IBs for each of the three cortical areas. Slow periodic activation was observed in alternation over the cortical areas. Cross-correlation analysis indicated that synchronization of spontaneous activation took place only within each cortical area, not between them. When a small, drifting grating (2° × 2°) was presented on the fovea, a dark spot appeared in the optical image at the cortical representation of this retinal location. It spread bilaterally along the border between V1 and V2, continuing as a number of parallel dark bands covering a large area of the lateral surface of V1. Cross-correlation analysis showed that during visual stimulation the intrinsic signals over all three cortical areas were synchronized, with in-phase activation of V1 and V2 and anti-phase activation of V4 and V1/V2. The significance of these extensive synergistic and antagonistic interactions between different cortical areas is discussed. (C) 2003 Elsevier B.V. All rights reserved.
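The in-phase and anti-phase relationships described in the abstract above can be illustrated with a zero-lag normalized cross-correlation (equivalent to the Pearson correlation coefficient). This is a minimal sketch on synthetic sine waves, not the authors' analysis. The signal names, the 100-frame length and the 25-frame period are hypothetical.

```python
import math


def normalized_cross_correlation(x, y):
    """Zero-lag normalized cross-correlation (Pearson r) of two
    equal-length signals. Returns a value between -1 and +1."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("signals must have the same length (at least 2)")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("signals must not be constant")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical slow oscillations sampled over 100 frames.
n_frames = 100
period = 25
signal_a = [math.sin(2 * math.pi * t / period) for t in range(n_frames)]
signal_b = [0.5 * s for s in signal_a]  # same phase, smaller amplitude
signal_c = [-s for s in signal_a]       # shifted by half a period

r_in_phase = normalized_cross_correlation(signal_a, signal_b)    # close to +1
r_anti_phase = normalized_cross_correlation(signal_a, signal_c)  # close to -1
print(r_in_phase, r_anti_phase)
```

A full cross-correlation analysis would also evaluate the correlation at non-zero lags; the zero-lag value is enough to separate in-phase from anti-phase signals.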
Abstract:
There is a growing body of evidence that the processes mediating the allocation of spatial attention within objects may be separable from those governing attentional distribution between objects. In the neglect literature, a related proposal has been made regarding the perception of (within-object) sizes and (between-object) distances. This proposal follows observations that, in size-matching and bisection tasks, neglect is more strongly expressed when patients are required to attend to the sizes of discrete objects than to the (unfilled) distances between objects. These findings are consistent with a partial dissociation between size and distance processing, but a simpler alternative must also be considered. Whilst a neglect patient may fail to explore the full extent of a solid stimulus, the estimation of an unfilled distance requires that both endpoints be inspected before the task can be attempted at all. The attentional cueing implicit in distance estimation tasks might thus account for their superior performance by neglect patients. We report two bisection studies that address this issue. The first confirmed, amongst patients with left visual neglect, a reliable reduction of rightward error for unfilled gap stimuli as compared with solid lines. The second study assessed the cause of this reduction, deconfounding the effects of stimulus type (lines vs. gaps) and attentional cueing, by applying an explicit cueing manipulation to line and gap bisection tasks. Under these matched cueing conditions, all patients performed similarly on line and gap bisection tasks, suggesting that the reduction of neglect typically observed for gap stimuli may be attributable entirely to cueing effects. We found no evidence that a spatial extent, once fully attended, is judged any differently according to whether it is filled or unfilled.
Abstract:
The McGurk effect, in which auditory [ba] dubbed onto [ga] lip movements is perceived as "da" or "tha", was employed in a real-time task to investigate auditory-visual speech perception in prelingual infants. Experiments 1A and 1B established the validity of real-time dubbing for producing the effect. In Experiment 2, 4½-month-olds were tested in a habituation-test paradigm, in which an auditory-visual stimulus was presented contingent upon visual fixation of a live face. The experimental group was habituated to a McGurk stimulus (auditory [ba], visual [ga]), and the control group to matching auditory-visual [ba]. Each group was then presented with three auditory-only test trials, [ba], [da], and [ða] (as in then). Visual-fixation durations in test trials showed that the experimental group treated the emergent percept in the McGurk effect, [da] or [ða], as familiar (even though they had not heard these sounds previously) and [ba] as novel. For control group infants, [da] and [ða] were no more familiar than [ba]. These results are consistent with infants' perception of the McGurk effect, and support the conclusion that prelinguistic infants integrate auditory and visual speech information. (C) 2004 Wiley Periodicals, Inc.
Abstract:
We tested current hypotheses on the functional organization of the third visual complex, a particularly controversial region of the primate extrastriate cortex. In anatomical experiments, injections of retrograde tracers were placed in the dorsal cortex immediately rostral to the second visual area (V2) of New World monkeys (Callithrix jacchus), revealing the topography of interconnections between the third tier cortex and the primary visual area (V1). The data indicate the presence of a dorsomedial area (DM), which represents the entire upper and lower quadrants of the visual field, and which receives strong, topographically organized projections from the superficial layers of V1. The visuotopic organization and boundaries of DM were confirmed by electrophysiological recordings in the same animals and by architectural characteristics which were distinct from those found in ventral extrastriate cortex rostral to V2. There was no electrophysiological or histological evidence for a transitional area between V2 and DM. In particular, the central representation of the upper quadrant in DM was directly adjacent to the representation of the horizontal meridian that marks the rostral border of V2. The present results argue in favor of the hypothesis that the third visual complex in New World monkeys contains different areas in its dorsal and ventral components: area DM, near the dorsal midline, and a homolog of area 19 of other mammals, located more lateral and ventrally. The characteristics of DM suggest that it may correspond to visual area 6 (V6) of Old World monkeys. (C) 2005 Wiley-Liss, Inc.
Abstract:
Objectives. It has been proposed that disruption of the internal proprioceptive representation, via incongruent sensory input, may underpin pathological pain states, but experimental evidence relies on conflicting visual input, which is not clinically relevant. We aimed to determine the symptomatic effect of incongruent proprioceptive input, imparted by vibration of the wrist tendons, which evokes the illusion of perpetual wrist flexion and disrupts cortical proprioceptive representation. Methods. Twenty-nine healthy and naive volunteers reported symptoms during five conditions: control, active and passive wrist flexion, extensor carpi radialis tendon vibration to evoke illusion of perpetual wrist flexion, and ulnar styloid (sham) vibration. No advice was given about possible illusions. Results. Twenty-one subjects reported the illusion of perpetual wrist flexion during tendon vibration. There was no effect of condition or of whether or not subjects reported an illusion on discomfort/pain (P > 0.28). Peculiarity, swelling and foreignness were greater during tendon vibration than during the other conditions, and greater during tendon vibration in those who reported an illusion of wrist flexion than in those who did not (P < 0.05 for all). Symptoms were reported by at least two subjects in each condition and four subjects reported systemic symptoms (e.g. nausea). Conclusions. In healthy volunteers, incongruent proprioceptive input does not cause discomfort or pain but does evoke feelings of peculiarity, swelling and foreignness in the limb.