997 resultados para Vision par ordinateur


Relevância:

20.00% 20.00%

Publicador:

Resumo:

A key goal of computational neuroscience is to link brain mechanisms to behavioral functions. The present article describes recent progress towards explaining how laminar neocortical circuits give rise to biological intelligence. These circuits embody two new and revolutionary computational paradigms: Complementary Computing and Laminar Computing. Circuit properties include a novel synthesis of feedforward and feedback processing, of digital and analog processing, and of pre-attentive and attentive processing. This synthesis clarifies the appeal of Bayesian approaches but has a far greater predictive range that naturally extends to self-organizing processes. Examples from vision and cognition are summarized. A LAMINART architecture unifies properties of visual development, learning, perceptual grouping, attention, and 3D vision. A key modeling theme is that the mechanisms which enable development and learning to occur in a stable way imply properties of adult behavior. It is noted how higher-order attentional constraints can influence multiple cortical regions, and how spatial and object attention work together to learn view-invariant object categories. In particular, a form-fitting spatial attentional shroud can allow an emerging view-invariant object category to remain active while multiple view categories are associated with it during sequences of saccadic eye movements. Finally, the chapter summarizes recent work on the LIST PARSE model of cognitive information processing by the laminar circuits of prefrontal cortex. LIST PARSE models the short-term storage of event sequences in working memory, their unitization through learning into sequence, or list, chunks, and their read-out in planned sequential performance that is under volitional control. LIST PARSE provides a laminar embodiment of Item and Order working memories, also called Competitive Queuing models, that have been supported by both psychophysical and neurobiological data. These examples show how variations of a common laminar cortical design can embody properties of visual and cognitive intelligence that seem, at least on the surface, to be mechanistically unrelated.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Under natural viewing conditions, a single depthful percept of the world is consciously seen. When dissimilar images are presented to corresponding regions of the two eyes, binocular rivalry may occur, during which the brain consciously perceives alternating percepts through time. Perceptual bistability can also occur in response to a single ambiguous figure. These percepts raise basic questions: What brain mechanisms generate a single depthful percept of the world? How do the same mechanisms cause perceptual bistability, notably binocular rivalry? What properties of brain representations correspond to consciously seen percepts? How do the dynamics of the layered circuits of visual cortex generate single and bistable percepts? A laminar cortical model of how cortical areas V1, V2, and V4 generate depthful percepts is developed to explain and quantitatively simulate binocular rivalry data. The model proposes how mechanisms of cortical development, perceptual grouping, and figure-ground perception lead to single and rivalrous percepts.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A neural network theory of :3-D vision, called FACADE Theory, is described. The theory proposes a solution of the classical figure-ground problem for biological vision. It does so by suggesting how boundary representations and surface representations are formed within a Boundary Contour System (BCS) and a Feature Contour System (FCS). The BCS and FCS interact reciprocally to form 3-D boundary and surface representations that arc mutually consistent. Their interactions generate 3-D percepts wherein occluding and occluded object completed, and grouped. The theory clarifies how preattentive processes of 3-D perception and figure-ground separation interact reciprocally with attentive processes of spatial localization, object recognition, and visual search. A new theory of stereopsis is proposed that predicts how cells sensitive to multiple spatial frequencies, disparities, and orientations are combined by context-sensitive filtering, competition, and cooperation to form coherent BCS boundary segmentations. Several factors contribute to figure-ground pop-out, including: boundary contrast between spatially contiguous boundaries, whether due to scenic differences in luminance, color, spatial frequency, or disparity; partially ordered interactions from larger spatial scales and disparities to smaller scales and disparities; and surface filling-in restricted to regions surrounded by a connected boundary. Phenomena such as 3-D pop-out from a 2-D picture, DaVinci stereopsis, a 3-D neon color spreading, completion of partially occluded objects, and figure-ground reversals are analysed. The BCS and FCS sub-systems model aspects of how the two parvocellular cortical processing streams that join the Lateral Geniculate Nucleus to prestriate cortical area V4 interact to generate a multiplexed representation of Form-And-Color-And-Depth, or FACADE, within area V4. Area V4 is suggested to support figure-ground separation and to interact. with cortical mechanisms of spatial attention, attentive objcect learning, and visual search. Adaptive Resonance Theory (ART) mechanisms model aspects of how prestriate visual cortex interacts reciprocally with a visual object recognition system in inferotemporal cortex (IT) for purposes of attentive object learning and categorization. Object attention mechanisms of the What cortical processing stream through IT cortex are distinguished from spatial attention mechanisms of the Where cortical processing stream through parietal cortex. Parvocellular BCS and FCS signals interact with the model What stream. Parvocellular FCS and magnocellular Motion BCS signals interact with the model Where stream. Reciprocal interactions between these visual, What, and Where mechanisms arc used to discuss data about visual search and saccadic eye movements, including fast search of conjunctive targets, search of 3-D surfaces, selective search of like-colored targets, attentive tracking of multi-element groupings, and recursive search of simultaneously presented targets.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A neural network model of 3-D visual perception and figure-ground separation by visual cortex is introduced. The theory provides a unified explanation of how a 2-D image may generate a 3-D percept; how figures pop-out from cluttered backgrounds; how spatially sparse disparity cues can generate continuous surface representations at different perceived depths; how representations of occluded regions can be completed and recognized without usually being seen; how occluded regions can sometimes be seen during percepts of transparency; how high spatial frequency parts of an image may appear closer than low spatial frequency parts; how sharp targets are detected better against a figure and blurred targets are detector better against a background; how low spatial frequency parts of an image may be fused while high spatial frequency parts are rivalrous; how sparse blue cones can generate vivid blue surface percepts; how 3-D neon color spreading, visual phantoms, and tissue contrast percepts are generated; how conjunctions of color-and-depth may rapidly pop-out during visual search. These explanations arise derived from an ecological analysis of how monocularly viewed parts of an image inherit the appropriate depth from contiguous binocularly viewed parts, as during DaVinci stereopsis. The model predicts the functional role and ordering of multiple interactions within and between the two parvocellular processing streams that join LGN to prestriate area V4. Interactions from cells representing larger scales and disparities to cells representing smaller scales and disparities are of particular importance.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Air Force Office of Scientific Research (90-0175); Defense Advanced Research Projects Agency (90-0083); Office of Naval Research (N00014-91-J-4100)

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Adequate hand-washing has been shown to be a critical activity in preventing the transmission of infections such as MRSA in health-care environments. Hand-washing guidelines published by various health-care related institutions recommend a technique incorporating six hand-washing poses that ensure all areas of the hands are thoroughly cleaned. In this paper, an embedded wireless vision system (VAMP) capable of accurately monitoring hand-washing quality is presented. The VAMP system hardware consists of a low resolution CMOS image sensor and FPGA processor which are integrated with a microcontroller and ZigBee standard wireless transceiver to create a wireless sensor network (WSN) based vision system that can be retargeted at a variety of health care applications. The device captures and processes images locally in real-time, determines if hand-washing procedures have been correctly undertaken and then passes the resulting high-level data over a low-bandwidth wireless link. The paper outlines the hardware and software mechanisms of the VAMP system and illustrates that it offers an easy to integrate sensor solution to adequately monitor and improve hand hygiene quality. Future work to develop a miniaturized, low cost system capable of being integrated into everyday products is also discussed.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This thesis investigates the extent and range of the ocular vocabulary and themes employed by the playwright Thomas Middleton in context with early modern scientific, medical, and moral-philosophical writing on vision. More specifically, this thesis concerns Middleton’s revelation of the substance or essence of outward forms through mimesis. This paradoxical stance implies Middleton’s use of an illusory (theatrical) art form to explore hidden truths. This can be related to the early modern belief in the imagination (or fantasy) as chief mediator between the corporeal and spiritual worlds as well as to a reformed belief in the power of signs to indicate divine truth. This thesis identifies striking parallels between Middleton’s policy of social diagnosis and cure and an increased preoccupation with knowledge of interior man which culminates in Robert Burton’s Anatomy of Melancholy of 1621. All of these texts seek a cure for diseased internal sense faculties (such as fantasy and will) which cause the raging passions to destroy the individual. The purpose of this thesis is to demonstrate how Middleton takes a similar ‘mental-medicinal’ approach which investigates the idols created by the imagination before ‘purging’ the same and restoring order (Corneanu and Vermeir 184). The idea of infection incurred through the eyes which are fixed on vice (or error) has moral, religious, and political implications and discovery of corruption involves stripping away the illusions of false appearances to reveal the truth within whereby disease and disorder can be cured and restored. Finally, Middleton’s use of theatrical fantasy to detect the idols of the diseased imagination can be read as a Paracelsian, rather than Galenic, form of medicine whereby like is ‘joined with their like’ (Bostocke C7r) to restore health.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This dissertation examines medieval literary accounts of visions of the afterlife with an origin or provenance in Ireland from the perspective of genre, analysing their structural and literary characteristics both synchronically and diachronically. To this end, I have developed a new typology of medieval vision literature. I address the question in what manner the internationally attested genre of vision literature is adapted and developed in an Irish literary milieu. I explore this central research question through an interrogation of the typological unity of the key texts, both in formal arrangement and in the eschatological themes they express. My analysis of the structure and rhetoric of these narratives reveals the primary role of identity strategies, question-and-answer patterns and exhortation for their narrative cohesion and didactic function. In addition, I was able to make a formal distinction at text-level between the adaptation of the genre as an autonomous unit and the adaptation of thematic motifs as topoi. This further enabled me to nuance the distribution of characteristic features in the genre. My analysis of the spatial and temporal aspects of the eschatological journey confirms a preoccupation with personal eschatology. It reveals a close connection between the development of the aspects of graded access and trial in the genre and a growing awareness of an interim state of the soul after death. Finally, my dissertation provides new editions, translations and analyses of primary sources. My research breaks new ground in the hitherto underexplored area of genre adaptation in Ireland. In addition, it contributes significantly to our understanding of the nature of vision literature both in an Irish and a European context, and to our knowledge of the transmission of eschatological thought in the Latin West. Discusses the visions of: Laisrén, Fursa, Adomnán, Lóchán, Tnugdal, Owein and Visio Sancti Pauli Redactions VI and XI.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The artistic play of light seen on a pyramid in some Mayan ruins located in Cancun, Mexico provided the inspiration for Vision of Equinox. On both the spring and autumn equinox days, the sunlight projected on the pyramid forms a shape which looks like a serpent moving on the stairway of the pyramid. Vision of Equinox was composed with an image of light as the model for the artistic transfiguration of sound. The light image of sound changes its shape in each stage of the piece, using the orchestra in different ways - sometimes like a chamber ensemble, sometimes like one big instrument. The image of light casting on a pyramid is expressed by descending melodic lines that can be heard several times in the piece. At the final climax of the work, a complete and embodied artistic figure is formed and stated, expressing the appearance of the Mayan god Quetzalcoatl, the serpent, in my own imagination. The light and shadow which comprise this pyramid art are treated as two contrasting elements in my composition and become the two main motives in this piece. To express these two contrasting elements, I picked the numbers "5" and "2," and used them as "key numbers" in this piece. As a result, the intervals of a fifth and a second (sometimes inverted as a seventh) are the two main intervals used in the structure. The interval of a fifth was taken into account for the construction of the pyramid, which has five points of contact. The interval of a second was selected as a contrasting sonority to the fifth. Further, the numbers "5" and "2" are used as the number of notes which form the main motives in this piece; quintuplets are used throughout this piece, and the short motive made by two sixteenth notes is used as one of the main motives in this piece. Moreover, the shape of the pyramid provided a concept of symmetry, which is expressed by the setting of a central point of the music (pitch center) as well as the use of retrograde and inversion in this piece.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Dissertation

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The early detection of developmental disorders is key to child outcome, allowing interventions to be initiated which promote development and improve prognosis. Research on autism spectrum disorder (ASD) suggests that behavioral signs can be observed late in the first year of life. Many of these studies involve extensive frame-by-frame video observation and analysis of a child's natural behavior. Although nonintrusive, these methods are extremely time-intensive and require a high level of observer training; thus, they are burdensome for clinical and large population research purposes. This work is a first milestone in a long-term project on non-invasive early observation of children in order to aid in risk detection and research of neurodevelopmental disorders. We focus on providing low-cost computer vision tools to measure and identify ASD behavioral signs based on components of the Autism Observation Scale for Infants (AOSI). In particular, we develop algorithms to measure responses to general ASD risk assessment tasks and activities outlined by the AOSI which assess visual attention by tracking facial features. We show results, including comparisons with expert and nonexpert clinicians, which demonstrate that the proposed computer vision tools can capture critical behavioral observations and potentially augment the clinician's behavioral observations obtained from real in-clinic assessments.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The early detection of developmental disorders is key to child outcome, allowing interventions to be initiated that promote development and improve prognosis. Research on autism spectrum disorder (ASD) suggests behavioral markers can be observed late in the first year of life. Many of these studies involved extensive frame-by-frame video observation and analysis of a child's natural behavior. Although non-intrusive, these methods are extremely time-intensive and require a high level of observer training; thus, they are impractical for clinical and large population research purposes. Diagnostic measures for ASD are available for infants but are only accurate when used by specialists experienced in early diagnosis. This work is a first milestone in a long-term multidisciplinary project that aims at helping clinicians and general practitioners accomplish this early detection/measurement task automatically. We focus on providing computer vision tools to measure and identify ASD behavioral markers based on components of the Autism Observation Scale for Infants (AOSI). In particular, we develop algorithms to measure three critical AOSI activities that assess visual attention. We augment these AOSI activities with an additional test that analyzes asymmetrical patterns in unsupported gait. The first set of algorithms involves assessing head motion by tracking facial features, while the gait analysis relies on joint foreground segmentation and 2D body pose estimation in video. We show results that provide insightful knowledge to augment the clinician's behavioral observations obtained from real in-clinic assessments.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Gemstone Team FACE