925 results for Human visual processing


Relevance: 80.00%

Abstract:

Visual cortex of macaque monkeys consists of a large number of cortical areas that span the occipital, parietal, temporal, and frontal lobes and occupy more than half of cortical surface. Although considerable progress has been made in understanding the contributions of many occipital areas to visual perceptual processing, much less is known concerning the specific functional contributions of higher areas in the temporal and frontal lobes. Previous behavioral and electrophysiological investigations have demonstrated that the inferotemporal cortex (IT) is essential to the animal's ability to recognize and remember visual objects. While it is generally recognized that IT consists of a number of anatomically and functionally distinct visual-processing areas, there remains considerable controversy concerning the precise number, size, and location of these areas. Therefore, the precise delineation of the cortical subdivisions of inferotemporal cortex is critical for any significant progress in the understanding of the specific contributions of inferotemporal areas to visual processing. In this study, anterograde and/or retrograde neuroanatomical tracers were injected into two visual areas in the ventral posterior and central portions of IT (areas PITv and CITvp) to elucidate the corticocortical connections of these areas with well known areas of occipital cortex and with less well understood regions of inferotemporal cortex. The locations of injection sites and the delineation of the borders of many occipital areas were aided by the pattern of interhemispheric connections, revealed following callosal transection and subsequent labeling with HRP. The resultant patterns of connections were represented on two-dimensional computational (CARET) and manual cortical maps and the laminar characteristics and density of the projection fields were quantified. 
The laminar and density features of these corticocortical connections demonstrate thirteen anatomically distinct subdivisions or areas distributed within the superior temporal sulcus and across the inferotemporal gyrus. These results serve to refine previous descriptions of inferotemporal areas, validate recently identified areas, and provide a new description of the hierarchical relationships among occipitotemporal cortical areas in macaques.

Relevance: 80.00%

Abstract:

High-definition video quality metric built from full-reference ratios. Visual Quality Assessment (VQA) is one of the greatest unsolved challenges in the multimedia field. Video quality has a very strong impact on how end users (consumers) perceive services built on the delivery of multimedia content, and it is therefore a key factor in assessing the new paradigm known as Quality of Experience (QoE). Video quality measurement models can be grouped into several branches according to the technical basis of the measurement system. The most important are those that use psychovisual models aimed at reproducing the characteristics of the Human Visual System (HVS) and those that instead take an engineering approach, in which quality is computed by extracting intrinsic parameters of the image and comparing them. Despite the advances made in this field in recent years, research on video quality metrics still has considerable room for improvement and goals to reach, whether the reference is available (full-reference models), only partly available (reduced-reference models), or absent (no-reference models). Within this field, the measurement of high-definition signals, especially the very high-quality signals used in the first stages of the value chain, is of particular interest because of its influence on the final quality of the service, and no reliable measurement models currently exist.
This doctoral thesis presents a full-reference quality measurement model that we have called PARMENIA (PArallel Ratios MEtric from iNtrInsic features Analysis), based on the weighting of four quality ratios computed from intrinsic features of the image: the Fidelity Ratio, computed using the morphological gradient, or Beucher gradient; the Visual Similarity Ratio, computed from the visually significant points of the image through local contrast filtering; the Sharpness Ratio, derived from the Haralick contrast texture statistic; and the Complexity Ratio, obtained from the homogeneity definition in the set of Haralick texture statistics. PARMENIA is novel in using mathematical morphology and Haralick statistics as the basis of a quality metric, since these techniques have traditionally been tied more closely to remote sensing and object segmentation. In addition, treating the metric as a weighted set of ratios is equally novel, because it draws both on structural similarity models and on more classical ones based on the perceptibility of the error produced by compression-related signal degradation. PARMENIA's results correlate very highly with the MOS ratings obtained from the subjective user tests carried out to validate it. The working corpus comes from internationally validated sets of sequences, so that the results reported are of the highest possible quality and rigor.
The working methodology consisted of generating a set of test sequences of different qualities by encoding with different quantization steps, obtaining subjective ratings for them through subjective quality tests (based on International Telecommunication Union Recommendation BT.500), and validating the metric by computing the correlation of PARMENIA with these subjective values, quantified by the Pearson correlation coefficient. Once the ratios had been validated, their influence on the final measure optimized, and their high correlation with perception confirmed, a second review was carried out on sequences from the hdtv test dataset 1 of the Video Quality Experts Group (VQEG); the results obtained show the metric's clear advantages.
Abstract: Visual Quality Assessment has so far been one of the most intriguing challenges in the media environment. The progressive evolution towards higher resolutions, together with increasing quality demands (e.g. high definition and better image quality), calls for a redefinition of quality measurement models. Given the growing interest in multimedia services delivery, perceptual quality measurement has become a very active area of research. First, this work introduces a classification of objective video quality metrics based on their underlying methodologies and approaches for measuring video quality, to sum up the state of the art. Then, this doctoral thesis describes an enhanced solution for full-reference objective quality measurement based on mathematical morphology, texture features and visual similarity information. It provides a normalized metric that we have called PARMENIA (PArallel Ratios MEtric from iNtrInsic features Analysis), which correlates highly with MOS scores.
The PARMENIA metric is based on the pooling of different quality ratios that are obtained from three different approaches: Beucher’s gradient, local contrast filtering, and Haralick’s contrast and homogeneity texture features. The metric performs excellently and improves on the current state of the art by providing a wide dynamic range that makes it easier to discriminate between coded sequences of very similar quality, especially at very high bit rates, where quality differences are currently transparent to quality metrics. PARMENIA introduces a degree of novelty compared with other working metrics. On the one hand, it exploits the variation in structural information to build the metric’s kernel, but complements the measure with texture information and a ratio of visually meaningful points that is closer to typical error-sensitivity-based approaches. We would like to point out that PARMENIA is the only metric built upon full-reference ratios and using mathematical morphology and texture features (typically used in segmentation) for quality assessment. On the other hand, it yields results with a wide dynamic range that allows the quality of high-definition sequences to be measured from bit rates of hundreds of megabits per second (Mbps) down to typical distribution rates (5-6 Mbps) and even streaming rates (1-2 Mbps). Thus, a direct correlation between PARMENIA and MOS scores is easily constructed. PARMENIA may further extend the available choices in objective quality measurement, especially for very high-quality HD material. All these results come from validation on internationally validated datasets, on which subjective tests based on the ITU-R BT.500 methodology were carried out. The Pearson correlation coefficient was calculated to verify the accuracy and reliability of PARMENIA.
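The building blocks named in this abstract (Beucher's morphological gradient, Haralick contrast and homogeneity, and Pearson correlation against MOS) can be illustrated with a minimal pure-Python sketch. This is not the PARMENIA implementation: the 3x3 structuring element, the horizontal co-occurrence offset, and all function names are assumptions chosen for illustration, and the abstract does not say how the four ratios are derived or weighted.

```python
import math

def beucher_gradient(img):
    """Morphological (Beucher) gradient: dilation minus erosion.

    Assumes a 3x3 square structuring element, clipped at the image border.
    """
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            window = [img[j][i]
                      for j in range(max(0, y - 1), min(h, y + 2))
                      for i in range(max(0, x - 1), min(w, x + 2))]
            out[y][x] = max(window) - min(window)
    return out

def glcm(img, levels, dx=1, dy=0):
    """Normalised, symmetric grey-level co-occurrence matrix for offset (dx, dy)."""
    h, w = len(img), len(img[0])
    counts = [[0.0] * levels for _ in range(levels)]
    total = 0
    for y in range(h):
        for x in range(w):
            y2, x2 = y + dy, x + dx
            if 0 <= y2 < h and 0 <= x2 < w:
                a, b = img[y][x], img[y2][x2]
                counts[a][b] += 1  # count the pair in both directions
                counts[b][a] += 1
                total += 2
    return [[c / total for c in row] for row in counts]

def haralick_contrast(p):
    """Haralick contrast: sum of p(i, j) * (i - j)^2."""
    n = len(p)
    return sum(p[i][j] * (i - j) ** 2 for i in range(n) for j in range(n))

def haralick_homogeneity(p):
    """Haralick homogeneity: sum of p(i, j) / (1 + (i - j)^2)."""
    n = len(p)
    return sum(p[i][j] / (1 + (i - j) ** 2) for i in range(n) for j in range(n))

def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)
```

In a validation of the kind described, `pearson` would be applied to the metric's scores and the corresponding MOS values for the same set of test sequences.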

Relevance: 80.00%

Abstract:

Brain oscillations are closely correlated with human information processing and fundamental aspects of cognition. Previous literature shows that, because of the relation between brain oscillations and memory processes, spectral dynamics during memory tasks are good candidates for studying and characterizing memory-related pathologies. Mild cognitive impairment (MCI), defined as a clinical condition characterized by memory impairment and/or deterioration of additional cognitive domains, is considered a preliminary stage in the dementia process. Consequently, the study of its brain patterns could help to achieve an early diagnosis of Alzheimer's disease.

Relevance: 80.00%

Abstract:

A novel scheme for depth sequence compression, based on a perceptual coding algorithm, is proposed. A depth sequence describes the position of objects in the 3D scene and is used in Free Viewpoint Video to generate synthetic video sequences. Perceptual video coding exploits the characteristics of the human visual system to improve compression efficiency. Because depth sequences are never displayed, perceptual video coding assessed directly on them is not effective. The proposed algorithm is based on a novel perceptual rate-distortion optimization process, assessed on the perceptual distortion of the rendered views generated from the encoded depth sequences. The experimental results show the effectiveness of the proposed method, which obtains a very considerable improvement in the perceptual quality of the rendered views.
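The idea of assessing distortion on the rendered view rather than on the depth map can be sketched as a Lagrangian choice, J = D_rendered + λ·R. The sketch below only illustrates that general principle under stated assumptions: the abstract gives no details of the actual algorithm, the callables are hypothetical, and the toy encoder, renderer and squared-error distortion are stand-ins for a real codec, view synthesis and perceptual measure.

```python
def choose_depth_quantizer(candidates, encode_depth, render_view, view_distortion, lam):
    """Return (step, cost) minimising J = D_rendered + lam * R over the candidates.

    encode_depth(step) -> (decoded_depth, bits); render_view and view_distortion
    are supplied by the caller. All of these are hypothetical interfaces.
    """
    best = None
    for step in candidates:
        decoded_depth, bits = encode_depth(step)
        rendered = render_view(decoded_depth)
        # Distortion is measured on the rendered view, not on the depth itself.
        cost = view_distortion(rendered) + lam * bits
        if best is None or cost < best[1]:
            best = (step, cost)
    return best

# Toy stand-ins so the sketch runs end to end (assumptions, not a real codec).
DEPTH = [10, 12, 50, 52]

def toy_encode(step):
    """Uniformly quantise the depth samples; coarser steps cost fewer bits."""
    decoded = [round(d / step) * step for d in DEPTH]
    bits = 64 // step
    return decoded, bits

def toy_render(depth):
    """Placeholder for view synthesis: any deterministic function of depth."""
    return [2 * d for d in depth]

REFERENCE_VIEW = toy_render(DEPTH)

def toy_distortion(rendered):
    """Squared error against the view rendered from uncompressed depth."""
    return sum((a - b) ** 2 for a, b in zip(rendered, REFERENCE_VIEW))
```

With `lam = 0` the search favours the finest quantization step; a larger `lam` trades rendered-view quality for a lower bit count.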

Relevance: 80.00%

Abstract:

The development of cognitive robots needs strong “sensorial” support that allows them to perceive the real world and interact with it properly. The development of efficient visual-processing software for effective artificial agents is therefore a must. In this project we study and develop visual-processing software that will work as the “eyes” of a cognitive robot. This software performs a three-dimensional mapping of the robot’s environment, providing it with the essential information required to make proper decisions during navigation. Because of the complexity of this objective, we adopted the Scrum methodology to achieve an agile development process, which allowed us to correct and improve successive versions of the product quickly. The project is structured in Sprints, which cover the different stages of software development based on the requirements imposed by the robot and its real needs. We initially explored different commercial devices for acquiring the required visual information and adopted the Kinect sensor camera (Microsoft) as the most suitable option. We then studied the software available to manage the acquired visual information and its integration with the robot’s software, choosing the high-level platform Matlab as the common nexus joining the management of the camera, the management of the robot and the implementation of the behavioral algorithms. During the last stages the software was developed to include the fundamental functionality required to process the real environment, such as depth representation, segmentation, and clustering. Finally, the software was optimized for real-time processing and for performance suitable for the robot’s requirements during operation in real situations.

Relevance: 80.00%

Abstract:

The influence of Theosophy on the symbolist painting of Mondrian (1908-1911) has been unanimously recognized. There is not, however, the same consensus with respect to the influence of Theosophy on his neoplastic period. There is a relationship between Mondrian’s theoretical writing and his practical work, but no proportionality. Mondrian’s theoretical discourse is not limited to painting and touches on other arts and disciplines (architecture, urbanism). Mondrian goes on to define a complex – philosophical? – system whose final goal is to completely transform the human visual environment. That is to say, the area covered by his theoretical discourse widely exceeds that of his practical activity (painting). The goal of this article is to try to delimit the scope of Theosophy’s influence on Mondrian’s work during his neoplastic period, in both his theoretical writing and his practical production.

Relevance: 80.00%

Abstract:

A frame-level distortion model based on perceptual features of the human visual system is proposed to improve the performance of unequal error protection strategies and provide better quality of experience to users in Side-by-Side 3D video delivery systems.

Relevance: 80.00%

Abstract:

Assessing video quality is a complex task. Most pixel-based metrics do not correlate well enough with subjective results, yet algorithms need to correspond to human perception when analyzing the quality of a video sequence. To analyze the perceived quality resulting from specific video artifacts in a given region of interest, we present a novel methodology for generating test sequences that allow the impact of each individual distortion to be analyzed. From the results of subjective assessment it is possible to create psychovisual models based on weighting pixels belonging to different regions of interest, distributed by color, position, motion or content. The subjective assessment yields interesting results that demonstrate the need for new metrics adapted to the human visual system.
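One simple way to picture "weighting pixels belonging to different regions of interest" is a weighted PSNR, in which errors inside a region of interest count more than errors elsewhere. The sketch below is an illustrative assumption rather than the model proposed in this abstract: the function name, the per-pixel weight map and the 8-bit peak value are all hypothetical.

```python
import math

def weighted_psnr(ref, dist, weights, peak=255.0):
    """PSNR computed from a weighted mean squared error.

    ref, dist and weights are equally sized 2D lists; a larger weight marks a
    pixel as more important (for example, inside a region of interest).
    """
    num = 0.0
    den = 0.0
    for r_row, d_row, w_row in zip(ref, dist, weights):
        for r, d, w in zip(r_row, d_row, w_row):
            num += w * (r - d) ** 2
            den += w
    wmse = num / den
    if wmse == 0:
        return math.inf  # identical frames under this weighting
    return 10 * math.log10(peak ** 2 / wmse)
```

The same distorted pixel lowers the score more when the weight map places it inside a region of interest, which is the behavior a pure pixel-based metric lacks.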

Relevance: 80.00%

Abstract:

The project arises from a previous project in which a model was built to represent information about higher education through a network of ontologies, providing a common definition of important concepts. This project consists of developing a tool capable of generating educational data from that ontology network, following the Linked Data paradigm [1]. The tool will extract data from different educational sources and transform them into linked data. GATE Developer [2] was used to carry out this work. It is a development environment that provides a comprehensive set of interactive graphical tools for creating, measuring and maintaining software components for human language processing.

Relevance: 80.00%

Abstract:

The human visual system is able to effortlessly integrate local features to form our rich perception of patterns, despite the fact that visual information is discretely sampled by the retina and cortex. By using a novel perturbation technique, we show that the mechanisms by which features are integrated into coherent percepts are scale-invariant and nonlinear (phase and contrast polarity independent). They appear to operate by assigning position labels or “place tags” to each feature. Specifically, in the first series of experiments, we show that the positional tolerance of these place tags in foveal and peripheral vision is about half the separation of the features, suggesting that the neural mechanisms that bind features into forms are quite robust to topographical jitter. In the second series of experiments, we asked how many stimulus samples are required for pattern identification by human and ideal observers. In human foveal vision, only about half the features are needed for reliable pattern interpolation. In this regard, human vision is quite efficient (ratio of ideal to real ≈ 0.75). Peripheral vision, on the other hand, is rather inefficient, requiring more features, suggesting that the stimulus may be relatively underrepresented at the stage of feature integration.

Relevance: 80.00%

Abstract:

Cerebral networks are complex sets of connections that resemble a ladder-like web of multiple parallel feedforward, lateral, and feedback connections. This static anatomical description has been pivotal in guiding our understanding of signal processing within cerebral networks. However, measures on both magnitude and functional significance of connections are extremely limited. Here, we compare the anatomically defined strengths of a set of cerebral pathways emerging from the visual middle suprasylvian (MS) cortex of the cat with measures of the functional impact the same region has over distant sites. These functional measures were obtained by analyzing the local and distant effects of MS cooling deactivation on deoxyglucose uptake. Relative to major efferent projections from MS cortex that have a strong influence, projections to early visual processing stages have weaker functional influences than predicted from the anatomy. For higher processing stages, the converse holds: projections from MS cortex have stronger functional influence than predicted from the anatomy. We conclude that these and future functional measures, obtained using the same combination of techniques, will furnish fundamental, new information that complements and extends current models of static cerebral networks, and lead to more realistic models of cerebral network function and component interactions.

Relevance: 80.00%

Abstract:

To understand how the human visual system analyzes images, it is essential to know the structure of the visual environment. In particular, natural images display consistent statistical properties that distinguish them from random luminance distributions. We have studied the geometric regularities of oriented elements (edges or line segments) present in an ensemble of visual scenes, asking how much information the presence of a segment in a particular location of the visual scene carries about the presence of a second segment at different relative positions and orientations. We observed strong long-range correlations in the distribution of oriented segments that extend over the whole visual field. We further show that a very simple geometric rule, cocircularity, predicts the arrangement of segments in natural scenes, and that different geometrical arrangements show relevant differences in their scaling properties. Our results show similarities to geometric features of previous physiological and psychophysical studies. We discuss the implications of these findings for theories of early vision.
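The cocircularity rule can be made concrete with a short sketch. Two oriented segments are cocircular when both are tangent to the same circle, which geometrically means their orientations θ1 and θ2 satisfy θ1 + θ2 = 2φ (mod π), where φ is the direction of the line joining their positions. The function below measures the deviation from that condition. Its name and interface are assumptions for illustration and do not reproduce the analysis used in the study.

```python
import math

def cocircularity_deviation(p1, theta1, p2, theta2):
    """Angular deviation (radians, in [0, pi/2]) from perfect cocircularity.

    p1 and p2 are (x, y) positions; theta1 and theta2 are segment orientations
    in radians, defined modulo pi. A result of 0 means both segments are
    tangent to a common circle (or collinear, the infinite-radius case).
    """
    phi = math.atan2(p2[1] - p1[1], p2[0] - p1[0])  # direction of the joining line
    d = (theta1 + theta2 - 2 * phi) % math.pi
    return min(d, math.pi - d)
```

For example, two tangents taken at any two points of the same circle give a deviation of zero up to floating-point error, whereas a horizontal segment followed, along the horizontal axis, by a vertical segment gives the maximum deviation of π/2.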

Relevance: 80.00%

Abstract:

Behavioral models indicate that persistent small afferent input, as generated by tissue injury, results in a hyperalgesia at the site of injury and a tactile allodynia in areas adjacent to the injury site. Hyperalgesia reflects a sensitization of the peripheral terminal and a central facilitation evoked by the persistent small afferent input. The allodynia reflects a central sensitization. The spinal pharmacology of these pain states has been defined in the unanesthetized rat prepared with spinal catheters for injection and dialysis. After tissue injury, excitatory transmitters (e.g., glutamate and substance P) acting through N-methyl-d-aspartate (NMDA) and neurokinin 1 receptors initiate a cascade that evokes (i) release of NO, (ii) release of cyclooxygenase products, and (iii) activation of several kinases. Spinal dialysis shows amino acid and prostanoid release after cutaneous injury. Spinal neurokinin 1, NMDA, and non-NMDA receptors enhance spinal prostaglandin E2 release. Spinal prostaglandins facilitate release of spinal amino acids and peptides. Activation by intrathecal injection of receptors on spinal C fiber terminals (μ/δ opiate, α2 adrenergic, neuropeptide Y) prevents release of primary afferent peptides and spinal amino acids and blocks acute and facilitated pain states. Conversely, consistent with their role in facilitated processing, NMDA, cyclooxygenase 2, and NO synthase inhibitors act to diminish only hyperalgesia. Importantly, spinal delivery of several of these agents diminishes human injury pain states. This efficacy (i) emphasizes the role of facilitated states in humans, (ii) shows the importance of spinal systems in human pain processing, and (iii) indicates that these preclinical mechanisms reflect processes that regulate the human pain experience.

Relevance: 80.00%

Abstract:

The basal ganglia are known to receive inputs from widespread regions of the cerebral cortex, such as the frontal, parietal, and temporal lobes. Of these cortical areas, only the frontal lobe is thought to be the target of basal ganglia output. One of the cortical regions that is a source of input to the basal ganglia is area TE, in inferotemporal cortex. This cortical area is thought to be critically involved in the recognition and discrimination of visual objects. Using retrograde transneuronal transport of herpes simplex virus type 1, we have found that one of the output nuclei of the basal ganglia, the substantia nigra pars reticulata, projects via the thalamus to TE. Thus, TE is not only a source of input to the basal ganglia, but also is a target of basal ganglia output. This result implies that the output of the basal ganglia influences higher order aspects of visual processing. In addition, we propose that dysfunction of the basal ganglia loop with TE leads to alterations in visual perception, including visual hallucinations.

Relevance: 80.00%

Abstract:

It has been known for more than 40 years that images fade from perception when they are kept at the same position on the retina by abrogating eye movements. Although aspects of this phenomenon were described earlier, the use of close-fitting contact lenses in the 1950s made possible a series of detailed observations on eye movements and visual continuity. In the intervening decades, many investigators have studied the role of image motion on visual perception. Although several controversies remain, it is clear that images deteriorate and in some cases disappear following stabilization; eye movements are, therefore, essential to sustained exoptic vision. The time course of image degradation has generally been reported to be a few seconds to a minute or more, depending upon the conditions. Here we show that images of entoptic vascular shadows can disappear in less than 80 msec. The rapid vanishing of these images implies an active mechanism of image erasure and creation as the basis of normal visual processing.