926 resultados para visual process
Resumo:
In this report we summarize the state-of-the-art of speech emotion recognition from the signal processing point of view. On the bases of multi-corporal experiments with machine-learning classifiers, the observation is made that existing approaches for supervised machine learning lead to database dependent classifiers which can not be applied for multi-language speech emotion recognition without additional training because they discriminate the emotion classes following the used training language. As there are experimental results showing that Humans can perform language independent categorisation, we made a parallel between machine recognition and the cognitive process and tried to discover the sources of these divergent results. The analysis suggests that the main difference is that the speech perception allows extraction of language independent features although language dependent features are incorporated in all levels of the speech signal and play as a strong discriminative function in human perception. Based on several results in related domains, we have suggested that in addition, the cognitive process of emotion-recognition is based on categorisation, assisted by some hierarchical structure of the emotional categories, existing in the cognitive space of all humans. We propose a strategy for developing language independent machine emotion recognition, related to the identification of language independent speech features and the use of additional information from visual (expression) features.
Resumo:
In recent years there has been an increasing use of visual methods in ageing research. There are, however, limited reflections and critical explorations of the implications of using visual methods in research with people in mid to later life. This paper examines key methodological complexities when researching the daily lives of people as they grow older and the possibilities and limitations of using participant-generated visual diaries. The paper will draw on our experiences of an empirical study, which included a sample of 62 women and men aged 50 years and over with different daily routines. Participant-led photography was drawn upon as a means to create visual diaries, followed by in-depth, photo-elicitation interviews. The paper will critically reflect on the use of visual methods for researching the daily lives of people in mid to later life, as well as suggesting some wider tensions within visual methods that warrant attention. First, we explore the extent to which photography facilitates a ‘collaborative’ research process; second, complexities around capturing the ‘everydayness’ of daily routines are explored; third, the representation and presentation of ‘self’ by participants within their images and interview narratives is examined; and, finally, we highlight particular emotional considerations in visualising daily life.
Resumo:
2000 Mathematics Subject Classification: 62P10, 92C20
Resumo:
There has been an increasing interest in the use of agent-based simulation and some discussion of the relative merits of this approach as compared to discrete-event simulation. There are differing views on whether an agent-based simulation offers capabilities that discrete-event cannot provide or whether all agent-based applications can at least in theory be undertaken using a discrete-event approach. This paper presents a simple agent-based NetLogo model and corresponding discrete-event versions implemented in the widely used ARENA software. The two versions of the discrete-event model presented use a traditional process flow approach normally adopted in discrete-event simulation software and also an agent-based approach to the model build. In addition a real-time spatial visual display facility is provided using a spreadsheet platform controlled by VBA code embedded within the ARENA model. Initial findings from this investigation are that discrete-event simulation can indeed be used to implement agent-based models and with suitable integration elements such as VBA provide the spatial displays associated with agent-based software.
Resumo:
In this study we aim to evaluate the impact of ageing and gender on different visual mental imagery processes. Two hundred and fifty-one participants (130 women and 121 men; age range = 18–77 years) were given an extensive neuropsychological battery including tasks probing the generation, maintenance, inspection, and transformation of visual mental images (Complete Visual Mental Imagery Battery, CVMIB). Our results show that all mental imagery processes with the exception of the maintenance are affected by ageing, suggesting that other deficits, such as working memory deficits, could account for this effect. However, the analysis of the transformation process, investigated in terms of mental rotation and mental folding skills, shows a steeper decline in mental rotation, suggesting that age could affect rigid transformations of objects and spare non-rigid transformations. Our study also adds to previous ones in showing gender differences favoring men across the lifespan in the transformation process, and, interestingly, it shows a steeper decline in men than in women in inspecting mental images, which could partially account for the mixed results about the effect of ageing on this specific process. We also discuss the possibility to introduce the CVMIB in clinical assessment in the context of theoretical models of mental imagery.
Resumo:
This research pursued the conceptualization, implementation, and verification of a system that enhances digital information displayed on an LCD panel to users with visual refractive errors. The target user groups for this system are individuals who have moderate to severe visual aberrations for which conventional means of compensation, such as glasses or contact lenses, does not improve their vision. This research is based on a priori knowledge of the user's visual aberration, as measured by a wavefront analyzer. With this information it is possible to generate images that, when displayed to this user, will counteract his/her visual aberration. The method described in this dissertation advances the development of techniques for providing such compensation by integrating spatial information in the image as a means to eliminate some of the shortcomings inherent in using display devices such as monitors or LCD panels. Additionally, physiological considerations are discussed and integrated into the method for providing said compensation. In order to provide a realistic sense of the performance of the methods described, they were tested by mathematical simulation in software, as well as by using a single-lens high resolution CCD camera that models an aberrated eye, and finally with human subjects having various forms of visual aberrations. Experiments were conducted on these systems and the data collected from these experiments was evaluated using statistical analysis. The experimental results revealed that the pre-compensation method resulted in a statistically significant improvement in vision for all of the systems. Although significant, the improvement was not as large as expected for the human subject tests. Further analysis suggest that even under the controlled conditions employed for testing with human subjects, the characterization of the eye may be changing. This would require real-time monitoring of relevant variables (e.g. pupil diameter) and continuous adjustment in the pre-compensation process to yield maximum viewing enhancement.
Resumo:
Favelas are Brazilian informal housing settlements that are areas of concentrated poverty. In Rio de Janeiro, favelas are perceived as areas of heightened criminal activity and violence, and residents experience discrimination, and little access to quality education and employment opportunities. In this context, hundreds of non-formal educational arts and leisure programs work to build the self-esteem and identity of youth in Rio's favelas as a way of preventing the youth from negative local influences. The Morrinho organization, located in the Pereira da Silva favela in Rio, uses art as a way for the local male youth to communicate their lived reality. This study used a visual critical ethnographic methodology to describe the way in which the Morrinho participants interpret living in a favela. Seventeen semi-structured interviews with young men aged 15 to 29, the feature-length documentary film on the organization, 206 researcher produced documentary style photographs of the Morrinho artwork, and the researcher's field notes were analyzed. Truth claims, ways of seeing as communicated through words and actions, were induced through a cyclical process of reconstructive horizon analysis that incorporated the societal context and critical theory. The participants communicated their concerns about life in a favela; however, they did not describe their societal positions in terms of complete marginalization. They named multiple benefits of living in Pereira da Silva, discussed positive and negative experiences in school, and described ways they circumvented discrimination. Morrinho as an organization was described as an enthralling game and a social project that benefited dozens of local youth. Character development was a valuable result of participation at Morrinho. The Morrinho artwork communicates a nuanced vision of both benevolent and violent social actors, and counters the overwhelmingly negative dominant characterization of Rio de Janeiro's favelas. This study has implications for an inclusive critical pedagogy and the use of art as a means to facilitate a transformative education. Further research is recommended to explore terminology used to refer to favelas, and perceptions that favela residents have of their experiences in public education.
Resumo:
This research pursued the conceptualization, implementation, and verification of a system that enhances digital information displayed on an LCD panel to users with visual refractive errors. The target user groups for this system are individuals who have moderate to severe visual aberrations for which conventional means of compensation, such as glasses or contact lenses, does not improve their vision. This research is based on a priori knowledge of the user's visual aberration, as measured by a wavefront analyzer. With this information it is possible to generate images that, when displayed to this user, will counteract his/her visual aberration. The method described in this dissertation advances the development of techniques for providing such compensation by integrating spatial information in the image as a means to eliminate some of the shortcomings inherent in using display devices such as monitors or LCD panels. Additionally, physiological considerations are discussed and integrated into the method for providing said compensation. In order to provide a realistic sense of the performance of the methods described, they were tested by mathematical simulation in software, as well as by using a single-lens high resolution CCD camera that models an aberrated eye, and finally with human subjects having various forms of visual aberrations. Experiments were conducted on these systems and the data collected from these experiments was evaluated using statistical analysis. The experimental results revealed that the pre-compensation method resulted in a statistically significant improvement in vision for all of the systems. Although significant, the improvement was not as large as expected for the human subject tests. Further analysis suggest that even under the controlled conditions employed for testing with human subjects, the characterization of the eye may be changing. This would require real-time monitoring of relevant variables (e.g. pupil diameter) and continuous adjustment in the pre-compensation process to yield maximum viewing enhancement.
Resumo:
Over the past 30 years, Art Education in interface with disabilities has been a subject of increasing interest in research in academia, especially with regard to Special Education, but still has some shortages in terms of socialization studies to discuss this type of teaching from the perspective of inclusive education. In this scenario, this paper presents an analysis from the field of teaching Visual Arts in the context of school inclusion, with emphasis on teaching drawing to the visually impaired. The conducted literature indicates a number of authors who discuss teaching drawing to people with visual disabilities, who are dedicated primarily to the Special Education context. In this sense, the shortage of research that discuss this teaching from the perspective of inclusive education, this research aimed at the inclusive approach to teaching drawing in the school context. Thus, the aim of this study was to develop a proposal for a pedagogical intervention in Visual Arts, with reference to drawing and its construction process, with the participation of seeing and unseeing students. Therefore, the methodological approach, which was qualitative, was the intervention research, in the light of the Bakhtinian principles of dialogism and otherness, with exploratory study characteristics. The locus of the research was the State School Admiral Newton Braga Faria, which is located in Alecrim, on the East Zone of Natal / RN and is near the Institute for Education and Rehabilitation of the Blind - IERC / RN. The class chosen for intervention was the 7th grade “C” afternoon shift, which had children aged 12 to 16, with 27 students enrolled, three students with disabilities: 02 blind girls and 01 deafblind boy with light hearing and visual loss. As interlocutors of the research, we could also count on the Art teacher who served as a collaborator, as well as teacher in the school’s Multifunction Resource Room. The instruments and research procedures were observation, semi-structured interview, field diary and the photo / video recording. In the development of research, we conducted 10 workshops with multisensory teaching sequences, articulating the physical, tactile and graphical expressions as intrinsic to the reading and production of drawing for both seeing and unseeing students. The process and data built on research allowed for a reflection on cultural experiences with drawing in the school context and on the interactions between seeing and unseeing students in the production and analysis of tactile-visual drawings. They also point out the construction of a teaching approach to drawing, in the context of the common class, from educational workshops that enable artistic and aesthetic interactions from the perspective of school inclusiveness. Thus, we argued that the mobilization of the tactile, physical and graphical expressions can be adopted in a multisensory approach that enables a pedagogical focus that involves all students and is not restricted to the presence of students with visual impairment.
Resumo:
The environment in which we live in, we constantly deal with a huge amount of dynamic information, thus, attention is an indispensable cognitive resource that allows an effective selection of stimuli for our survival. From this, investigating how we process our encouragement in movements and how the attention spreads into a space to serve more than one stimuli simultanously is something very important. The behavioural urgence hipothesis suggests that the encouragement in a movement of approaching shows a certain priority in the process related to objects which are in a movement away, but there are researches that point out that it might not happen in an attentive phase, but instead as a priorization of motor response. There are also many controversies found in researches about attentive focalization, in which some studies suggest that the focus of attention would work in a similar manner to a zoom lens, while some searches indicate that the focus of attention could be shared to answer some stimuli in non contiguous regions. This study tried to investigate through two experiments the effect of attentive priorization by encouragement in movements and how the attention is spread with distractors stimuli. The first experiment investigated if the amount of moving flows really influenced in the process of information. The results indicate an effect of priorization of the flows guided in relation to aleatory ones and also from the unique flow due to dual flow. The second experiment investigated how the distribution of attention is in a space with the use of flows as an exogenous cue. The results indicate that the focus of attention works as the one suggested in the zoom lens model.
Resumo:
As we look around a scene, we perceive it as continuous and stable even though each saccadic eye movement changes the visual input to the retinas. How the brain achieves this perceptual stabilization is unknown, but a major hypothesis is that it relies on presaccadic remapping, a process in which neurons shift their visual sensitivity to a new location in the scene just before each saccade. This hypothesis is difficult to test in vivo because complete, selective inactivation of remapping is currently intractable. We tested it in silico with a hierarchical, sheet-based neural network model of the visual and oculomotor system. The model generated saccadic commands to move a video camera abruptly. Visual input from the camera and internal copies of the saccadic movement commands, or corollary discharge, converged at a map-level simulation of the frontal eye field (FEF), a primate brain area known to receive such inputs. FEF output was combined with eye position signals to yield a suitable coordinate frame for guiding arm movements of a robot. Our operational definition of perceptual stability was "useful stability," quantified as continuously accurate pointing to a visual object despite camera saccades. During training, the emergence of useful stability was correlated tightly with the emergence of presaccadic remapping in the FEF. Remapping depended on corollary discharge but its timing was synchronized to the updating of eye position. When coupled to predictive eye position signals, remapping served to stabilize the target representation for continuously accurate pointing. Graded inactivations of pathways in the model replicated, and helped to interpret, previous in vivo experiments. The results support the hypothesis that visual stability requires presaccadic remapping, provide explanations for the function and timing of remapping, and offer testable hypotheses for in vivo studies. We conclude that remapping allows for seamless coordinate frame transformations and quick actions despite visual afferent lags. With visual remapping in place for behavior, it may be exploited for perceptual continuity.
Resumo:
Objective
Pedestrian detection under video surveillance systems has always been a hot topic in computer vision research. These systems are widely used in train stations, airports, large commercial plazas, and other public places. However, pedestrian detection remains difficult because of complex backgrounds. Given its development in recent years, the visual attention mechanism has attracted increasing attention in object detection and tracking research, and previous studies have achieved substantial progress and breakthroughs. We propose a novel pedestrian detection method based on the semantic features under the visual attention mechanism.
Method
The proposed semantic feature-based visual attention model is a spatial-temporal model that consists of two parts: the static visual attention model and the motion visual attention model. The static visual attention model in the spatial domain is constructed by combining bottom-up with top-down attention guidance. Based on the characteristics of pedestrians, the bottom-up visual attention model of Itti is improved by intensifying the orientation vectors of elementary visual features to make the visual saliency map suitable for pedestrian detection. In terms of pedestrian attributes, skin color is selected as a semantic feature for pedestrian detection. The regional and Gaussian models are adopted to construct the skin color model. Skin feature-based visual attention guidance is then proposed to complete the top-down process. The bottom-up and top-down visual attentions are linearly combined using the proper weights obtained from experiments to construct the static visual attention model in the spatial domain. The spatial-temporal visual attention model is then constructed via the motion features in the temporal domain. Based on the static visual attention model in the spatial domain, the frame difference method is combined with optical flowing to detect motion vectors. Filtering is applied to process the field of motion vectors. The saliency of motion vectors can be evaluated via motion entropy to make the selected motion feature more suitable for the spatial-temporal visual attention model.
Result
Standard datasets and practical videos are selected for the experiments. The experiments are performed on a MATLAB R2012a platform. The experimental results show that our spatial-temporal visual attention model demonstrates favorable robustness under various scenes, including indoor train station surveillance videos and outdoor scenes with swaying leaves. Our proposed model outperforms the visual attention model of Itti, the graph-based visual saliency model, the phase spectrum of quaternion Fourier transform model, and the motion channel model of Liu in terms of pedestrian detection. The proposed model achieves a 93% accuracy rate on the test video.
Conclusion
This paper proposes a novel pedestrian method based on the visual attention mechanism. A spatial-temporal visual attention model that uses low-level and semantic features is proposed to calculate the saliency map. Based on this model, the pedestrian targets can be detected through focus of attention shifts. The experimental results verify the effectiveness of the proposed attention model for detecting pedestrians.
Resumo:
MAIA, Maria Aniolly Queiroz et al. O bibliotecário como mediador no processo de transferência da informação para pessoas com deficiência visual. In: CONGRESSO BRASILEIRO DE BIBLIOTECONOMIA, 24., DOCUMENTAÇÃO E CIÊNCIA DA INFORMAÇÃO, 2011, Maceió. Anais... Maceió: CBBD, 2011
Resumo:
There are few professions in which visual acuity is as important as it is to radiologists. The diagnostic decision making process is composed of a number of events (detection or observation, interpretation and reporting), where the detection phase is subject to a number of physical and psychological phenomena that are critical to the process. Visual acuity is one phenomenon that has often been overlooked, and there is very little research assessing the impact of reduced visual acuity on diagnostic performance. The aim of this study was to investigate the impact of reduced visual acuity on an observer’s ability to detect simulated nodules in an anthropomorphic chest phantom.
Resumo:
A presente investigação propõe-se a atuar no sector turístico, uma vez que este é bombardeado diariamente por uma quantidade considerável de dados e informações. Atualmente, usufrui-se significativamente mais da tecnologia com a finalidade de promover e vender os produtos/serviços disponíveis no mercado. A par da evolução tecnológica, os utilizadores/clientes conseguem comprar, cada vez mais, à distancia de um clique os produtos turísticos que desejam. No entanto, há um variado leque de aplicações sobre o turismo que permitem entender os gostos e as necessidades dos turistas assim como a sua atitude para com o mesmo. Porém, nem as entidades nem os gestores turísticos usufruem inteligentemente dos dados que lhes são facultados. Estes tendem normalmente a prender-se pelo turismo em Portugal e de que forma é que a sua entidade é apresentada acabando por esquecer que os dados podem e devem ser utilizados para expandir o mercado assim como entender/conhecer potenciais mercados. Deste modo, o fundamento principal desta investigação remete para a criação de uma plataforma infocomunicacional que analise na totalidade os dados obtidos, assim como fornecer as ferramentas pertinentes para que se consiga fazer esta análise, nomeadamente através de uma representação infográfica adequada e estratégias de a comunicar aos stakeholders.. Para tal foi aplicada no âmbito desta dissertação a metodologia investigação/ação, vista como um processo cíclico que para além de incluir simultaneamente estas duas vertentes, vai alternando entre a ação e a reflexão critica sendo sustentada por bases teóricas. A criação do protótipo da plataforma Smart Tourism, resultou num sistema inovador que tenta responder aos indicadores escolhidos no Dashbord e ao problema infocomunicacional, tentando criar as bases necessárias para que as entidades consigam analisar de forma mais integrada/sistematizada e racional a atividade turística. Foi por isso, desenvolvido e avaliado qualitativamente um protótipo de base infocomunicacional visual (dashboard visual) que para além do que para além do que já foi referido, consegue proporcionar a gestão dos produtos, clientes, staff e parceiros, aumentando assim o valor deste sector.