935 results for Visual Object Identification Task
Abstract:
There is evidence that automatic visual attention favors the right side. This study investigated whether this lateral asymmetry interacts with the right hemisphere dominance for visual location processing and the left hemisphere dominance for visual shape processing. Volunteers were tested in a location discrimination task and a shape discrimination task. The target stimuli (S2) could occur in the left or right hemifield. They were preceded by an ipsilateral, contralateral or bilateral prime stimulus (S1). The attentional effect produced by the right S1 was larger than that produced by the left S1. This lateral asymmetry was similar between the two tasks, suggesting that the hemispheric asymmetries of visual mechanisms do not contribute to it. The finding that the asymmetry was mainly due to a longer reaction time to the left S2 than to the right S2 in the contralateral S1 condition suggests that the inhibitory component of attention is laterally asymmetric.
Abstract:
Object selection refers to the mechanism of extracting objects of interest while ignoring other objects and the background in a given visual scene. It is a fundamental issue for many computer vision and image analysis techniques, and it remains a challenging task for artificial visual systems. Chaotic phase synchronization takes place in cases involving almost identical dynamical systems; it means that the phase difference between the systems is kept bounded over time, while their amplitudes remain chaotic and may be uncorrelated. Phase synchronization, rather than complete synchronization, is believed to be a mechanism for neural integration in the brain. In this paper, an object selection model is proposed. Oscillators in the network representing the salient object in a given scene are phase synchronized, while no phase synchronization occurs for background objects. In this way, the salient object can be extracted. The model also introduces a shift mechanism to move attention from one object to another. Computer simulations show that the model produces some results similar to those observed in natural vision systems.
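The defining property mentioned in this abstract, a phase difference that stays bounded over time, can be illustrated with a much simpler system than the chaotic oscillators the paper uses. The following is a minimal sketch that swaps in two coupled Kuramoto-style phase oscillators (an assumption made for brevity, not the authors' model); the function name, natural frequencies and coupling values are all hypothetical.

```python
import math

def max_phase_difference(coupling, omega1=1.0, omega2=1.2, dt=0.01, steps=10000):
    """Integrate two coupled phase oscillators with Euler steps and return
    the largest absolute (unwrapped) phase difference seen during the run."""
    theta1, theta2 = 0.0, 0.0
    worst = 0.0
    for _ in range(steps):
        # Each oscillator is pulled toward the phase of the other one.
        d1 = omega1 + coupling * math.sin(theta2 - theta1)
        d2 = omega2 + coupling * math.sin(theta1 - theta2)
        theta1 += dt * d1
        theta2 += dt * d2
        worst = max(worst, abs(theta1 - theta2))
    return worst

# Coupled pair: the phase difference settles and stays bounded.
locked = max_phase_difference(coupling=0.5)
# Uncoupled pair: the phase difference grows without bound.
drifting = max_phase_difference(coupling=0.0)
print(f"coupled: {locked:.3f} rad, uncoupled: {drifting:.3f} rad")
```

In this toy setting the coupled pair plays the role of oscillators belonging to the salient object, and the uncoupled pair plays the role of background oscillators.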
Abstract:
Biological systems readily capture the salient object(s) in a given scene, but this is still a difficult task for artificial vision systems. In this paper, a visual selection mechanism based on an integrate-and-fire neural network is proposed. The model can not only discriminate objects in a given visual scene but also direct the focus of attention to the salient object. Moreover, it processes a combination of relevant features of an input scene, such as intensity, color, orientation, and their contrast. In comparison to other visual selection approaches, this model presents several interesting features. It is able to attend to objects with complex shapes, including those that are not linearly separable. Moreover, computer simulations show that the model produces results similar to those observed in natural vision systems.
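As a rough illustration of the building block named in this abstract, the sketch below simulates a single leaky integrate-and-fire neuron. It is not the network proposed in the paper: treating the input current as a stand-in for feature contrast is an assumption, and the function name, threshold and time step are hypothetical.

```python
def count_spikes(input_current, threshold=1.0, dt=0.001, steps=50000):
    """Simulate dv/dt = -v + I with Euler steps; count threshold crossings.

    The membrane potential v is reset to 0 after every spike.
    """
    v = 0.0
    spikes = 0
    for _ in range(steps):
        v += dt * (-v + input_current)
        if v >= threshold:
            spikes += 1
            v = 0.0  # reset after firing
    return spikes

# Stronger input (assumed here to mean a more salient region) fires faster;
# input below the threshold never fires at all.
for current in (0.8, 1.2, 1.5):
    print(current, count_spikes(current))
```

A selection mechanism can then favor the neurons that fire most often, which is one plausible way such a network could single out a salient object.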
Abstract:
Texture is an important visual attribute used to describe the pixel organization in an image. Although it is easily identified by humans, its analysis demands a high level of sophistication and computational complexity. This paper presents a novel approach to texture analysis, based on analyzing the complexity of the surface generated from a texture, in order to describe and characterize it. The proposed method produces a texture signature which is able to efficiently characterize different texture classes. The paper also illustrates the performance of the novel method in an experiment using texture images of leaves. Leaf identification is a difficult and complex task due to the nature of plants, which present huge pattern variation. The high classification rate obtained shows the potential of the method, which improves on traditional texture techniques such as Gabor filters and Fourier analysis.
Abstract:
Visual attention is a very important task in autonomous robotics, but, because of its complexity, the processing time required is significant. We propose an architecture for feature selection using foveated images that is guided by visual attention tasks and that reduces the processing time required to perform these tasks. Our system can be applied in bottom-up or top-down visual attention. The foveated model determines which scales are to be used in the feature extraction algorithm. The system is able to discard features that are not strictly necessary for the tasks, thus reducing the processing time. If the fovea is correctly placed, it is possible to reduce the processing time without compromising the quality of the task outputs. The distance of the fovea from the object is also analyzed. If the visual system loses tracking in top-down attention, basic strategies of fovea placement can be applied. Experiments have shown that this approach can reduce the processing time by up to 60%. To validate the method, we tested it with the feature algorithm known as Speeded Up Robust Features (SURF), one of the most efficient approaches for feature extraction. With the proposed architecture, we can meet the real-time requirements of robotic vision, particularly for autonomous robotics.
Abstract:
This work uses feature-based computer vision algorithms to identify medicine boxes for the visually impaired. The system is intended for people whose vision is compromised by disease, which hinders the identification of the correct medicine to be taken. We use the camera available in several popular devices, such as computers, televisions and phones, to identify the box of the correct medicine from its image and to play audio presenting information about the medication, such as its dosage, indications and contraindications. We use an object detection model with algorithms that identify the features on the medicine boxes and play the audio when those features are detected. Experiments carried out with 15 people show that 93% think that the system is useful and very helpful in identifying medicine boxes. It is therefore necessary to make use of this technology to help people with visual impairments take the right medicine at the time indicated in advance by the physician.
Abstract:
The listing task, a method used in the social and behavioral sciences, is frequently used in ethnobotanical research to construct folk taxonomies and select relevant items for subsequent research. The objective of the present study was to determine whether visual stimuli are associated with responses to the theme "plants" or whether context influences the answers. Interviews were conducted with 400 women in Rio Claro, São Paulo, Brazil, in four different locations: three with a visible presence of plants (a plant store, a supermarket, and a public plaza) and one with no plants (a street corner in the center of the city). The women were asked to name plants. Analysis indicates that visual stimuli influenced responses and that this influence is more marked in the plant store than in the other locations. The plants cited most often (roses, orchids, ferns, violets, and daisies) were, with little variation, the same in all the locales studied.
Abstract:
Background: The relationship between the normal and tangential force components (grip force, GF, and load force, LF, respectively) acting on the digits-object interface during object manipulation reveals neural mechanisms involved in movement control. Here, we examined whether the type of feedback provided to the participants during exertion of LF would influence GF-LF coordination and task performance. Methods: Sixteen young (24.7 ± 3.8 years old) volunteers isometrically exerted a continuous sinusoidal FZ (vertical component of LF) by pulling a fixed instrumented handle up and relaxing under two feedback conditions: targeting and tracking. In the targeting condition, the FZ exertion range was determined by horizontal lines representing the upper (10 N) and lower (1 N) targets, with the frequency (0.77 or 1.53 Hz) dictated by a metronome. In the tracking condition, a sinusoidal template set at similar frequencies and range was presented, and participants had to superimpose their exerted FZ on it. Task performance was assessed by absolute errors at peaks (AEPeak) and valleys (AEValley), and GF-LF coordination by GF-LF ratios, maximum cross-correlation coefficients (rmax), and time lags. Results: The results revealed no effect of feedback and no feedback-by-frequency interaction on any variable. AEPeak and GF-LF ratio were higher and rmax lower at 1.53 Hz than at 0.77 Hz. Conclusion: These findings indicate that the type of feedback does not influence task performance and GF-LF coordination. Therefore, we recommend the use of tracking tasks when assessing GF-LF coordination during isometric LF exertion in externally fixed instrumented handles because they are easier to understand and provide additional indices (e.g., RMSE) of voluntary force control. © 2013 Pedão et al.; licensee BioMed Central Ltd.
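The coordination indices named in this abstract (maximum cross-correlation coefficient and time lag) can be sketched as follows. This is a minimal illustration on synthetic signals, not the authors' analysis pipeline: the 100 Hz sampling rate, the 0.8 GF/LF gain, the 3-sample delay and all function names are assumptions, while the 1 to 10 N range and 0.77 Hz frequency come from the abstract.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(vx * vy)

def max_cross_correlation(gf, lf, max_lag):
    """Return (r_max, lag); a positive lag means GF trails LF by `lag` samples."""
    best_r, best_lag = -2.0, 0
    for lag in range(-max_lag, max_lag + 1):
        if lag >= 0:
            a, b = gf[lag:], lf[:len(lf) - lag]
        else:
            a, b = gf[:len(gf) + lag], lf[-lag:]
        r = pearson(a, b)
        if r > best_r:
            best_r, best_lag = r, lag
    return best_r, best_lag

fs, freq, n, shift = 100, 0.77, 1000, 3  # assumed sampling rate and delay
# Load force oscillating between 1 N and 10 N.
lf = [5.5 + 4.5 * math.sin(2 * math.pi * freq * i / fs) for i in range(n)]
# Synthetic grip force: scaled copy of LF delayed by `shift` samples.
gf = [0.8 * (5.5 + 4.5 * math.sin(2 * math.pi * freq * (i - shift) / fs))
      for i in range(n)]

r_max, lag = max_cross_correlation(gf, lf, max_lag=20)
print(f"r_max = {r_max:.4f}, time lag = {1000 * lag / fs:.0f} ms")
```

Restricting the search to lags shorter than one period of the sinusoid avoids picking up a spurious match one full cycle away.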
Abstract:
Pós-graduação em Ciências da Motricidade - IBRC
The phonological and visual basis of developmental dyslexia in Brazilian Portuguese reading children
Abstract:
Evidence from opaque languages suggests that visual attention processing abilities, in addition to phonological skills, may act as cognitive underpinnings of developmental dyslexia. We explored the role of these two cognitive abilities in reading fluency in Brazilian Portuguese, a more transparent orthography than French or English. Sixty-six children with developmental dyslexia and normal Brazilian Portuguese children participated. They were administered three tasks of phonological skills (phoneme identification, phoneme blending, and syllable blending) and three visual tasks (a letter global report task and two non-verbal tasks of visual closure and visual constancy). Results show that Brazilian Portuguese children with developmental dyslexia are impaired not only in phonological processing but also in visual processing. The phonological and visual processing abilities contribute significantly and independently to reading fluency in the whole population. Finally, different cognitively homogeneous subtypes can be identified in the Brazilian Portuguese population of children with developmental dyslexia. Two subsets of children with developmental dyslexia were identified as having a single cognitive disorder, phonological or visual; another group exhibited a double deficit, and a few children showed no visual or phonological disorder. Thus the current findings extend previous data from more opaque orthographies such as French and English, showing the importance of investigating visual processing skills in addition to phonological skills in children with developmental dyslexia, regardless of the orthographic transparency of their language.
Abstract:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Abstract:
Pós-graduação em Ciências Cartográficas - FCT
Abstract:
The effect produced by a warning stimulus (WS) in reaction time (RT) tasks is commonly attributed to a facilitation of sensorimotor mechanisms by alertness. Recently, evidence was presented that this effect is also related to a proactive inhibition of motor control mechanisms. This inhibition would hinder responding to the WS instead of the target stimulus (TS). Some studies have shown that an auditory WS produces a stronger facilitatory effect than a visual WS. The present study investigated whether an auditory WS also produces a stronger inhibitory effect than a visual WS. In one session, the RTs to a visual target in two groups of volunteers were evaluated. In a second session, subjects reacted to the visual target both with (50% of the trials) and without (50% of the trials) a WS. On the trials with a WS, one group received a visual WS and the other group an auditory WS. In the first session, the mean RTs of the two groups did not differ significantly. In the second session, the mean RT of the two groups in the presence of the WS was shorter than in its absence. The mean RT in the absence of the auditory WS was significantly longer than the mean RT in the absence of the visual WS. Mean RTs did not differ significantly between the WS-present conditions of the visual and auditory groups. The longer RTs of the auditory WS group, as opposed to the visual WS group, in the WS-absent trials suggest that an auditory WS exerts a stronger inhibitory influence on responsivity than a visual WS.
Abstract:
This thesis deals with Visual Servoing and its closely connected disciplines, such as projective geometry, image processing, robotics and non-linear control. More specifically, the work addresses the problem of controlling a robotic manipulator through one of the most widely used Visual Servoing techniques: Image Based Visual Servoing (IBVS). In Image Based Visual Servoing the robot is driven by an online feedback control loop that is closed directly in the 2D space of the camera sensor. The work considers the case of a monocular system with a single camera mounted on the robot end effector (eye-in-hand configuration). Through IBVS the system can be positioned with respect to a fixed 3D target by minimizing the differences between its initial view and its goal view, corresponding respectively to the initial and the goal system configurations: the robot Cartesian motion is thus generated only by means of visual information. However, the execution of a positioning control task by IBVS is not straightforward, because singularity problems may occur and local minima may be reached where the reached image is very close to the target one but the 3D positioning task is far from being fulfilled. This happens in particular for large camera displacements, when the initial and the goal target views are noticeably different. To overcome the singularity and local minima drawbacks while maintaining the good robustness properties of IBVS with respect to modeling and camera calibration errors, suitable image path planning can be exploited. This work deals with the problem of generating suitable image plane trajectories for the tracked points of the servoing control scheme (a trajectory is made of a path plus a time law). The generated image plane paths must be feasible, i.e. they must be compliant with the rigid body motion of the camera with respect to the object, so as to avoid image Jacobian singularities and local minima problems.
In addition, the planned image trajectories must generate camera velocity screws which are smooth and within the allowed bounds of the robot. We will show that a scaled 3D motion planning algorithm can be devised in order to generate feasible image plane trajectories. Since the paths in the image are generated offline, it is also possible to tune the planning parameters so as to keep the target inside the camera field of view in the unfortunate cases where the target feature points would otherwise leave the camera images due to 3D robot motions. To test the validity of the proposed approach, both experimental and simulation results are reported, also taking into account the influence of noise on the path planning strategy. The experiments were carried out with a 6-DOF anthropomorphic manipulator with a FireWire camera installed on its end effector: the results demonstrate the good performance and the feasibility of the proposed approach.
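For orientation, the feedback loop described in this abstract is usually written in the following textbook form. This is the classical point-feature IBVS formulation, given here as a sketch under the assumption that the thesis builds on it; the abstract itself does not state these equations, and the symbols (normalized image coordinates x, y, depth Z, gain λ, camera velocity screw v_c) are introduced only for illustration.

```latex
% Image-plane motion of one point feature s = (x, y) at depth Z,
% driven by the camera velocity screw v_c = (v_x, v_y, v_z, \omega_x, \omega_y, \omega_z):
\dot{s} = L_s \, v_c, \qquad
L_s =
\begin{bmatrix}
-\dfrac{1}{Z} & 0 & \dfrac{x}{Z} & x y & -(1 + x^2) & y \\[6pt]
0 & -\dfrac{1}{Z} & \dfrac{y}{Z} & 1 + y^2 & -x y & -x
\end{bmatrix}

% Classical control law for a fixed goal view s^*,
% using an estimate \hat{L}_s of the interaction matrix:
v_c = -\lambda \, \hat{L}_s^{+} \, (s - s^*)

% Tracking a planned image trajectory s^*(t) adds a feedforward term:
v_c = -\lambda \, \hat{L}_s^{+} \, \bigl(s - s^*(t)\bigr) + \hat{L}_s^{+} \, \dot{s}^*(t)
```

With several tracked points, the individual matrices are stacked row-wise. The singularities mentioned in the abstract correspond to configurations where the stacked matrix loses rank, and the local minima to configurations where the error is nonzero but lies in the null space of the pseudo-inverse.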