944 resultados para Visual Speech Recognition, Multiple Views, Frontal View, Profile View


Relevância:

40.00% 40.00%

Publicador:

Resumo:

Purpose Optical blur and ageing are known to affect driving performance but their effects on drivers' eye movements are poorly understood. This study examined the effects of optical blur and age on eye movement patterns and performance on the DriveSafe slide recognition test which is purported to predict fitness to drive. Methods Twenty young (27.1 ± 4.6 years) and 20 older (73.3 ± 5.7 years) visually normal drivers performed the DriveSafe under two visual conditions: best-corrected vision and with +2.00 DS blur. The DriveSafe is a Visual Recognition Slide Test that consists of brief presentations of static, real-world driving scenes containing different road users (pedestrians, bicycles and vehicles). Participants reported the types, relative positions and direction of travel of the road users in each image; the score was the number of correctly reported items (maximum score of 128). Eye movements were recorded while participants performed the DriveSafe test using a Tobii TX300 eye tracking system. Results There was a significant main effect of blur on DriveSafe scores (best-corrected: 114.9 vs blur: 93.2; p < 0.001). There was also a significant age and blur interaction on the DriveSafe scores (p < 0.001) such that the young drivers were more negatively affected by blur than the older drivers (reductions of 22% and 13% respectively; p < 0.001): with best-corrected vision, the young drivers performed better than the older drivers (DriveSafe scores: 118.4 vs 111.5; p = 0.001), while with blur, the young drivers performed worse than the older drivers (88.6 vs 95.9; p = 0.009). For the eye movement patterns, blur significantly reduced the number of fixations on road users (best-corrected: 5.1 vs blur: 4.5; p < 0.001), fixation duration on road users (2.0 s vs 1.8 s; p < 0.001) and saccade amplitudes (7.4° vs 6.7°; p < 0.001). A main effect of age on eye movements was also found where older drivers made smaller saccades than the young drivers (6.7° vs 7.4°; p < 0.001). Conclusions Blur reduced DriveSafe scores for both age groups and this effect was greater for the young drivers. The decrease in number of fixations and fixation duration on road users, as well as the reduction in saccade amplitudes under the blurred condition, highlight the difficulty experienced in performing the task in the presence of optical blur, which suggests that uncorrected refractive errors may have a detrimental impact on aspects of driving performance.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Purpose Little is known about the prevalence of refractive error, binocular vision, and other visual conditions in Australian Indigenous children. This is important given the association of these visual conditions with reduced reading performance in the wider population, which may also contribute to the suboptimal reading performance reported in this population. The aim of this study was to develop a visual profile of Queensland Indigenous children. Methods Vision testing was performed on 595 primary schoolchildren in Queensland, Australia. Vision parameters measured included visual acuity, refractive error, color vision, nearpoint of convergence, horizontal heterophoria, fusional vergence range, accommodative facility, AC/A ratio, visual motor integration, and rapid automatized naming. Near heterophoria, nearpoint of convergence, and near fusional vergence range were used to classify convergence insufficiency (CI). Results Although refractive error (Indigenous, 10%; non-Indigenous, 16%; p = 0.04) and strabismus (Indigenous, 0%; non-Indigenous, 3%; p = 0.03) were significantly less common in Indigenous children, CI was twice as prevalent (Indigenous, 10%; non-Indigenous, 5%; p = 0.04). Reduced visual information processing skills were more common in Indigenous children (reduced visual motor integration [Indigenous, 28%; non-Indigenous, 16%; p < 0.01] and slower rapid automatized naming [Indigenous, 67%; non-Indigenous, 59%; p = 0.04]). The prevalence of visual impairment (reduced visual acuity) and color vision deficiency was similar between groups. Conclusions Indigenous children have less refractive error and strabismus than their non-Indigenous peers. However, CI and reduced visual information processing skills were more common in this group. Given that vision screenings primarily target visual acuity assessment and strabismus detection, this is an important finding as many Indigenous children with CI and reduced visual information processing may be missed. Emphasis should be placed on identifying children with CI and reduced visual information processing given the potential effect of these conditions on school performance

Relevância:

40.00% 40.00%

Publicador:

Resumo:

This paper describes a vision-only system for place recognition in environments that are tra- versed at different times of day, when chang- ing conditions drastically affect visual appear- ance, and at different speeds, where places aren’t visited at a consistent linear rate. The ma- jor contribution is the removal of wheel-based odometry from the previously presented algo- rithm (SMART), allowing the technique to op- erate on any camera-based device; in our case a mobile phone. While we show that the di- rect application of visual odometry to our night- time datasets does not achieve a level of perfor- mance typically needed, the VO requirements of SMART are orthogonal to typical usage: firstly only the magnitude of the velocity is required, and secondly the calculated velocity signal only needs to be repeatable in any one part of the environment over day and night cycles, but not necessarily globally consistent. Our results show that the smoothing effect of motion constraints is highly beneficial for achieving a locally consis- tent, lighting-independent velocity estimate. We also show that the advantage of our patch-based technique used previously for frame recogni- tion, surprisingly, does not transfer to VO, where SIFT demonstrates equally good performance. Nevertheless, we present the SMART system us- ing only vision, which performs sequence-base place recognition in extreme low-light condi- tions where standard 6-DOF VO fails and that improves place recognition performance over odometry-less benchmarks, approaching that of wheel odometry.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

In this paper we propose a hypothetical scheme for recognizing the alphanumerics. The scheme is based on the known physiological structure of the visual cortex and the concept of a short Lino extractor nouron (SLEN). We assumo four basic typca of such units for extracting vertical, horizontal, right and left inclined straight line segments. The patterns reconstructed from the scheme show perfect agreement with the test patterns. The model indicates that the recognition of letters T and H requires extraction of the largest number of features.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

The prevalence and the causes of childhood visual impairment in Finland during the 1970s and the 1980s were investigated, with special attention to risk factors and further prevention of visual impairment in children. The primary data on children with visual impairment were obtained from the Finnish Register of Visual Impairment, one of the patient registers kept up by the National Research and Development Centre for Welfare and Health (Stakes). The data were supplemented from other registers in Stakes and from patient records of the children in Finnish central hospitals. Visual impairment had been registered in 556 children from a population of 1,138,326 children between ages 0-17, born from 1972 through 1989. The age-specific prevalence of registered visual impairment was 49/100,000 in total. Of them, 23/100,000 were blind children and 11/100,000 were children born prematurely. Boys were impaired more often and more severely than girls. Congenital malformations (52%), systemic diseases (48%), and multiple impairments (50%) were common. The main ophthalmic groups of visual impairment were retinal diseases (35%), ocular malformations (29%), and neuro-ophthalmological disorders (29%). Optic nerve atrophy was the most common diagnosis of visual impairment (22%), followed by congenital cataract (11%), retinopathy of prematurity (10%), and cerebral visual impairment (8%). Genetic factors (42%) were the most common etiologies of visual impairment, followed by prenatal (30%) and perinatal (21%) factors. The highest rates of blindness were seen in cerebral visual impairment (83%) and retinopathy of prematurity (82%). Retinopathy of prematurity had developed in the children born at a gestational age of 32 weeks or earlier. Significant risks for visual impairment were found in the association with preterm births, prenatal infections, birth asphyxia, neonatal respiratory difficulties, mechanical ventilation lasting over two weeks, and hyperbilirubinemia. A rise in blind and multi-impaired children was seen during the study period, associating with increases in the survival of preterm infants with extremely low birth weight. The incidence of visual impairment in children born prematurely was seven times higher than in children born at full term. A reliable profile of childhood visual impairment was obtained. The importance of highly qualified antenatal, neonatal, and ophthalmological care was clearly proved. The risks associated with pre- and perinatal disorders during pregnancy must be emphasized, e.g. the risks associated with maternal infections and the use of tobacco, alcohol, and drugs during pregnancy. Obvious needs for gene therapies and other new treatments for hereditary diseases were also proved.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

This work proposes a boosting-based transfer learning approach for head-pose classification from multiple, low-resolution views. Head-pose classification performance is adversely affected when the source (training) and target (test) data arise from different distributions (due to change in face appearance, lighting, etc). Under such conditions, we employ Xferboost, a Logitboost-based transfer learning framework that integrates knowledge from a few labeled target samples with the source model to effectively minimize misclassifications on the target data. Experiments confirm that the Xferboost framework can improve classification performance by up to 6%, when knowledge is transferred between the CLEAR and FBK four-view headpose datasets.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Peripherally triarylborane decorated porphyrin (2) and its Zn(II) complex (3) have been synthesized. Compound 3 contains of two different Lewis acidic binding sites (Zn(II) and boron center). Unlike all previously known triarylborane based sensors, the optical responses of 3 toward fluoride and cyanide are distinctively different, thus enabling the discrimination of these two interfering anions. Metalloporphyrin 3 shows a multiple channel fluorogenic response toward fluoride and cyanide and also a selective visual colorimetric response toward cyanide. By comparison with model systems and from detailed photophysical studies on 2 and 3, we conclude that the preferential binding of fluoride occurs at the peripheral borane moieties resulting in the cessation of the EET (electronic energy transfer) process from borane to porphyrin core and with negligible negetive cooperative effects. On the other hand, cyanide binding occurs at the Zn(II) core leading to drastic changes in its absorption behavior which can be followed by the naked eye. Such changes are not observed when the boryl substituent is absent (e.g., Zn-TPP and TPP). Compounds 2 and 3 were also found to be capable of extracting fluoride from aqueous medium.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Plants emit volatile organic compounds (VOCs) from most parts of their anatomy. Conventionally, the volatiles of leaves, flowers, fruits and seeds have been investigated separately. This review presents an integrated perspective of volatiles produced by fruits and seeds in the context of selection on the whole plant. It suggests that fruit and seed volatiles may only be understood in the light of the chemistry of the whole plant. Fleshy fruit may be viewed as an ecological arena within which several evolutionary games are being played involving fruit VOCs. Fruit odour and colour may be correlated and interact via multimodal signalling in influencing visits by frugivores. The hypothesis of volatile crypsis in the evolution of hard seeds as protection against volatile diffusion and perception by seed predators is reviewed. Current views on the role of volatiles in ant dispersal of seeds or myrmecochory are summarised, especially the suggestion that ants are being manipulated by plants in the form of a sensory trap while providing this service. Plant VOC production is presented as an emergent phenotype that could result from multiple selection pressures acting on various plant parts; the ``plant'' phenotype and VOC profile may receive significant contributions from symbionts within the plant. Viewing the plant as a holobiont would benefit an understanding of the emergent plant phenotype.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

With the increasing availability of wearable cameras, research on first-person view videos (egocentric videos) has received much attention recently. While some effort has been devoted to collecting various egocentric video datasets, there has not been a focused effort in assembling one that could capture the diversity and complexity of activities related to life-logging, which is expected to be an important application for egocentric videos. In this work, we first conduct a comprehensive survey of existing egocentric video datasets. We observe that existing datasets do not emphasize activities relevant to the life-logging scenario. We build an egocentric video dataset dubbed LENA (Life-logging EgoceNtric Activities) (http://people.sutd.edu.sg/similar to 1000892/dataset) which includes egocentric videos of 13 fine-grained activity categories, recorded under diverse situations and environments using the Google Glass. Activities in LENA can also be grouped into 5 top-level categories to meet various needs and multiple demands for activities analysis research. We evaluate state-of-the-art activity recognition using LENA in detail and also analyze the performance of popular descriptors in egocentric activity recognition.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

In optical character recognition of very old books, the recognition accuracy drops mainly due to the merging or breaking of characters. In this paper, we propose the first algorithm to segment merged Kannada characters by using a hypothesis to select the positions to be cut. This method searches for the best possible positions to segment, by taking into account the support vector machine classifier's recognition score and the validity of the aspect ratio (width to height ratio) of the segments between every pair of cut positions. The hypothesis to select the cut position is based on the fact that a concave surface exists above and below the touching portion. These concave surfaces are noted down by tracing the valleys in the top contour of the image and similarly doing it for the image rotated upside-down. The cut positions are then derived as closely matching valleys of the original and the rotated images. Our proposed segmentation algorithm works well for different font styles, shapes and sizes better than the existing vertical projection profile based segmentation. The proposed algorithm has been tested on 1125 different word images, each containing multiple merged characters, from an old Kannada book and 89.6% correct segmentation is achieved and the character recognition accuracy of merged words is 91.2%. A few points of merge are still missed due to the absence of a matched valley due to the specific shapes of the particular characters meeting at the merges.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

This paper presents a novel coarse-to-fine global localization approach that is inspired by object recognition and text retrieval techniques. Harris-Laplace interest points characterized by SIFT descriptors are used as natural land-marks. These descriptors are indexed into two databases: an inverted index and a location database. The inverted index is built based on a visual vocabulary learned from the feature descriptors. In the location database, each location is directly represented by a set of scale invariant descriptors. The localization process consists of two stages: coarse localization and fine localization. Coarse localization from the inverted index is fast but not accurate enough; whereas localization from the location database using voting algorithm is relatively slow but more accurate. The combination of coarse and fine stages makes fast and reliable localization possible. In addition, if necessary, the localization result can be verified by epipolar geometry between the representative view in database and the view to be localized. Experimental results show that our approach is efficient and reliable. ©2005 IEEE.