14 resultados para Visual Speech Recognition, Multiple Views, Frontal View, Profile View

em University of Queensland eSpace - Australia


Relevância:

100.00% 100.00%

Publicador:

Resumo:

The McGurk effect, in which auditory [ba] dubbed onto [go] lip movements is perceived as da or tha, was employed in a real-time task to investigate auditory-visual speech perception in prelingual infants. Experiments 1A and 1B established the validity of real-time dubbing for producing the effect. In Experiment 2, 4(1)/(2)-month-olds were tested in a habituation-test paradigm, in which 2 an auditory-visual stimulus was presented contingent upon visual fixation of a live face. The experimental group was habituated to a McGurk stimulus (auditory [ba] visual [ga]), and the control group to matching auditory-visual [ba]. Each group was then presented with three auditory-only test trials, [ba], [da], and [deltaa] (as in then). Visual-fixation durations in test trials showed that the experimental group treated the emergent percept in the McGurk effect, [da] or [deltaa], as familiar (even though they had not heard these sounds previously) and [ba] as novel. For control group infants [da] and [deltaa] were no more familiar than [ba]. These results are consistent with infants'perception of the McGurk effect, and support the conclusion that prelinguistic infants integrate auditory and visual speech information. (C) 2004 Wiley Periodicals, Inc.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Children with autistic spectrum disorder (ASD) may have poor audio-visual integration, possibly reflecting dysfunctional 'mirror neuron' systems which have been hypothesised to be at the core of the condition. In the present study, a computer program, utilizing speech synthesizer software and a 'virtual' head (Baldi), delivered speech stimuli for identification in auditory, visual or bimodal conditions. Children with ASD were poorer than controls at recognizing stimuli in the unimodal conditions, but once performance on this measure was controlled for, no group difference was found in the bimodal condition. A group of participants with ASD were also trained to develop their speech-reading ability. Training improved visual accuracy and this also improved the children's ability to utilize visual information in their processing of speech. Overall results were compared to predictions from mathematical models based on integration and non-integration, and were most consistent with the integration model. We conclude that, whilst they are less accurate in recognizing stimuli in the unimodal condition, children with ASD show normal integration of visual and auditory speech stimuli. Given that training in recognition of visual speech was effective, children with ASD may benefit from multi-modal approaches in imitative therapy and language training. (C) 2004 Elsevier Ltd. All rights reserved.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This paper presents a corpus-based descriptive analysis of the most prevalent transfer effects and connected speech processes observed in a comparison of 11 Vietnamese English speakers (6 females, 5 males) and 12 Australian English speakers (6 males, 6 females) over 24 grammatical paraphrase items. The phonetic processes are segmentally labelled in terms of IPA diacritic features using the EMU speech database system with the aim of labelling departures from native-speaker pronunciation. An analysis of prosodic features was made using ToBI framework. The results show many phonetic and prosodic processes which make non-native speakers’ speech distinct from native ones. The corpusbased methodology of analysing foreign accent may have implications for the evaluation of non-native accent, accented speech recognition and computer assisted pronunciation- learning.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

One hundred and twelve university students completed 7 tests assessing word-reading accuracy, print exposure, phonological sensitivity, phonological coding and knowledge of English morphology as predictors of spelling accuracy. Together the tests accounted for 71% of the variance in spelling, with phonological skills and morphological knowledge emerging as strong predictors of spelling accuracy for words with both regular and irregular sound-spelling correspondences. The pattern of relationships was consistent with a model in which, as a function of the learning opportunities that are provided by reading experience, phonological skills promote the learning of individual word orthographies and structural relationships among words.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

These are the full proceedings of the conference.

Relevância:

50.00% 50.00%

Publicador:

Resumo:

The aims of the present study were to compare the perceptual assessments of deviant speech signs (dysarthria) exhibited by Australian and Swedish speakers with multiple sclerosis (MS) and to explore whether judgements of dysarthria differed depending on whether the speakers and the judges spoke the same or different languages. Ten Australian and 10 Swedish individuals with MS (matched as closely as possible for age, gender, progression type and severity of dysarthria) were assessed by 2 Australian and 2 Swedish clinically experienced judges using a protocol including 33 speech parameters. Results show that the following perceptual dimensions were identified by both pairs of judges in both groups of speakers to a just noticeable or moderate degree: imprecise consonants, inappropriate pitch level, reduced general rate, and glottal fry. The reliability (Spearman rank-order correlation) of the consensus ratings from the Australian and the Swedish judges was high, with a mean rho of 85.7 for the Australian speakers and mean rho of 84.3 for the Swedish speakers. The most difficult perceptual parameters to assess (i.e. to agree on) included harshness, level of pitch and loudness, precision of consonants and general stress pattern. The study indicated that perceptual assessments of speech characteristics in individuals with MS are informative and can be achieved with high inter-judge reliability irrespective of the judge's knowledge of the speaker's language. Copyright (C) 2003 S. Karger AG, Basel.

Relevância:

50.00% 50.00%

Publicador:

Resumo:

Background: The purpose of the present study was to describe a profile of Australian paediatric occupational therapy practice in terms of theories, assessments and interventions used with the most frequently seen client groups. Methods: An ex post facto survey design was utilised. A purpose-designed survey was mailed to 600 occupational therapists identified by OT Australia as working in paediatrics. Results: The response rate was 55% (n = 330). Respondents in the sample worked chiefly with children with developmental delays, learning disabilities, neurological impairments, and infants/toddlers. Theoretical models used by paediatric clinicians that were common to the most frequently seen client groups focused on sensory integration/multisensory approaches, occupational performance, and client-centred practice. Assessment tools most frequently used were the Test of Visual Motor Integration, Sensory Profile, Bruininks-Oseretsky Test of Motor Proficiency, Handwriting Speed Test, and Motor-Free Visual Perception Test. The most often used treatment methods across the four most frequently seen client groups were parent/caregiver education, sensory integration/stimulation techniques, and managing activities of daily living. Conclusions: Paediatric occupational therapists appeared to draw on a range of theoretical models. With the exception of the Sensory Profile, the assessment and treatment methods most frequently used are not congruent with the most commonly used theoretical models. It is critical that the assessment and treatment methods used are conceptually consistent with the theoretical models that guide practice. Occupational therapists need to examine the evidence and determine whether their clinical practice is grounded in the best contemporary theoretical models, assessments and interventions.

Relevância:

50.00% 50.00%

Publicador:

Resumo:

PURPOSE. The driving environment is becoming increasingly complex, including both visual and auditory distractions within the in- vehicle and external driving environments. This study was designed to investigate the effect of visual and auditory distractions on a performance measure that has been shown to be related to driving safety, the useful field of view. METHODS. A laboratory study recorded the useful field of view in 28 young visually normal adults (mean 22.6 +/- 2.2 years). The useful field of view was measured in the presence and absence of visual distracters (of the same angular subtense as the target) and with three levels of auditory distraction (none, listening only, listening and responding). RESULTS. Central errors increased significantly (P < 0.05) in the presence of auditory but not visual distracters, while peripheral errors increased in the presence of both visual and auditory distracters. Peripheral errors increased with eccentricity and were greatest in the inferior region in the presence of distracters. CONCLUSIONS. Visual and auditory distracters reduce the extent of the useful field of view, and these effects are exacerbated in inferior and peripheral locations. This result has significant ramifications for road safety in an increasingly complex in-vehicle and driving environment.

Relevância:

50.00% 50.00%

Publicador:

Resumo:

Probabilistic robotics most often applied to the problem of simultaneous localisation and mapping (SLAM), requires measures of uncertainty to accompany observations of the environment. This paper describes how uncertainty can be characterised for a vision system that locates coloured landmarks in a typical laboratory environment. The paper describes a model of the uncertainty in segmentation, the internal cameral model and the mounting of the camera on the robot. It explains the implementation of the system on a laboratory robot, and provides experimental results that show the coherence of the uncertainty model.

Relevância:

50.00% 50.00%

Publicador:

Resumo:

Recovering position from sensor information is an important problem in mobile robotics, known as localisation. Localisation requires a map or some other description of the environment to provide the robot with a context to interpret sensor data. The mobile robot system under discussion is using an artificial neural representation of position. Building a geometrical map of the environment with a single camera and artificial neural networks is difficult. Instead it would be simpler to learn position as a function of the visual input. Usually when learning images, an intermediate representation is employed. An appropriate starting point for biologically plausible image representation is the complex cells of the visual cortex, which have invariance properties that appear useful for localisation. The effectiveness for localisation of two different complex cell models are evaluated. Finally the ability of a simple neural network with single shot learning to recognise these representations and localise a robot is examined.