215 resultados para Perceptual speech analysis
em University of Queensland eSpace - Australia
Resumo:
Perceptual voice analysis is a subjective process. However, despite reports of varying degrees of intrajudge and interjudge reliability, it is widely used in clinical voice evaluation. One of the ways to improve the reliability of this procedure is to provide judges with signals as external standards so that comparison can be made in relation to these anchor signals. The present study used a Klatt speech synthesizer to create a set of speech signals with varying degree of three different voice qualities based on a Cantonese sentence. The primary objective of the study was to determine whether different abnormal voice qualities could be synthesized using the built-in synthesis parameters using a perceptual study. The second objective was to determine the relationship between acoustic characteristics of the synthesized signals and perceptual judgment. Twenty Cantonese-speaking speech pathologists with at least three years of clinical experience in perceptual voice evaluation were asked to undertake two tasks. The first was to decide whether the voice quality of the synthesized signals was normal or not. The second was to decide whether the abnormal signals should be described as rough, breathy, or vocal fry. The results showed that signals generated with a small degree of aspiration noise were perceived as breathiness while signals with a small degree of flutter or double pulsing were perceived as roughness. When the flutter or double pulsing increased further, tremor and vocal fry, rather than roughness, were perceived. Furthermore, the amount of aspiration noise, flutter, or double pulsing required for male voice stimuli was different from that required for the female voice stimuli with a similar level of perceptual breathiness and roughness. These findings showed that changes in perceived vocal quality could be achieved by systematic modifications of synthesis parameters. This opens up the possibility of using synthesized voice signals as external standards or anchors to improve the reliability of clinical perceptual voice evaluation. (C) 2002 Acoustical Society of America.
Resumo:
The speech characteristics, oromotor function and speech intelligibility of a group of children treated for cerebellar tumour (CT) was investigated perceptually. Assessment of these areas was performed on 11 children treated for CT with dysarthric speech as well as 21 non-neurologically impaired controls matched for age and sex to obtain a comprehensive perceptual profile of their speech and oromotor mechanism. Contributing to the perception of dysarthria were a number of deviant speech dimensions including imprecision of consonants, hoarseness and decreased pitch variation, as well as a reduction in overall speech intelligibility for both sentences and connected speech. Oromotor assessment revealed deficits in lip, tongue and laryngeal function, particularly relating to deficits in timing and coordination of movements. The most salient features of the dysarthria seen in children treated for CT were the mild nature of the speech disorder and clustering of speech deficits in the prosodic, phonatory and articulatory aspects of speech production.
Resumo:
Objective: Laryngeal and tongue function was assessed in 28 patients to evaluate the presence, nature, and resolution of superior recurrent laryngeal and hypoglossal nerve damage resulting from standard open primary carotid endarterectomy (CEA). Methods. The laryngeal and tongue function in 28 patients who underwent CEA were examined prospectively with various physiologic (Aerophone II, laryngograph, tongue transducer), acoustic (Multi-Dimensional Voice Program), and perceptual speech assessments. Measures were obtained from all participants preoperatively, and at 2 weeks and at 3 months postoperatively. Results. The perceptual speech assessment indicated that the vocal quality of roughness was significantly more apparent at the 2-week postoperative assessment than preoperatively. However, by the 3-month postoperative assessment these values had returned to near preoperative levels, with no significant difference detected between preoperative and 3-month postoperative levels or between 2-week and 3-month postoperative levels. Both the instrumental assessments of laryngeal function and the acoustic assessment of vocal quality failed to identify any significant difference on any measure across the three assessment periods. Similarly, no significant impairment in tongue strength, endurance, or rate of repetitive tongue movements was detected at instrumental assessment of tongue function. Conclusions: No permanent changes to vocal or tongue function occurred in this group of participants after primary CEA. The lack of any significant long-term laryngeal or tongue dysfunction in this group suggests that the standard open CEA procedure is not associated with high rates of superior recurrent and hypoglossal nerve dysfunction, as previously believed.
Resumo:
The present study examined the effects of neurosurgical management of Parkinson's disease (PD), including the procedures of pallidotomy, thalamotomy, and deep-brain stimulation (DBS) on perceptual speech characteristics, speech,, intelligibility and oromotor function in a group of 22 participants with PD. The surgical participant group was compared with a group of 25 non-neurologically impaired individuals matched for age and sex. In addition, the study investigated 16 participants with PD who did not undergo neurosurgical management to control for disease progression. Results revealed that neurosurgical intervention did not significantly change the surgical participants' perceptual speech dimensions or oromotor function despite significant postoperative improvements in ratings of general motor function and disease severity. Reasons why neurosurgical intervention resulted in dissimilar outcomes with respect to participants' perceptual speech dimensions and general motor function are proposed.
Resumo:
The differences in spectral shape resolution abilities among cochlear implant ~CI! listeners, and between CI and normal-hearing ~NH! listeners, when listening with the same number of channels ~12!, was investigated. In addition, the effect of the number of channels on spectral shape resolution was examined. The stimuli were rippled noise signals with various ripple frequency-spacings. An adaptive 4IFC procedure was used to determine the threshold for resolvable ripple spacing, which was the spacing at which an interchange in peak and valley positions could be discriminated. The results showed poorer spectral shape resolution in CI compared to NH listeners ~average thresholds of approximately 3000 and 400 Hz, respectively!, and wide variability among CI listeners ~range of approximately 800 to 8000 Hz!. There was a significant relationship between spectral shape resolution and vowel recognition. The spectral shape resolution thresholds of NH listeners increased as the number of channels increased from 1 to 16, while the CI listeners showed a performance plateau at 4–6 channels, which is consistent with previous results using speech recognition measures. These results indicate that this test may provide a measure of CI performance which is time efficient and non-linguistic, and therefore, if verified, may provide a useful contribution to the prediction of speech perception in adults and children who use CIs.
Resumo:
The present study aimed to investigate how induced lingual fatigue affected lingual strength, articulatory kinematics, and perceptual speech features in CS, a 51-year-old female with active myasthenia gravis (MG), and three age and gender matched control participants, Lingual fatigue was elicited via a series of endurance tasks using a tongue pressure bulb. Following each endurance task, the participants performed a speech task containing the phonemes /k/, /t/, and /j/ that was recorded with an electromagnetic articulograph, followed by a lingual strength assessment using a tongue pressure bulb. Participants repeated this schedule over five phases and kinematic and strength changes during each phase were compared to baseline measurements. All of CSs significant kinematic changes occurred during the final fatigue phase compared to 27.3% of the control group's kinematic changes occurring during this phase, suggesting the kinematic changes associated with fatigue were not accelerated in CS. The endurance tasks also elicited different kinematic effects for CSs anterior and posterior tongue segments. While CS exhibited mostly similar kinematic and perceptual changes to the control group, some of CS's perceptual transcriptions for /k/ and kinematic changes were not replicated, indicating that some different perceptual and physiological consequences to CS's speech were elicited by the endurance tasks.
Resumo:
Compression amplification significantly alters the acoustic speech signal in comparison to linear amplification. The central hypothesis of the present study was that the compression settings of a two-channel aid that best preserved the acoustic properties of speech compared to linear amplification would yield the best perceptual results, and that the compression settings that most altered the acoustic properties of speech compared to linear would yield significantly poorer speech perception. On the basis of initial acoustic analysis of the test stimuli recorded through a hearing aid, two different compression amplification settings were chosen for the perceptual study. Participants were 74 adults with mild to moderate sensorineural hearing impairment. Overall, the speech perception results supported the hypothesis. A further aim of the study was to determine if variation in participants' speech perception with compression amplification (compared to linear amplification) could be explained by the individual characteristics of age, degree of loss, dynamic range, temporal resolution, and frequency selectivity; however, no significant relationships were found.
Resumo:
The aims of the present study were to compare the perceptual assessments of deviant speech signs (dysarthria) exhibited by Australian and Swedish speakers with multiple sclerosis (MS) and to explore whether judgements of dysarthria differed depending on whether the speakers and the judges spoke the same or different languages. Ten Australian and 10 Swedish individuals with MS (matched as closely as possible for age, gender, progression type and severity of dysarthria) were assessed by 2 Australian and 2 Swedish clinically experienced judges using a protocol including 33 speech parameters. Results show that the following perceptual dimensions were identified by both pairs of judges in both groups of speakers to a just noticeable or moderate degree: imprecise consonants, inappropriate pitch level, reduced general rate, and glottal fry. The reliability (Spearman rank-order correlation) of the consensus ratings from the Australian and the Swedish judges was high, with a mean rho of 85.7 for the Australian speakers and mean rho of 84.3 for the Swedish speakers. The most difficult perceptual parameters to assess (i.e. to agree on) included harshness, level of pitch and loudness, precision of consonants and general stress pattern. The study indicated that perceptual assessments of speech characteristics in individuals with MS are informative and can be achieved with high inter-judge reliability irrespective of the judge's knowledge of the speaker's language. Copyright (C) 2003 S. Karger AG, Basel.
Perceptual and instrumental analysis of laryngeal function after traumatic brain injury in childhood
Resumo:
Objective: To investigate laryngeal function and phonatory disturbance in children with traumatic brain injury (TBI), using both perceptual and instrumental techniques. Design and participants: The performance of 16 individuals with moderate to severe TBI acquired in childhood and 16 nonneurologicatly impaired control subjects was compared on a battery of perceptual (Frenchay Dysarthria Assessment, speech sample analysis) and instrumental (Aerophone II, laryngograph) assessments. Results and conclusions: As a group, the children with TBI demonstrated normal, or only minimally impaired laryngeal function, when compared with the control group, which contrasts with the significant laryngeal impairment noted in adults after TBI. Several reasons for the different findings in relation to laryngeal function in adults and children after TBI are postulated: (1) differing types of injury usually incurred by adults and children may result in a relatively decreased degree of neurologic impairment in these children, (2) differences in recovery potential between adults and children, and (3) the pediatric larynx is still developing, hence it may be better able to compensate for any impairment incurred.
Resumo:
Primary objective: To investigate the articulatory function of a group of children with traumatic brain injury (TBI), using both perceptual and instrumental techniques. Research design: The performance of 24 children with TBI was assessed on a battery of perceptual (Frenchay Dysarthria Assessment, Assessment of Intelligibility of Dysarthric Speech and speech sample analysis) and instrumental ( lip and tongue pressure transduction systems) assessments and compared with that of 24 non-neurologically impaired children matched for age and sex. Main outcomes: Perceptual assessment identified consonant and vowel imprecision, increased length of phonemes and overall reduction in speech intelligibility, while instrumental assessment revealed significant impairment in lip and tongue function in the TBI group, with rate and pressure in repetitive lip and tongue tasks particularly impaired. Significant negative correlations were identified between the degree of deviance of perceptual articulatory features and decreased function on many non-speech measures of lip function, as well as maximum tongue pressure and fine force tongue control at 20% of maximum tongue pressure. Additionally, sub-clinical articulatory deficits were identified in the children with TBI who were non-dysarthric. Conclusion: The results of the instrumental assessment of lip and tongue function support the finding of substantial articulatory dysfunction in this group of children following TBI. Hence, remediation of articulatory function should be a therapeutic priority in these children.
Resumo:
This study describes a preliminary examination of the viability and suitability of the physiologic technique electromagnetic articulography (EMA) in investigating lingual fatigue in myasthenia gravis (MG). A 52.9-year-old female diagnosed with MG at the age of 18 years, but who was in remission, participated in the study with a matched control subject. Changes in the duration, speed, and range of tongue-tip and tongue-back movements during repetition of /taka/ over two minutes were investigated. Results revealed that the MG subject did not exhibit significant changes in duration, maximum velocity, maximum acceleration, or the distance travelled by her tongue as measured by EMA over the task. The kinematic results were, in part, expected since the MG subject was in remission. The results, therefore, may not be representative of the majority of individuals with active MG. The examination of the current case did highlight, however, the potential advantages of EMA in providing detailed, objective information regarding lingual kinematics for future investigations of individuals with MG. It also showed that EMA may be sensitive in detecting subclinical kinematic features of fatigue in individuals who are in remission from MG. Finally, EMA led to the identification of possible physiologic factors underlying the CV transform effect, which was evident for the MG subject's syllable productions. In the past, the effect had been assumed to be a purely perceptual-based phenomenon.
Resumo:
The present paper reviews the findings of 30 years of verbal/manual dual task studies, the method most commonly used to assess lateralization of speech production in non-clinical samples. Meta-analysis of 64 results revealed that both the type of manual task used and the nature of practice that is given influence the size of the laterality effect. A meta-analysis of 36 results examining the effect size of sex differences in estimate,, of lateralization of speech production indicated that males appear to show, slightly larger laterality effects than females. (C) 2002 Elsevier Science Ltd. All rights reserved.