173 resultados para Intelligibility
Resumo:
An important aspect of speech perception is the ability to group or select formants using cues in the acoustic source characteristics-for example, fundamental frequency (F0) differences between formants promote their segregation. This study explored the role of more radical differences in source characteristics. Three-formant (F1+F2+F3) synthetic speech analogues were derived from natural sentences. In Experiment 1, F1+F3 were generated by passing a harmonic glottal source (F0 = 140 Hz) through second-order resonators (H1+H3); in Experiment 2, F1+F3 were tonal (sine-wave) analogues (T1+T3). F2 could take either form (H2 or T2). In some conditions, the target formants were presented alone, either monaurally or dichotically (left ear = F1+F3; right ear = F2). In others, they were accompanied by a competitor for F2 (F1+F2C+F3; F2), which listeners must reject to optimize recognition. Competitors (H2C or T2C) were created using the time-reversed frequency and amplitude contours of F2. Dichotic presentation of F2 and F2C ensured that the impact of the competitor arose primarily through informational masking. In the absence of F2C, the effect of a source mismatch between F1+F3 and F2 was relatively modest. When F2C was present, intelligibility was lowest when F2 was tonal and F2C was harmonic, irrespective of which type matched F1+F3. This finding suggests that source type and context, rather than similarity, govern the phonetic contribution of a formant. It is proposed that wideband harmonic analogues are more effective informational maskers than narrowband tonal analogues, and so become dominant in across-frequency integration of phonetic information when placed in competition.
Resumo:
The role of source properties in across-formant integration was explored using three-formant (F1+F2+F3) analogues of natural sentences (targets). In experiment 1, F1+F3 were harmonic analogues (H1+H3) generated using a monotonous buzz source and second-order resonators; in experiment 2, F1+F3 were tonal analogues (T1+T3). F2 could take either form (H2 or T2). Target formants were always presented monaurally; the receiving ear was assigned randomly on each trial. In some conditions, only the target was present; in others, a competitor for F2 (F2C) was presented contralaterally. Buzz-excited or tonal competitors were created using the time-reversed frequency and amplitude contours of F2. Listeners must reject F2C to optimize keyword recognition. Whether or not a competitor was present, there was no effect of source mismatch between F1+F3 and F2. The impact of adding F2C was modest when it was tonal but large when it was harmonic, irrespective of whether F2C matched F1+F3. This pattern was maintained when harmonic and tonal counterparts were loudness-matched (experiment 3). Source type and competition, rather than acoustic similarity, governed the phonetic contribution of a formant. Contrary to earlier research using dichotic targets, requiring across-ear integration to optimize intelligibility, H2C was an equally effective informational masker for H2 as for T2.
Resumo:
OBJETIVOS: Descrever as características de fala de indivíduos submetidos à palatoplastia primária; relacioná-las com tipo de fissura, técnica cirúrgica e idade na ocasião da cirurgia; e descrever as condutas fonoaudiológicas após a cirurgia. MÉTODOS: Estudo retrospectivo de 167 casos, de ambos os gêneros, com fissura labiopalatina, submetidos à palatoplastia primária. Foram coletadas informações relativas ao tipo de fissura, idade na palatoplastia, técnica cirúrgica, e as análises subjetivas sobre as características da fala, realizadas por fonoaudiólogas. RESULTADOS: Na avaliação perceptiva da fala após a cirurgia, encontrou-se inteligibilidade de fala alterada (46%), ressonância hipernasal (33%), articulações compensatórias (26%), emissão de ar nasal (14%), mímica facial (11%) e fraca pressão aérea intra-oral (8%). Na associação entre a ressonância e as articulações compensatórias com tipo de fissura, técnica cirúrgica e faixa etária, não houve diferença significativa. A conduta mais frequentemente tomada foi a de terapia fonoaudiológica (38%), para correção das articulações compensatórias e/ou outras alterações. CONCLUSÃO: A maioria dos indivíduos apresentou ressonância equilibrada ou hipernasalidade aceitável e ausência de articulações compensatória, independente do tipo de fissura, da técnica cirúrgica e da faixa etária, embora não tenha ocorrido diferença significativa. Dentre as condutas adotadas após a primeira avaliação pós-palatoplastia primária, a terapia fonoaudiológica foi a mais frequente.
Resumo:
The purpose of the present study was to examine the benefits of providing audible speech to listeners with sensorineural hearing loss when the speech is presented in a background noise. Previous studies have shown that when listeners have a severe hearing loss in the higher frequencies, providing audible speech (in a quiet background) to these higher frequencies usually results in no improvement in speech recognition. In the present experiments, speech was presented in a background of multitalker babble to listeners with various severities of hearing loss. The signal was low-pass filtered at numerous cutoff frequencies and speech recognition was measured as additional high-frequency speech information was provided to the hearing-impaired listeners. It was found in all cases, regardless of hearing loss or frequency range, that providing audible speech resulted in an increase in recognition score. The change in recognition as the cutoff frequency was increased, along with the amount of audible speech information in each condition (articulation index), was used to calculate the "efficiency" of providing audible speech. Efficiencies were positive for all degrees of hearing loss. However, the gains in recognition were small, and the maximum score obtained by an listener was low, due to the noise background. An analysis of error patterns showed that due to the limited speech audibility in a noise background, even severely impaired listeners used additional speech audibility in the high frequencies to improve their perception of the "easier" features of speech including voicing
Resumo:
Treatment case studies of three children whose speech was characterized by non-developmental errors are described. Three therapy methods were trialed with each child: phonological contrast; core vocabulary and PROMPT. The accuracy and intelligibility of the children's connected speech improved throughout: the course of the programme. Intervention that focused on teaching a rule about the contrastive use of phonemes was most successful for a child who consistently made non-developmental errors. Children making inconsistent errors received most benefit from the core vocabulary approach that markedly enhanced consistency of production. However, once consistency was established, one child benefited from phonological contrast therapy. While the results of the study should be interpreted with caution due to the small sample size and the cumulative effects of intervention, the findings suggest that different parts of a child's phonological and phonetic system may respond to various types of treatment approaches that target different aspects of speech production. The implication drawn is that just as no single treatment approach is appropriate for all children with disordered phonology, management of some children may involve selecting and sequencing a range of different approaches.
Resumo:
Objective: To assess, in patients undergoing glossectomy, the influence of the palatal augmentation prosthesis on the speech intelligibility and acoustic spectrographic characteristics of the formants of oral vowels in Brazilian Portuguese, specifically the first 3 formants (F1 [/a,e,u/], F2 [/o,o,u/], and F3 [/a,o/]). Design: Speech evaluation with and without a palatal augmentation prosthesis using blinded randomized listener judgments. Setting: Tertiary referral center. Patients: Thirty-six patients (33 men and 3 women) aged 30 to 80 (mean [SD], 53.9 [10.5]) years underwent glossectomy (14, total glossectomy; 12, total glossectomy and partial mandibulectomy; 6, hemiglossectomy; and 4, subtotal glossectomy) with use of the augmentation prosthesis for at least 3 months before inclusion in the study. Main Outcome Measures: Spontaneous speech intel-ligibility (assessed by expert listeners using a 4-category scale) and spectrographic formants assessment. Results: We found a statistically significant improvement of spontaneous speech intelligibility and the average number of correctly identified syllables with the use of the prosthesis (P < .05). Statistically significant differences occurred for the F1 values of the vowels /a,e,u/; for F2 values, there was a significant difference of the vowels /o,o,u/; and for F3 values, there was a significant difference of the vowels la,61 (P < .001). Conclusions: The palatal augmentation prosthesis improved the intelligibility of spontaneous speech and syllables for patients who underwent glossectomy. It also increased the F2 and F3 values for all vowels and the F I values for the vowels /o,o,u/. This effect brought the values of many vowel formants closer to normal.
Resumo:
Dysfunction of the articulatory subsystem (i.c.. the lips, tongue, and jaw) has bccn identified as a major contributor to the reduction in speech intelligibility experienced by a high proportion of people with multiple sclerosis (MS). In particular. consonant imprecision has been reported to be the articulatory deficit that contributes most to variations in overall intelligibility of MS speakers. Electropalatography(EPG) IS an instrurncntal technique that visually documents the location and timing of tongue-topalatc contacts during speech. Although such a technique would be valuablc in objectively assessing the articulatory disturbances exhibited by individuals with dysarthria ia motor speech disorder) associated with MS, to-date no such study ha< been reported. The aim of the present study was to use EPG to assess tongue-to-palate contact patterns and articulatory timing in patients with dysarthria associated with MS. A dysarthric participant with a diagnosis of definite MS was fitted with an acrylic EPG palate (Reading EPG.?) and asked to read aloud a list of single syllable words which contained lingual consonants in the word-initial position and in consonant clusters. Each mord was repeated five times. The EPG palate was specifically moulded to tit the participant's hard palate and contained 62 electrodes that detected the tongue contacts. A non-neurologically impaired participant matched for age and sex servcd as a control. The results of the study revealed that the tongue-to-palate contacts produced by the participant with MS varied from those produced by the control in a number of ways in regard to spatial configurations and timing characteristics exhibited. The rcsults arc discussed in relation to the neuropathophysiological effects of MS on speech production. The potcntial use of EPG in programs for treating speech disorders associated with MS will be highlightcd.
Resumo:
Articulatory patterns and nasal resonance were assessed before and 6 months after orthognathic reconstruction surgery in five patients with dentofacial deformities. Perceptual and physiological assessments showed disorders of nasality and articulatory function preoperatively, two patients being hyponasal, and one hypernasal. Four patients had mild articulatory deficits, and four had reduced maximal lip or tongue pressures. Operation resulted in different patterns of change. Nasality deteriorated in three patients and articulatory precision and intelligibility improved in only one patient and showed no change in the other four. Operation improved interlabial pressures in three patients, while its impact on tongue pressures varied, being improved in one case, deteriorating in one, and remaining unchanged in the other three. The variability in the results highlights the need for routine assessment of speech and resonance before and after orthognathic reconstruction. (C) 2002 The British Association of Oral and Maxillofacial Surgeons. Published by Elsevier Science Ltd. All rights reserved.
Resumo:
Este trabalho analisa a protagonista do filme “Io sono l’amore”a partir do imaginário ocidental que impeliu a identidade das mulheres para o amor. Porno- tropic russo a ser inseminado pela civilização italiana, Emma será a propriedade erotizada que encontra no adultério a possibilidade de transgredir contra a identidadeque sustenta. Porém, se a infidelidade ameaça a ordem para qual as mulheres servem de base, elaainda colaboracom a inteligibilidade que liga as mulheres ao imaginário amoroso dito livre.
Resumo:
Relatório de Estágio Apresentado ao Instituto de Contabilidade e Administração do Porto para a obtenção do grau de Mestre em Empreendedorismo e Internacionalização, sob orientação da Mestre Inês Veiga Pereira
Resumo:
Surveillance registers monitor the prevalence of cerebral palsy and the severity of resulting impairments across time and place. The motor disorders of cerebral palsy can affect children’s speech production and limit their intelligibility. We describe the development of a scale to classify children’s speech performance for use in cerebral palsy surveillance registers, and its reliability across raters and across time. Speech and language therapists, other healthcare professionals and parents classified the speech of 139 children with cerebral palsy (85 boys, 54 girls; mean age 6.03 years, SD 1.09) from observation and previous knowledge of the children. Another group of health professionals rated children’s speech from information in their medical notes. With the exception of parents, raters reclassified children’s speech at least four weeks after their initial classification. Raters were asked to rate how easy the scale was to use and how well the scale described the child’s speech production using Likert scales. Inter-rater reliability was moderate to substantial (k > .58 for all comparisons). Test–retest reliability was substantial to almost perfect for all groups (k > .68). Over 74% of raters found the scale easy or very easy to use; 66% of parents and over 70% of health care professionals judged the scale to describe children’s speech well or very well. We conclude that the Viking Speech Scale is a reliable tool to describe the speech performance of children with cerebral palsy, which can be applied through direct observation of children or through case note review.
Resumo:
When speech is degraded, word report is higher for semantically coherent sentences (e.g., her new skirt was made of denim) than for anomalous sentences (e.g., her good slope was done in carrot). Such increased intelligibility is often described as resulting from "top-down" processes, reflecting an assumption that higher-level (semantic) neural processes support lower-level (perceptual) mechanisms. We used time-resolved sparse fMRI to test for top-down neural mechanisms, measuring activity while participants heard coherent and anomalous sentences presented in speech envelope/spectrum noise at varying signal-to-noise ratios (SNR). The timing of BOLD responses to more intelligible speech provides evidence of hierarchical organization, with earlier responses in peri-auditory regions of the posterior superior temporal gyrus than in more distant temporal and frontal regions. Despite Sentence content × SNR interactions in the superior temporal gyrus, prefrontal regions respond after auditory/perceptual regions. Although we cannot rule out top-down effects, this pattern is more compatible with a purely feedforward or bottom-up account, in which the results of lower-level perceptual processing are passed to inferior frontal regions. Behavioral and neural evidence that sentence content influences perception of degraded speech does not necessarily imply "top-down" neural processes.
Resumo:
Velopharyngeal insufficiency (VPI) is a structural or functional trouble, which causes hypernasal speech. Velopharyngeal flaps, speech therapy and augmentation pharyngoplasty, using different implants, have all been used to address this trouble. We hereby present our results following rhinopharyngeal autologous fat injection in 18 patients with mild velopharyngeal insufficiency (12 soft palate clefts, 4 functional VPI, 2 myopathy). 28 injections were carried out between 2004 and 2007. The degree of hypernasal speech was evaluated pre- and postoperatively by a speech therapist and an ENT specialist and quantified by an acoustic nasometry (Kay Elemetrics). All patients were exhaustively treated with preoperative speech therapy (average, 8 years). The mean value of the nasalance score was 37% preoperatively and 23% postoperatively (p = 0.015). The hypernasality was reduced postoperatively in all patients (1-3 degrees of the Borel-Maisonny score). There were no major complications, two minor complications (one hematoma, one cervical pain). The autologous fat injection is a simple, safe, minimally invasive procedure. It proves to be efficient in cases of mild velopharyngeal insufficiency or after a suboptimal velopharyngoplasty.
Resumo:
Recent efforts to implement gender mainstreaming in the field of security sector reform have resulted in an international policy discourse on gender and security sector reform (GSSR). Critics have challenged GSSR for its focus on 'adding women' and its failure to be transformative. This article contests this assessment, demonstrating that GSSR is not only about 'adding women', but also, importantly, about 'gendering men differently' and has important albeit problematic transformative implications. Drawing on poststructuralist and postcolonial feminist theory, I propose a critical reading of GSSR policy discourse in order to analyse its built-in logics, tensions and implications. I argue that this discourse establishes a powerful 'grid of intelligibility' that draws on gendered and racialized dualisms to normalize certain forms of subjectivity while rendering invisible and marginalizing others, and contributing to reproduce certain forms of normativity and hierarchy. Revealing such processes of discursive in/exclusion and marginalized subjectivities can serve as a starting point to challenge and transform GSSR practice and identify sites of contestation.
Resumo:
This dissertation considers the segmental durations of speech from the viewpoint of speech technology, especially speech synthesis. The idea is that better models of segmental durations lead to higher naturalness and better intelligibility. These features are the key factors for better usability and generality of synthesized speech technology. Even though the studies are based on a Finnish corpus the approaches apply to all other languages as well. This is possibly due to the fact that most of the studies included in this dissertation are about universal effects taking place on utterance boundaries. Also the methods invented and used here are suitable for any other study of another language. This study is based on two corpora of news reading speech and sentences read aloud. The other corpus is read aloud by a 39-year-old male, whilst the other consists of several speakers in various situations. The use of two corpora is twofold: it involves a comparison of the corpora and a broader view on the matters of interest. The dissertation begins with an overview to the phonemes and the quantity system in the Finnish language. Especially, we are covering the intrinsic durations of phonemes and phoneme categories, as well as the difference of duration between short and long phonemes. The phoneme categories are presented to facilitate the problem of variability of speech segments. In this dissertation we cover the boundary-adjacent effects on segmental durations. In initial positions of utterances we find that there seems to be initial shortening in Finnish, but the result depends on the level of detail and on the individual phoneme. On the phoneme level we find that the shortening or lengthening only affects the very first ones at the beginning of an utterance. However, on average, the effect seems to shorten the whole first word on the word level. We establish the effect of final lengthening in Finnish. The effect in Finnish has been an open question for a long time, whilst Finnish has been the last missing piece for it to be a universal phenomenon. Final lengthening is studied from various angles and it is also shown that it is not a mere effect of prominence or an effect of speech corpus with high inter- and intra-speaker variation. The effect of final lengthening seems to extend from the final to the penultimate word. On a phoneme level it reaches a much wider area than the initial effect. We also present a normalization method suitable for corpus studies on segmental durations. The method uses an utterance-level normalization approach to capture the pattern of segmental durations within each utterance. This prevents the impact of various problematic variations within the corpora. The normalization is used in a study on final lengthening to show that the results on the effect are not caused by variation in the material. The dissertation shows an implementation and prowess of speech synthesis on a mobile platform. We find that the rule-based method of speech synthesis is a real-time software solution, but the signal generation process slows down the system beyond real time. Future aspects of speech synthesis on limited platforms are discussed. The dissertation considers ethical issues on the development of speech technology. The main focus is on the development of speech synthesis with high naturalness, but the problems and solutions are applicable to any other speech technology approaches.