Digital Library

898 results for performativity of speech

Impact of inter-judge agreement on perceptual judgment of nasality

Relevance:

90.00%

Abstract:

Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)

Result on speech perception after conversion from Spectra (R) to Freedom (R)

Relevance:

90.00%

Abstract:

New technology in the Freedom (R) speech processor for cochlear implants was developed to improve how incoming acoustic sound is processed; this applies not only to new users, but also to users of previous generations of cochlear implants. Aim: To identify the contribution of this technology, applied to the Nucleus 22 (R), to speech perception tests in silence and in noise and to audiometric thresholds. Methods: A cross-sectional cohort study was undertaken. Seventeen patients were selected. The last map based on the Spectra (R) was revised and optimized before starting the tests. Troubleshooting was used to identify malfunction. To identify the contribution of the Freedom (R) technology for the Nucleus 22 (R), auditory thresholds and speech perception tests were performed in free field in soundproof booths. Recorded monosyllables and sentences in silence and in noise (SNR = 0 dB) were presented at 60 dB SPL. The nonparametric Wilcoxon test for paired data was used to compare groups. Results: Freedom (R) applied to the Nucleus 22 (R) showed a statistically significant difference in all speech perception tests and audiometric thresholds. Conclusion: The Freedom (R) technology improved the speech perception performance and audiometric thresholds of patients with the Nucleus 22 (R).

Acoustic analyses of diadochokinesis in fluent and stuttering children

Relevance:

90.00%

Abstract:

OBJECTIVES: The purpose of the study was to acoustically compare the performance of children who do and do not stutter on diadochokinesis tasks in terms of syllable duration, syllable periods, and peak intensity. METHODS: In this case-control study, acoustical analyses were performed on 26 children who stutter and 20 age-matched normally fluent children (both groups stratified into preschoolers and school-aged children) during a diadochokinesis task: the repetition of articulatory segments through a task testing the ability to alternate movements. Speech fluency was assessed using the Fluency Profile and the Stuttering Severity Instrument. RESULTS: The children who stutter and those who do not did not significantly differ in terms of the acoustic patterns they produced in the diadochokinesis tasks. Significant differences were demonstrated between age groups independent of speech fluency. Overall, the preschoolers performed more poorly. These results indicate that the observed differences are related to speech-motor age development and not to stuttering itself. CONCLUSIONS: Acoustic studies demonstrate that speech segment durations are most variable, both within and between subjects, during childhood and then gradually decrease to adult levels by the age of eleven to thirteen years. One possible explanation for the results of the present study is that children who stutter presented higher coefficients of variation to exploit the motor equivalence to achieve accurate sound production (i.e., the absence of speech disruptions).

Speech Enhancement With Inventory Style Speech Resynthesis

Relevance:

90.00%

Abstract:

We present a new method for the enhancement of speech. The method is designed for scenarios in which targeted speaker enrollment as well as system training within the typical noise environment are feasible. The proposed procedure is fundamentally different from most conventional and state-of-the-art denoising approaches. Instead of filtering a distorted signal we are resynthesizing a new “clean” signal based on its likely characteristics. These characteristics are estimated from the distorted signal. A successful implementation of the proposed method is presented. Experiments were performed in a scenario with roughly one hour of clean speech training data. Our results show that the proposed method compares very favorably to other state-of-the-art systems in both objective and subjective speech quality assessments. Potential applications for the proposed method include jet cockpit communication systems and offline methods for the restoration of audio recordings.

Identification, discrimination, and selective adaptation of simultaneous musical intervals

Relevance:

90.00%

Abstract:

Four experiments investigated perception of major and minor thirds whose component tones were sounded simultaneously. Effects akin to categorical perception of speech sounds were found. In the first experiment, musicians demonstrated relatively sharp category boundaries in identification and peaks near the boundary in discrimination tasks of an interval continuum where the bottom note was always an F and the top note varied from A to A flat in seven equal logarithmic steps. Nonmusicians showed these effects only to a small extent. The musicians showed higher than predicted discrimination performance overall, and reaction time increases at category boundaries. In the second experiment, musicians failed to consistently identify or discriminate thirds which varied in absolute pitch, but retained the proper interval ratio. In the last two experiments, using selective adaptation, consistent shifts were found in both identification and discrimination, similar to those found in speech experiments. Manipulations of adapting and test showed that the mechanism underlying the effect appears to be centrally mediated and confined to a frequency-specific level. A multistage model of interval perception, where the first stages deal only with specific pitches may account for the results.

Corpus-Based Speech Enhancement With Uncertainty Modeling and Cepstral Smoothing

Relevance:

90.00%

Abstract:

We present a new approach for corpus-based speech enhancement that significantly improves over a method published by Xiao and Nickel in 2010. Corpus-based enhancement systems do not merely filter an incoming noisy signal, but resynthesize its speech content via an inventory of pre-recorded clean signals. The goal of the procedure is to perceptually improve the sound of speech signals in background noise. The proposed new method modifies Xiao's method in four significant ways. Firstly, it employs a Gaussian mixture model (GMM) instead of a vector quantizer in the phoneme recognition front-end. Secondly, the state decoding of the recognition stage is supported with an uncertainty modeling technique. With the GMM and the uncertainty modeling it is possible to eliminate the need for noise dependent system training. Thirdly, the post-processing of the original method via sinusoidal modeling is replaced with a powerful cepstral smoothing operation. And lastly, due to the improvements of these modifications, it is possible to extend the operational bandwidth of the procedure from 4 kHz to 8 kHz. The performance of the proposed method was evaluated across different noise types and different signal-to-noise ratios. The new method was able to significantly outperform traditional methods, including the one by Xiao and Nickel, in terms of PESQ scores and other objective quality measures. Results of subjective CMOS tests over a smaller set of test samples support our claims.

Changes in hemodynamics and tissue oxygenation saturation in the brain and skeletal muscle induced by speech therapy - a near-infrared spectroscopy study

Relevance:

90.00%

Abstract:

Arts speech therapy (AST) is a therapeutic method within complementary medicine and has been practiced for decades for various medical conditions. It comprises listening and the recitation of different forms of speech exercises under the guidance of a licensed speech therapist. The aim of our study was to noninvasively investigate whether different types of recitation influence hemodynamics and oxygenation in the brain and skeletal leg muscle using near-infrared spectroscopy (NIRS). Seventeen healthy volunteers (eight men and nine women, mean age ± standard deviation 35.6 ± 12.7 years) were enrolled in the study. Each subject was measured three times on different days with the different types of recitation: hexameter, alliteration, and prose verse. Before, during, and after recitation, relative concentration changes of oxyhemoglobin (Δ[O2Hb]), deoxyhemoglobin (Δ[HHb]), total hemoglobin (Δ[tHb]), and tissue oxygenation saturation (StO2) were measured in the brain and skeletal leg muscle using a NIRS device. The study was performed with a randomized crossover design. Significant concentration changes were found during recitation of all verses, with mainly a decrease in Δ[O2Hb] and ΔStO2 in the brain, and an increase in Δ[O2Hb] and Δ[tHb] in the leg muscle during recitation. After the recitations, significant changes were mainly increases of Δ[HHb] and Δ[tHb] in the calf muscle. The Mayer wave spectral power (MWP) was also significantly affected, i.e., mainly the MWP of the Δ[O2Hb] and Δ[tHb] increased in the brain during recitation of hexameter and prose verse. The changes in MWP were also significantly different between hexameter and alliteration, and hexameter and prose. Possible physiological explanations for these changes are discussed. A probable reason is a different effect of recitations on the sympathetic nervous system. In conclusion, these changes show that AST has relevant effects on the hemodynamics and oxygenation of the brain and muscle.

Comparative Analysis of Russian and French Prosodies: Theoretical, Experimental and Applied Aspects

Relevance:

90.00%

Abstract:

Experience shows that in teaching the pronunciation of a foreign language, it is the native syllable stereotype that resists correction most strongly. This is because the syllable is the basic unit of the perception and production of speech, and syllabic production is highly automatic and to some degree determines the prosody of speech at all levels: accent, rhythm, phrase, etc. The results of psycho-physiological studies show that the human acoustic analyser is a typical contemplator organ and new acoustic qualities are perceived through their inclusion into the already existing system of values characteristic of the mother tongue. This results in the adaptation of the perception, and thus the production, of foreign speech to native patterns. The less conscious the perception of the unit and the more 'primitive' its status, the greater the degree of its auditory assimilation, and the syllable is certainly among the less controllable linguistic units. The group carried out a complex investigation of the French and Russian languages at the level of syllable realisation, focusing on the stressed syllable of both open and closed types. The useful acoustic characteristics of the French/Russian syllable pattern were determined through identifying a typical syllable pattern within the system of each of the two languages, comparing these patterns to establish their contrasting features, and observing and systematising deviations from the pattern typical of the French/Russian language teaching situation. The components of the syllable pattern shown to need particular attention in teaching French pronunciation to Russian native speakers were intensity, fundamental frequency, and duration. The group then developed a method of correction which combines the auditory and visual channels of sound signal perception and tested this method with groups of Russian students of different levels.

[On quality of life after surgical therapy of oral cancer - a retrospective multi-center study: the connection between dedentition, denture, quality of life, and dysphagia, and the resulting rehabilitation schemes]

Relevance:

90.00%

Abstract:

PURPOSE: The surgical treatment of oral cancer results in functional and aesthetic impairments. Patients' quality of life is considerably impaired by oral symptoms resulting from therapy of oral cancer. In many cases the inevitable resection of the tumor, as well as the adjuvant radiochemotherapy, will cause the destruction of physiologically and anatomically important structures. One focus of research was the specific rehabilitation of dental loss by functional dentures. Another was the course of 19 impairments (comprehension of speech for unknown others, comprehension of speech for familiar others, eating/swallowing, mobility of the tongue, opening range of the mouth, mobility of lower jaw, mobility of neck, mobility of arms and shoulders, sense of taste, sense of smell, appearance, strength, appetite, respiration, pain, swelling, xerostomia, halitosis). METHODS: Commissioned by the German, Austrian and Swiss cooperative group on tumors of the maxillofacial region (DOSAK), data were collected in 3,894 questionnaires at 43 hospitals in Germany, Austria and Switzerland. The catalogue comprised 147 items in 9 chapters. At the end of the enquiry, 1,761 anonymous questionnaires were returned by 38 hospitals. 1,652 of these could be evaluated regarding the question. RESULTS: The sum score of the 19 impairments was highly increased immediately after the operation and recovered over the next 6 months, without, however, reaching the pre-surgery level. Of 1,652 patients, only 35% did not lose any teeth during therapy. 23% lost up to 5, 17% up to 10 teeth. A quarter of the patients lost more than 10 teeth. The more teeth were lost, the greater the decline of quality of life (p ≤ 0.001), although this could be allayed by the functionality of the dentures (p ≤ 0.001). There is a reciprocal dependence between the functionality of dental prosthetics and impairment by eating/swallowing (p ≤ 0.001).
CONCLUSIONS: Patients' quality of life after radical surgery of a carcinoma of the oral cavity depends not only on the functionality of dentures and the specificity of rehabilitation, but also on the initial findings, the extent and location of the resection, the chosen therapy, the general circumstances of the patient's life, and their coping strategies. These factors, however, unlike the functionality of dental prostheses and rehabilitation, are not modifiable.

Formal speech stylistics and type A behavior in 38 subjects during nonstress interviews

Relevance:

90.00%

Abstract:

Open-ended 90-minute interviews with 38 patients were analyzed with respect to speech stylistics, shown by Schucker and Jacobs to differentiate individuals with type A personality features from those with type B. In our patients, Type A/B had been assessed by the Bortner Personality Inventory. The stylistics studied were: repeated words, swallowed words, interruptions, simultaneous speech, silence latency (between question and answer) (SL), speed of speech, uneven speed of speech (USS), explosive words (PW), uneven speech volume (USV), and speech volume. Correlations between both raters for all speech categories were high. Positive correlations between extent of type A and SL (r = 0.33; p = 0.022), USS (r = 0.51; p = 0.002), PW (r = 0.46; p = 0.003) and USV (r = 0.39; p = 0.012) were found. Our results indicate that the speech of type A individuals in nonstress open-ended interviews tends to show a higher emotional tension (positive correlations for USS, PW, and USV) and is more controlled in conversation (positive correlation for SL).

Content Classification of Multimedia Documents using Partitions of Low-Level Features

Relevance:

90.00%

Abstract:

Audio-visual documents obtained from German TV news are classified according to the IPTC topic categorization scheme. To this end, usual text classification techniques are adapted to speech, video, and non-speech audio. For each of the three modalities word analogues are generated: sequences of syllables for speech, “video words” based on low-level color features (color moments, color correlogram and color wavelet), and “audio words” based on low-level spectral features (spectral envelope and spectral flatness) for non-speech audio. Such audio and video words provide a means to represent the different modalities in a uniform way. The frequencies of the word analogues represent audio-visual documents: the standard bag-of-words approach. Support vector machines are used for supervised classification in a 1 vs. n setting. Classification based on speech outperforms all other single modalities. Combining speech with non-speech audio improves classification. Classification is further improved by supplementing speech and non-speech audio with video words. Optimal F-scores range between 62% and 94%, corresponding to 50% - 84% above chance. The optimal combination of modalities depends on the category to be recognized. The construction of audio and video words from low-level features provides a good basis for the integration of speech, non-speech audio and video.

Semiotic Analysis in the Study of Social Work

Relevance:

90.00%

Abstract:

Research and professional practices have the joint aim of re-structuring preconceived notions of reality. They both want to gain an understanding of social reality. Social workers use their professional competence in order to grasp the reality of their clients, while researchers’ pursuit is to unlock the secrets of the research material. Development and research are now so intertwined and inherent in almost all professional practices that making distinctions between practising, developing and researching has become difficult and in many aspects irrelevant. Moving towards research-based practices is possible, and it is easily applied within the framework of the qualitative research approach (Dominelli 2005, 235; Humphries 2005, 280). Social work can be understood as acts and speech acts crisscrossing between social workers and clients. When trying to catch the verbal and non-verbal hints of each other’s behaviour, the actors have to do a great deal of interpretation in a more or less uncertain mental landscape. Our point of departure is the idea that the study of social work practices requires tools which effectively reveal the internal complexity of social work (see, for example, Adams & Dominelli & Payne 2005, 294 – 295). The boom of qualitative research methodologies in recent decades is associated with a much more profound rupture in the humanities, which is called the linguistic turn (Rorty 1967). The idea that language does not transparently mediate our perceptions and thoughts about reality but, on the contrary, constitutes it was new and even confusing to many social scientists. Nowadays we have become used to reading research reports which have applied different branches of discursive analysis or narratological or semiotic approaches. Although the differences between those orientations are subtle, they share the idea of the predominance of language.
Despite the lively research work of today’s social work and the research-minded atmosphere of social work practice, semiotics has rarely been applied in social work research. However, social work as a communicative practice concerns symbols, metaphors and all kinds of representative structures of language. Those items are at the core of semiotics, the science of signs, and the science which examines people using signs in their mutual interaction and their endeavours to make sense of the world they live in, their semiosis. When thinking of the practice of social work and doing research on it, a number of interpretational levels ought to be passed before reaching the research phase in social work. First of all, social workers have to interpret their clients’ situations, which will be recorded in the files. In some very rare cases those past situations will be reflected in discussions or perhaps interviews or put under the scrutiny of some researcher in the future. Each and every new observation adds its own flavour to the mixture of meanings. Social workers have combined their observations with previous experience and professional knowledge; furthermore, the situation at hand also influences the reactions. In addition, the interpretations made by social workers over the course of their daily working routines are never limited to being part of the personal process of the social worker, but are also always inherently cultural. The work aiming at social change is defined by the presence of an initial situation, a specific goal, and the means and ways of achieving it, which are – or which should be – agreed upon by the social worker and the client in a situation which is unique and at the same time socially-driven. Because of the inherent plot-based nature of social work, the practices related to it can be analysed as stories (see Dominelli 2005, 234), given, of course, that they are signifying and told by someone.
Research on these practices concentrates on impressions, perceptions, judgements, accounts, documents etc. All these multifarious elements can be scrutinized as textual corpora, but not just any textual material. In semiotic analysis, the material studied is characterised as verbal or textual and loaded with meanings. We present a contribution to research methodology, semiotic analysis, which to our mind has at least implicit references to social work practices. Our examples of semiotic interpretation have been taken from our dissertations (Laine 2005; Saurama 2002). The data are official documents from the archives of a child welfare agency and transcriptions of the interviews of shelter employees. These data can be defined as stories told by the social workers of what they have seen and felt. The official documents present only fragments and they are often written in the passive voice. (Saurama 2002, 70.) The interviews carried out in the shelters can be described as stories where the narrators are more familiar and known. The material is characterised by the interaction between the interviewer and interviewee. The levels of the story and the telling of the story become apparent when interviews or documents are examined with the use of semiotic tools. The roots of semiotic interpretation can be found in three different branches: American pragmatism, Saussurean linguistics in Paris and the so-called formalism in Moscow and Tartu; however, in this paper we are engaged with the so-called Parisian School of semiology, whose prominent figure was A. J. Greimas. The Finnish sociologists Pekka Sulkunen and Jukka Törrönen (1997a; 1997b) have further developed the ideas of Greimas in their studies on socio-semiotics, and we lean on their ideas.
In semiotics, social reality is conceived as a relationship between subjects, observations, and interpretations, and it is seen as mediated by natural language, which is the most common sign system among human beings (Mounin 1985; de Saussure 2006; Sebeok 1986). Signification is an act of associating an abstract context (signified) to some physical instrument (signifier). These two elements together form the basic concept, the “sign”, which never constitutes any kind of meaning alone. The meaning will be comprised in a distinction process where signs are being related to other signs. In this chain of signs, the meaning becomes diverged from reality. (Greimas 1980, 28; Potter 1996, 70; de Saussure 2006, 46-48.) One interpretative tool is to think of speech as a surface under which deep structures – i.e. values and norms – exist (Greimas & Courtes 1982; Greimas 1987). To our mind, semiotics is very much about playing with two different levels of text: the syntagmatic surface, which is more or less faithful to the grammar, and the paradigmatic, semantic structure of values and norms hidden in the deeper meanings of interpretations. Semiotic analysis deals precisely with the level of meaning which exists under the surface, but the only way to reach those meanings is through the textual level, the written or spoken text. That is why the tools are needed. In our studies, we have used the semiotic square and the actant analysis. The former is based on the distinctions and the categorisations of meanings, and the latter on opening the plotting of narratives in order to reach the value structures.

Social Salience and the Sociolinguistic Monitor: a Case Study of ING and TH-fronting in Britain

Relevance:

90.00%

Abstract:

This article examines the role of social salience, or the relative ability of a linguistic variable to evoke social meaning, in structuring listeners’ perceptions of quantitative sociolinguistic distributions. Building on the foundational work of Labov et al. (2006, 2011) on the “sociolinguistic monitor” (a proposed cognitive mechanism responsible for sociolinguistic perception), we examine whether listeners’ evaluative judgments of speech change as a function of the type of variable presented. We consider two variables in British English, ING and TH-fronting, which we argue differ in their relative social salience. Replicating the design of Labov et al.’s studies, we test 149 British listeners’ reactions to different quantitative distributions of these variables. Our experiments elicit a very different pattern of perceptual responses than those reported previously. In particular, our results suggest that a variable’s social salience determines both whether and how it is perceptually evaluated. We argue that this finding is crucial for understanding how sociolinguistic information is cognitively processed.

Benefit of a Contralateral Routing of Signal Device for Unilateral Cochlear Implant Users.

Relevance:

90.00%

Abstract:

Objective: To investigate objective and subjective effects of an adjunctive contralateral routing of signal (CROS) device at the untreated ear in patients with a unilateral cochlear implant (CI). Design: Prospective study of 10 adult experienced unilateral CI users with bilateral severe-to-profound hearing loss. Speech in noise reception (SNR) and sound localization were measured with and without the additional CROS device. SNR was measured by applying speech signals at the untreated/CROS side while noise signals came from the front (S90N0). For S0N90, signal sources were switched. Sound localization was measured in a 12-loudspeaker full circle setup. To evaluate the subjective benefit, patients tried the device for 2 weeks at home, then filled out the abbreviated Speech, Spatial and Qualities of Hearing Scale as well as the Bern benefit in single-sided deafness questionnaires. Results: In the setting S90N0, all patients showed a highly significant SNR improvement when wearing the additional CROS device (mean 6.4 dB, p < 0.001). In the unfavorable setting S0N90, only a minor deterioration of speech understanding was noted (mean -0.66 dB, p = 0.54). Sound localization did not improve substantially with CROS. In the two questionnaires, 12 of 14 items showed an improvement in mean values, but none of them was statistically significant. Conclusion: Patients with unilateral CI benefit from a contralateral CROS device, particularly in a noisy environment, when speech comes from the CROS ear side. © 2014 S. Karger AG, Basel.

Statistical Learning, Syllable Processing, and Speech Production in Healthy Hearing and Hearing-Impaired Preschool Children: A Mismatch Negativity Study

Relevance:

90.00%

Abstract:

OBJECTIVES: The objectives of the present study were to investigate temporal/spectral sound-feature processing in preschool children (4 to 7 years old) with peripheral hearing loss compared with age-matched controls. The results verified the presence of statistical learning, which was diminished in children with hearing impairments (HIs), and elucidated possible perceptual mediators of speech production. DESIGN: Perception and production of the syllables /ba/, /da/, /ta/, and /na/ were recorded in 13 children with normal hearing and 13 children with HI. Perception was assessed physiologically through event-related potentials (ERPs) recorded by EEG in a multifeature mismatch negativity paradigm and behaviorally through a discrimination task. Temporal and spectral features of the ERPs during speech perception were analyzed, and speech production was quantitatively evaluated using speech motor maximum performance tasks. RESULTS: Proximal to stimulus onset, children with HI displayed a difference in map topography, indicating diminished statistical learning. In later ERP components, children with HI exhibited reduced amplitudes specifically in the N2 and early parts of the late discriminative negativity components, which are associated with temporal and spectral control mechanisms. Abnormalities of speech perception were only subtly reflected in speech production, as the lone difference found in speech production studies was a mild delay in regulating speech intensity. CONCLUSIONS: In addition to previously reported deficits of sound-feature discriminations, the present study results reflect diminished statistical learning in children with HI, which plays an early and important, but so far neglected, role in phonological processing. Furthermore, the lack of corresponding behavioral abnormalities in speech production implies that impaired perceptual capacities do not necessarily translate into productive deficits.
