990 resultados para speech analysis
Resumo:
In the area of the professional competition, the coach is a fundamental part in the management of a team and more concretely in the game planning. During the competition, the management of the times of pause and times out as well as the conduct of the coach during the same ones is an aspect to analyze in the sports performance. It is for this that it becomes necessary to know some of the behaviors that turn out to be more frequent by the coach and that are more related to a positive performance of his players. For it there has been realized a study of 7 cases of expert coaches in those that his verbal behavior has observed during 4 games. It has focused on the content of the information only to verbal level, on his meaning. The information that have been obtained in the study shows a major quantity of information elaborated during the pauses of the games and a major tactical content with regard to the moments of game. On the other hand, a relation exists between a major number of questions and a minor number of psychological instructions when the score is adverse, whereas in case of victory, a direct relation does not exist with any category. The rest of categories of the speech do not meet influenced directly for the result, for what it is not possible to consider a direct and immediate relation between the coach verbal behavior during the pauses and the result of the game, except in punctual moments.
Resumo:
Mode of access: Internet.
Resumo:
Mode of access: Internet.
Resumo:
Mode of access: Internet.
Resumo:
The aims of the present study were to compare the perceptual assessments of deviant speech signs (dysarthria) exhibited by Australian and Swedish speakers with multiple sclerosis (MS) and to explore whether judgements of dysarthria differed depending on whether the speakers and the judges spoke the same or different languages. Ten Australian and 10 Swedish individuals with MS (matched as closely as possible for age, gender, progression type and severity of dysarthria) were assessed by 2 Australian and 2 Swedish clinically experienced judges using a protocol including 33 speech parameters. Results show that the following perceptual dimensions were identified by both pairs of judges in both groups of speakers to a just noticeable or moderate degree: imprecise consonants, inappropriate pitch level, reduced general rate, and glottal fry. The reliability (Spearman rank-order correlation) of the consensus ratings from the Australian and the Swedish judges was high, with a mean rho of 85.7 for the Australian speakers and mean rho of 84.3 for the Swedish speakers. The most difficult perceptual parameters to assess (i.e. to agree on) included harshness, level of pitch and loudness, precision of consonants and general stress pattern. The study indicated that perceptual assessments of speech characteristics in individuals with MS are informative and can be achieved with high inter-judge reliability irrespective of the judge's knowledge of the speaker's language. Copyright (C) 2003 S. Karger AG, Basel.
Resumo:
This paper presents a corpus-based descriptive analysis of the most prevalent transfer effects and connected speech processes observed in a comparison of 11 Vietnamese English speakers (6 females, 5 males) and 12 Australian English speakers (6 males, 6 females) over 24 grammatical paraphrase items. The phonetic processes are segmentally labelled in terms of IPA diacritic features using the EMU speech database system with the aim of labelling departures from native-speaker pronunciation. An analysis of prosodic features was made using ToBI framework. The results show many phonetic and prosodic processes which make non-native speakers’ speech distinct from native ones. The corpusbased methodology of analysing foreign accent may have implications for the evaluation of non-native accent, accented speech recognition and computer assisted pronunciation- learning.
Resumo:
In this paper we present the design and analysis of an intonation model for text-to-speech (TTS) synthesis applications using a combination of Relational Tree (RT) and Fuzzy Logic (FL) technologies. The model is demonstrated using the Standard Yorùbá (SY) language. In the proposed intonation model, phonological information extracted from text is converted into an RT. RT is a sophisticated data structure that represents the peaks and valleys as well as the spatial structure of a waveform symbolically in the form of trees. An initial approximation to the RT, called Skeletal Tree (ST), is first generated algorithmically. The exact numerical values of the peaks and valleys on the ST is then computed using FL. Quantitative analysis of the result gives RMSE of 0.56 and 0.71 for peak and valley respectively. Mean Opinion Scores (MOS) of 9.5 and 6.8, on a scale of 1 - -10, was obtained for intelligibility and naturalness respectively.
Resumo:
We propose a study of the mathematical properties of voice as an audio signal -- This work includes signals in which the channel conditions are not ideal for emotion recognition -- Multiresolution analysis- discrete wavelet transform – was performed through the use of Daubechies Wavelet Family (Db1-Haar, Db6, Db8, Db10) allowing the decomposition of the initial audio signal into sets of coefficients on which a set of features was extracted and analyzed statistically in order to differentiate emotional states -- ANNs proved to be a system that allows an appropriate classification of such states -- This study shows that the extracted features using wavelet decomposition are enough to analyze and extract emotional content in audio signals presenting a high accuracy rate in classification of emotional states without the need to use other kinds of classical frequency-time features -- Accordingly, this paper seeks to characterize mathematically the six basic emotions in humans: boredom, disgust, happiness, anxiety, anger and sadness, also included the neutrality, for a total of seven states to identify
Resumo:
We propose a novel analysis alternative, based on two Fourier Transforms for emotion recognition from speech -- Fourier analysis allows for display and synthesizes different signals, in terms of power spectral density distributions -- A spectrogram of the voice signal is obtained performing a short time Fourier Transform with Gaussian windows, this spectrogram portraits frequency related features, such as vocal tract resonances and quasi-periodic excitations during voiced sounds -- Emotions induce such characteristics in speech, which become apparent in spectrogram time-frequency distributions -- Later, the signal time-frequency representation from spectrogram is considered an image, and processed through a 2-dimensional Fourier Transform in order to perform the spatial Fourier analysis from it -- Finally features related with emotions in voiced speech are extracted and presented
Resumo:
Background: Schizophrenia is likely to be a consequence of DNA alterations that, together with environmental factors, will lead to protein expression differences and the ultimate establishment of the illness. The superior temporal gyrus is implicated in schizophrenia and executes functions such as the processing of speech, language skills and sound processing. Methods: We performed an individual comparative proteome analysis using two-dimensional gel electrophoresis of 9 schizophrenia and 6 healthy control patients' left posterior superior temporal gyrus (Wernicke's area - BA22p) identifying by mass spectrometry several protein expression alterations that could be related to the disease. Results: Our analysis revealed 11 downregulated and 14 upregulated proteins, most of them related to energy metabolism. Whereas many of the identified proteins have been previously implicated in schizophrenia, such as fructose-bisphosphate aldolase C, creatine kinase and neuron-specific enolase, new putative disease markers were also identified such as dihydrolipoyl dehydrogenase, tropomyosin 3, breast cancer metastasis-suppressor 1, heterogeneous nuclear ribonucleoproteins C1/C2 and phosphate carrier protein, mitochondrial precursor. Besides, the differential expression of peroxiredoxin 6 (PRDX6) and glial fibrillary acidic protein (GFAP) were confirmed by western blot in schizophrenia prefrontal cortex. Conclusion: Our data supports a dysregulation of energy metabolism in schizophrenia as well as suggests new markers that may contribute to a better understanding of this complex disease.
Resumo:
The purpose of the present study was to examine the benefits of providing audible speech to listeners with sensorineural hearing loss when the speech is presented in a background noise. Previous studies have shown that when listeners have a severe hearing loss in the higher frequencies, providing audible speech (in a quiet background) to these higher frequencies usually results in no improvement in speech recognition. In the present experiments, speech was presented in a background of multitalker babble to listeners with various severities of hearing loss. The signal was low-pass filtered at numerous cutoff frequencies and speech recognition was measured as additional high-frequency speech information was provided to the hearing-impaired listeners. It was found in all cases, regardless of hearing loss or frequency range, that providing audible speech resulted in an increase in recognition score. The change in recognition as the cutoff frequency was increased, along with the amount of audible speech information in each condition (articulation index), was used to calculate the "efficiency" of providing audible speech. Efficiencies were positive for all degrees of hearing loss. However, the gains in recognition were small, and the maximum score obtained by an listener was low, due to the noise background. An analysis of error patterns showed that due to the limited speech audibility in a noise background, even severely impaired listeners used additional speech audibility in the high frequencies to improve their perception of the "easier" features of speech including voicing
Resumo:
This work attempts to discuss, in the light of the French Analysis of the Discourse, how the concept of memory and heterogeneity in language actions can contribute to a reflection on information and documentation studies. Starting from cuttings of Clarice Lispector - the hour of the star exhibition pamphlet, accomplished in the second semester of 2007 by the Portuguese Language Museum (Luz train station, Sao Paulo), we interpreted the several voices that surround and sustain the subject and the sense.
Evaluation of oral-motor movements and speech in patients with tetanus of a public service in Brazil
Resumo:
The characterisation of oral-motor movements and speech of patients with tetanus were investigated to determine the existence of possible signs that are characteristic of this pathology. Thirteen patients clinically diagnosed with tetanus (10 with severe tetanus and three with very severe tetanus) and admitted to an intensive care unit underwent clinical evaluation of oral-motor movements and speech. Statistical analysis indicated significant between-group differences for speech motor functions, suggesting that individuals with very severe tetanus present rigidity as a characteristic interfering in articulatory precision (P = 0 035) and movement rate (P = 0 038). For lip closure, tongue movement, palatal elevation, gag reflex and voice quality, no between-group differences were identified for the specific abnormal characteristics. The observed abnormal results indicate that muscle strength and functional status of the oral-motor system presented by most of the participants of the study did not ensure the necessary integrity for satisfactory performance. The characterisation of the oral myofunctional aspects of patients with tetanus provides medical teams, patients and families with a wider and better description of the clinical situation, giving support to the diagnosis, prognostics and treatment.
Resumo:
Profound hearing loss is a disability that affects personality and when it involves teenagers before language acquisition, these bio-psychosocial conflicts can be exacerbated, requiring careful evaluation and choice of them for cochlear implant. Aim: To evaluate speech perception by adolescents with profound hearing loss, users of cochlear Implants. Study Design: Prospective. Materials and Methods: Twenty-five individuals with severe or profound pre-lingual hearing loss who underwent cochlear implantation during adolescence, between 10 to 17 years and 11 months, who went through speech perception tests before the implant and 2 years after device activation. For comparison and analysis we used the results from tests of four choice, recognition of vowels and recognition of sentences in a closed setting and the open environment. Results: The average percentage of correct answers in the four choice test before the implant was 46.9% and after 24 months of device use, this value went up to 86.1% in the vowels recognition test, the average difference was 45.13% to 83.13% and the sentences recognition test together in closed and open settings was 19.3% to 60.6% and 1.08% to 20.47% respectively. Conclusion: All patients, although with mixed results, achieved statistical improvement in all speech tests that were employed.