995 results for Analysis of the speech
Abstract:
Many of the items in the “Speech, Spatial, and Qualities of Hearing” scale questionnaire [S. Gatehouse and W. Noble, Int. J. Audiol. 43, 85–99 (2004)] are concerned with speech understanding in a variety of backgrounds, both speech and nonspeech. To study whether these self-report data reflected informational masking, previously collected data on 414 people were analyzed. The lowest scores (greatest difficulties) were found for the two items in which there were two speech targets, with successively higher scores for competing speech (six items), energetic masking (one item), and no masking (three items). The results suggest significant masking by competing speech in everyday listening situations.
Abstract:
This paper describes certain findings of an intonation and intensity study of emotive speech carried out with minimal use of signal processing algorithms. The study was based on six basic emotions and the neutral state, elicited from 1660 English utterances obtained from the speech recordings of six Indian women. The correctness of the emotional content was verified through perceptual listening tests. Marked similarity was noted among pitch contours of like-worded, positive valence emotions, though no such similarity was observed among the four negative valence emotional expressions. The intensity patterns were also studied. The results of the study were validated using arbitrary television recordings for four emotions. The findings are useful to technical researchers, social psychologists and to the common man interested in the dynamics of vocal expression of emotions.
Abstract:
This paper reviews a study to analyze the number of times alphabet symbols occur in three commonly used basal reader series.
Abstract:
This paper discusses a study that examined acoustic measures and their relationship to the speech intelligibility of children with cochlear implants.
Abstract:
The assumption that ignoring irrelevant sound in a serial recall situation is identical to ignoring a non-target channel in dichotic listening is challenged. Dichotic listening is open to moderating effects of working memory capacity (Conway et al., 2001) whereas irrelevant sound effects (ISE) are not (Beaman, 2004). A right ear processing bias is apparent in dichotic listening, whereas the bias is to the left ear in the ISE (Hadlington et al., 2004). Positron emission tomography (PET) imaging data (Scott et al., 2004, submitted) show bilateral activation of the superior temporal gyrus (STG) in the presence of intelligible, but ignored, background speech and right hemisphere activation of the STG in the presence of unintelligible background speech. It is suggested that the right STG may be involved in the ISE and a particularly strong left ear effect might occur because of the contralateral connections in audition. It is further suggested that left STG activity is associated with dichotic listening effects and may be influenced by working memory span capacity. The relationship of this functional and neuroanatomical model to known neural correlates of working memory is considered.
Abstract:
This essay identifies and analyses rhetorical devices in Gordon Brown’s speech delivered at the Labour Party conference on September 25, 2006. The aim of the study was to identify specific rhetorical devices which are described as interactional resources, analyse their uses and discuss possible effects that they may have when included in a political speech. The results are based on my own interpretations but are supported by information provided in current literature by analysts and researchers of rhetoric use. The findings could probably serve as evidence of the need for better understanding of the devices used by politicians in their relentless endeavours to influence audience decisions.
Abstract:
In professional competition, the coach plays a fundamental role in managing a team and, more specifically, in game planning. During competition, the management of pauses and time-outs, as well as the coach's conduct during them, is an aspect of sports performance worth analyzing. It is therefore necessary to know which coach behaviors are most frequent and most closely related to positive player performance. To this end, a study was carried out of 7 cases of expert coaches whose verbal behavior was observed during 4 games. The study focused solely on the verbal content of the information, that is, on its meaning. The data obtained show a greater quantity of information delivered during game pauses, with greater tactical content, compared with moments of play. In addition, an adverse score is associated with a greater number of questions and a smaller number of psychological instructions, whereas in the case of victory there is no direct relation with any category. The remaining speech categories are not directly influenced by the result, so no direct and immediate relation can be assumed between the coach's verbal behavior during pauses and the result of the game, except at specific moments.
Abstract:
The paper analyzes the performance of the unconstrained filtered-x LMS (FxLMS) algorithm for active noise control (ANC), where the constraints that the controller be causal and have a finite impulse response are removed. It is shown that the unconstrained FxLMS algorithm, if stable, always converges to the true optimum filter, even if the estimation of the secondary path is not perfect, and its final mean square error is independent of the secondary path. Moreover, we show that the sufficient and necessary stability condition for the feedforward unconstrained FxLMS is that the maximum phase error of the secondary path estimation must be within 90°, which is the only necessary condition for the feedback unconstrained FxLMS. The significance of the analysis on a practical system is also discussed. Finally, we show how the obtained results can guide the design of a robust feedback ANC headset.
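The role of the 90° phase condition can be illustrated with a deliberately simplified sketch: a single complex controller weight adapted at one frequency. This is not the paper's analysis; the path values, the step size `mu`, the sign convention for the residual, and the function name `single_bin_fxlms` are all assumptions chosen for illustration. With a small step size, the weight should settle at the ratio of primary to secondary path when the phase error of the estimate is below 90°, and drift away when it is above.

```python
import cmath
import math


def single_bin_fxlms(phase_error_deg, mu=0.05, steps=2000):
    """Adapt one complex controller weight at a single frequency.

    All numeric values are made-up illustration values, not taken
    from the paper. Returns (final_weight, optimum_weight).
    """
    primary = 0.8 * cmath.exp(1j * 0.3)      # assumed primary path response P
    secondary = 1.2 * cmath.exp(-1j * 0.7)   # assumed true secondary path S
    # Secondary path estimate: correct magnitude, deliberate phase error.
    secondary_hat = secondary * cmath.exp(1j * math.radians(phase_error_deg))
    x = 1.0 + 0.0j                           # reference signal at this frequency
    w = 0.0 + 0.0j                           # controller weight
    for _ in range(steps):
        # Residual noise (assumed convention: disturbance minus anti-noise).
        error = primary * x - secondary * w * x
        # LMS update using the reference filtered by the *estimated* path.
        w += mu * (secondary_hat * x).conjugate() * error
        if abs(w) > 1e6:                     # stop once clearly diverging
            break
    return w, primary / secondary


w_ok, w_opt = single_bin_fxlms(60.0)    # phase error below 90 degrees
w_bad, _ = single_bin_fxlms(120.0)      # phase error above 90 degrees
print("distance from optimum, 60 deg error: ", abs(w_ok - w_opt))
print("distance from optimum, 120 deg error:", abs(w_bad - w_opt))
```

In this toy setting the weight error is multiplied each step by `1 - mu * |x|^2 * conj(S_hat) * S`, so convergence for small `mu` depends on the real part of `conj(S_hat) * S` being positive, which is the same as the phase error staying within 90°. The fixed point is `P / S` regardless of the estimate.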
Abstract:
Adaptive filters used in code division multiple access (CDMA) receivers to counter interference have been formulated both with and without the assumption of training symbols being transmitted. They are known as training-based and blind detectors respectively. We show that the convergence behaviour of the blind minimum-output-energy (MOE) detector can be quite easily derived, unlike what was implied by the procedure outlined in a previous paper. The simplification results from the observation that the correlation matrix determining convergence performance can be made symmetric, after which many standard results from the literature on least mean square (LMS) filters apply immediately.
Abstract:
We present a novel approach for the detection of severe obstructive sleep apnea (OSA) based on patients' voices, introducing nonlinear measures to describe sustained speech dynamics. Nonlinear features were combined with state-of-the-art speech recognition systems using statistical modeling techniques (Gaussian mixture models, GMMs) over cepstral parameterization (MFCC) for both continuous and sustained speech. Tests were performed on a database including speech records from both severe OSA and control speakers. A 10% relative reduction in classification error was obtained for sustained speech when combining MFCC-GMM and nonlinear features, and 33% when fusing nonlinear features with both sustained and continuous MFCC-GMM. Accuracy reached 88.5%, allowing the system to be used in OSA early detection. Tests showed that nonlinear features and MFCCs are weakly correlated on sustained speech, but uncorrelated on continuous speech. Results also suggest the existence of nonlinear effects in OSA patients' voices, which should be found in continuous speech.
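The reductions quoted above are relative, not absolute, which is easy to misread. The short sketch below shows how a relative reduction in classification error is computed. The error rates used are hypothetical placeholders, since the abstract does not report the baseline error, and the function name is an assumption.

```python
def relative_error_reduction(baseline_error, new_error):
    """Fraction by which the classification error shrinks relative to baseline."""
    if baseline_error <= 0:
        raise ValueError("baseline_error must be positive")
    return (baseline_error - new_error) / baseline_error


# Hypothetical error rates, chosen only to illustrate the arithmetic.
baseline = 0.20   # assumed error of a baseline classifier
combined = 0.18   # assumed error after adding further features

reduction = relative_error_reduction(baseline, combined)
print(f"relative reduction: {reduction:.0%}")
```

With these placeholder values, a drop of two percentage points in absolute error corresponds to a 10% relative reduction.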
Abstract:
In this paper, we describe an algorithm that automatically detects and labels peaks I-VII of the normal, suprathreshold auditory brainstem response (ABR). The algorithm proceeds in three stages, with the option of a fourth: (1) all candidate peaks and troughs in the ABR waveform are identified using zero crossings of the first derivative, (2) peaks I-VII are identified from these candidate peaks based on their latency and morphology, (3) if required, peaks II and IV are identified as points of inflection using zero crossings of the second derivative and (4) interpeak troughs are identified before peak latencies and amplitudes are measured. The performance of the algorithm was estimated on a set of 240 normal ABR waveforms recorded using a stimulus intensity of 90 dBnHL. When compared to an expert audiologist, the algorithm correctly identified the major ABR peaks (I, III and V) in 96-98% of the waveforms and the minor ABR peaks (II, IV, VI and VII) in 45-83% of waveforms. Whilst peak II was correctly identified in only 83% and peak IV in 77% of waveforms, it was shown that 5% of the peak II identifications and 31% of the peak IV identifications came as a direct result of allowing these peaks to be found as points of inflection. Copyright (C) 2005 S. Karger AG, Basel.
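Stage (1) of the algorithm can be sketched as follows. This is a minimal illustration of finding candidate peaks and troughs from sign changes of a discrete first derivative, not the authors' implementation. The function name, the use of a simple first difference, and the sample waveform are assumptions, and flat plateaus are ignored for simplicity. The latency and morphology rules of stage (2) are not attempted.

```python
def find_candidate_extrema(samples):
    """Return (peak_indices, trough_indices) for a sampled waveform.

    A peak is a sample where the first difference changes sign from
    positive to negative; a trough is the opposite sign change.
    """
    # Discrete first derivative: diffs[i] = samples[i + 1] - samples[i].
    diffs = [b - a for a, b in zip(samples, samples[1:])]
    peaks, troughs = [], []
    for i in range(1, len(diffs)):
        rising_before = diffs[i - 1] > 0
        falling_before = diffs[i - 1] < 0
        if rising_before and diffs[i] < 0:
            peaks.append(i)      # derivative crosses zero going down
        elif falling_before and diffs[i] > 0:
            troughs.append(i)    # derivative crosses zero going up
    return peaks, troughs


# Made-up waveform samples, not ABR data.
waveform = [0, 1, 3, 2, 1, 2, 4, 3, 0]
peaks, troughs = find_candidate_extrema(waveform)
print("candidate peaks at indices:", peaks)
print("candidate troughs at indices:", troughs)
```

On real recordings the waveform would normally be smoothed first, since noise produces many spurious zero crossings. How the authors handled this is not stated in the abstract.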
Abstract:
AIM: To establish a simple method to quantify muscle/fat constituents in cervical muscles of asymptomatic women using magnetic resonance imaging (MRI), and to determine whether there is an age effect within a defined age range. MATERIALS AND METHODS: MRI of the upper cervical spine was performed for 42 asymptomatic women aged 18-45 years. The muscle and fat signal intensities on axial spin echo T1-weighted images were quantitatively classified by taking a ratio of the pixel intensity profiles of muscle against those of intermuscular fat for the rectus capitis posterior major and minor and inferior obliquus capitis muscles bilaterally. Inter- and intra-examiner agreement was scrutinized. RESULTS: The average relative values of fat within the upper cervical musculature compared with intermuscular fat indicated that there were only slight variations in indices between the three sets of muscles. There was no significant correlation between age and fat indices. There were significant differences for the relative fat within the muscle compared with intermuscular fat and body mass index for the right rectus capitis posterior major and right and left inferior obliquus capitis muscles (p = 0.032). Intraclass correlation coefficients for intraobserver agreement ranged from 0.94 to 0.98. Inter-rater agreement of the measurements ranged from 0.75 to 0.97. CONCLUSION: A quantitative measure of muscle/fat constituents has been developed, and results of this study indicate that relative fatty infiltration is not a feature of age in the upper cervical extensor muscles of women aged 18-45 years. (C) 2005 The Royal College of Radiologists. Published by Elsevier Ltd. All rights reserved.