935 resultados para Offensive speech


Relevância:

20.00% 20.00%

Publicador:

Resumo:

The aim of this thesis is to investigate computerized voice assessment methods to classify between the normal and Dysarthric speech signals. In this proposed system, computerized assessment methods equipped with signal processing and artificial intelligence techniques have been introduced. The sentences used for the measurement of inter-stress intervals (ISI) were read by each subject. These sentences were computed for comparisons between normal and impaired voice. Band pass filter has been used for the preprocessing of speech samples. Speech segmentation is performed using signal energy and spectral centroid to separate voiced and unvoiced areas in speech signal. Acoustic features are extracted from the LPC model and speech segments from each audio signal to find the anomalies. The speech features which have been assessed for classification are Energy Entropy, Zero crossing rate (ZCR), Spectral-Centroid, Mean Fundamental-Frequency (Meanf0), Jitter (RAP), Jitter (PPQ), and Shimmer (APQ). Naïve Bayes (NB) has been used for speech classification. For speech test-1 and test-2, 72% and 80% accuracies of classification between healthy and impaired speech samples have been achieved respectively using the NB. For speech test-3, 64% correct classification is achieved using the NB. The results direct the possibility of speech impairment classification in PD patients based on the clinical rating scale.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Johansson, Fredrik (2012). Filmljudets funktioner i dramafilm – En audio-visuell analys av filmen The King´s Speech. Examensuppsats inom Ljudproduktion, Högskolan Dalarna, Akademin för språk och medier, Falun. I denna uppsats undersöktes filmljudet i dramafilmen The King´s Speech. Detta för att ta reda på vilka funktioner filmljudet fyller i de valda sekvenserna ur nämnda film, samt hur ljudet är placerat i filmens flerkanalsmix. Filmen granskades med hjälp av en audio-visuell analys. Denna metod går ut på att ljudet och bilden undersöks separat, för att sedan åter kombineras och analyseras som helhet. Den audio-visuella analysmetod som använts kommer från ljudteoretikern Michel Chion, och kallas Masking. Resultatet av den audio-visuella analysen pekade mot att ljudets huvudsakliga funktioner var att skapa en realistisk skildring av karaktärer och omgivningar, skapa en känsla av närvaro, samt att skapa och bibehålla olika perspektiv i den narrativa världen. Den stora majoriteten av ljud visade sig vara placerade i centerkanalen, medan främst ickediegetisk musik och ambiensljud var placerade i front- och surroundkanalerna. Detta kanalanvändande tycktes gynna de funna funktionerna, främst genom att bidra till känslan av närvaro och realism, genom att omsluta filmpubliken med ambienta ljud.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Background: Voice processing in real-time is challenging. A drawback of previous work for Hypokinetic Dysarthria (HKD) recognition is the requirement of controlled settings in a laboratory environment. A personal digital assistant (PDA) has been developed for home assessment of PD patients. The PDA offers sound processing capabilities, which allow for developing a module for recognition and quantification HKD. Objective: To compose an algorithm for assessment of PD speech severity in the home environment based on a review synthesis. Methods: A two-tier review methodology is utilized. The first tier focuses on real-time problems in speech detection. In the second tier, acoustics features that are robust to medication changes in Levodopa-responsive patients are investigated for HKD recognition. Keywords such as Hypokinetic Dysarthria , and Speech recognition in real time were used in the search engines. IEEE explorer produced the most useful search hits as compared to Google Scholar, ELIN, EBRARY, PubMed and LIBRIS. Results: Vowel and consonant formants are the most relevant acoustic parameters to reflect PD medication changes. Since relevant speech segments (consonants and vowels) contains minority of speech energy, intelligibility can be improved by amplifying the voice signal using amplitude compression. Pause detection and peak to average power rate calculations for voice segmentation produce rich voice features in real time. Enhancements in voice segmentation can be done by inducing Zero-Crossing rate (ZCR). Consonants have high ZCR whereas vowels have low ZCR. Wavelet transform is found promising for voice analysis since it quantizes non-stationary voice signals over time-series using scale and translation parameters. In this way voice intelligibility in the waveforms can be analyzed in each time frame. Conclusions: This review evaluated HKD recognition algorithms to develop a tool for PD speech home-assessment using modern mobile technology. An algorithm that tackles realtime constraints in HKD recognition based on the review synthesis is proposed. We suggest that speech features may be further processed using wavelet transforms and used with a neural network for detection and quantification of speech anomalies related to PD. Based on this model, patients' speech can be automatically categorized according to UPDRS speech ratings.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This essay studies how dialectal speech is reflected in written literature and how this phenomenon functions in translation. With this purpose in mind, Styron's Sophie's Choice and Twain's The Adventures of Huckleberry Finn are analysed using samples of non-standard orthography which have been applied in order to reflect the dialect, or accent, of certain characters. In the same way, Lundgren's Swedish translation of Sophie's Choice and Ferres and Rolfe's Spanish version of The Adventures of Huckleberry Finn are analysed. The method consists of linguistically analysing a few text samples from each novel, establishing how dialect is represented through non-standard orthography, and thereafter, comparing the same samples with their translation into another language in order to establish whether dialectal features are visible also in the translated novels. It is concluded that non-standard orthography is applied in the novels in order to represent each possible linguistic level, including pronunciation, morphosyntax, and vocabulary. Furthermore, it is concluded that while Lundgren's translation intends to orthographically represent dialectal speech on most occasions where the original does so, Ferres and Rolfe's translation pays no attention to dialectology. The discussion following the data analysis establishes some possible reasons for the exclusion of dialectal features in the Spanish translation considered here. Finally, the reason for which this study contributes to the study of dialectology is declared.

Relevância:

20.00% 20.00%

Publicador:

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Escola de Direito de São Paulo da Fundação Getulio Vargas

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The purpose of this study was to determine the influence of hearing protection devices (HPDs) on the understanding of speech in young adults with normal hearing, both in a silent situation and in the presence of ambient noise. The experimental research was carried out with the following variables: five different conditions of HPD use (without protectors, with two earplugs and with two earmuffs); a type of noise (pink noise); 4 test levels (60, 70, 80 and 90 dB[A]); 6 signal/noise ratios (without noise, + 5, + 10, zero, - 5 and - 10 dB); 5 repetitions for each case, totalling 600 tests with 10 monosyllables in each one. The variable measure was the percentage of correctly heard words (monosyllabic) in the test. The results revealed that, at the lowest levels (60 and 70 dB), the protectors reduced the intelligibility of speech (compared to the tests without protectors) while, in the presence of ambient noise levels of 80 and 90 dB and unfavourable signal/noise ratios (0, -5 and -10 dB), the HPDs improved the intelligibility. A comparison of the effectiveness of earplugs versus earmuffs showed that the former offer greater efficiency in respect to the recognition of speech, providing a 30% improvement over situations in which no protection is used. As might be expected, this study confirmed that the protectors' influence on speech intelligibility is related directly to the spectral curve of the protector's attenuation. (C) 2003 Elsevier B.V. Ltd. All rights reserved.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Speech signals degraded by additive noise can affects different applications in telecommunication. The noise may degrades the intelligibility of the speech signals and its waveforms as well. In some applications such as speech coding, both intelligibility and waveform quality are important but only intelligibility has been focused lastly. So, modern speech quality measurement techniques such as PESQ (Perceptual Evaluation of Speech Quality) have been used and classical distortion measurement techniques such as Cepstral Distance are becoming unused. In this paper it is shown that some classical distortion measures are still important in applications where speech corrupted by additive noise has to be evaluated.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Williams syndrome (WS) is a neurodevelopmental genetic disorder, often referred as being characterized by dissociation between verbal and non-verbal abilities, although the number of studies disputing this proposal is emerging. Indeed, although they have been traditionally reported as displaying increased speech fluency, this topic has not been fully addressed in research. In previous studies carried out with a small group of individuals with WS, we reported speech breakdowns during conversational and autobiographical narratives suggestive of language difficulties. In the current study, we characterized the speech fluency profile using an ecologically based measure - a narrative task (story generation) was collected from a group of individuals with WS (n = 30) and typically developing group (n = 39) matched in mental age. Oral narratives were elicited using a picture stimulus - the cookie theft picture from Boston Diagnosis Aphasia Test. All narratives were analyzed according to typology and frequency of fluency breakdowns (non-stuttered and stuttered disfluencies). Oral narratives in WS group differed from typically developing group, mainly due to a significant increase in the frequency of disfluencies, particularly in terms of hesitations, repetitions and pauses. This is the first evidence of disfluencies in WS using an ecologically based task (oral narrative task), suggesting that these speech disfluencies may represent a significant marker of language problems in WS. (C) 2011 Elsevier Ltd. All rights reserved.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Objective: To evaluate the maximum residual signal auto-correlation also known as pitch amplitude (PA) values in patients with Parkinson's disease (PD) patients. Method. The signals of 21 Parkinson's patients were compared with 15 healthy individuals, divided according age and gender. Results: Statistical difference was seen between groups for PA, 0.39 for controls and 0.25 for PD. Normal value threshold was set as 0.3; (p <= 0.001). In the Parkinson's group 80.77%, and in the control group only 12.28%, had a PA < 0.3 demonstrating an association between these variables. The dispersion diagram for age and PA for PD individuals showed p=0.01 and r=0.54. There was no significant difference in relation to gender and PA between groups: Conclusion: the significant differences in pitch's amplitude between PD patients and healthy individuals demonstrate the methods specificity.-The results showed the need of prospective controlled studies,to improve the use and indications of residual signal auto-correlation to evaluate speech in PD patients.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Characteristics of speech, especially figures of speech, are used by specific communities or domains, and, in this way, reflect their identities through their choice of vocabulary. This topic should be an object of study in the context of knowledge representation once it deals with different contexts of production of documents. This study aims to explore the dimensions of the concepts of euphemism, dysphemism, and orthophemism, focusing on the latter with the goal of extracting a concept which can be included in discussions about subject analysis and indexing. Euphemism is used as an alternative to a non-preferred expression or as an alternative to an offensive attribution-to avoid potential offense taken by the listener or by other persons, for instance, pass away. Dysphemism, on the other hand, is used by speakers to talk about people and things that frustrate and annoy them-their choice of language indicates disapproval and the topic is therefore denigrated, humiliated, or degraded, for instance, kick the bucket. While euphemism tries to make something sound better, dysphemism tries to make something sound worse. Orthophemism (Allan and Burridge 2006) is also used as an alternative to expressions, but it is a preferred, formal, and direct language of expression when representing an object or a situation, for instance, die. This paper suggests that the comprehension and use of such concepts could support the following issues: possible contributions from linguistics and terminology to subject analysis as demonstrated by Talamo et al. (1992); decrease of polysemy and ambiguity of terms used to represent certain topics of documents; and construction and evaluation of indexing languages. The concept of orthophemism can also serves to support associative relationships in the context of subject analysis, indexing, and even information retrieval related to more specific requests.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper describes a speech enhancement system (SES) based on a TMS320C31 digital signal processor (DSP) for real-time application. The SES algorithm is based on a modified spectral subtraction method and a new speech activity detector (SAD) is used. The system presents a medium computational load and a sampling rate up to 18 kHz can be used. The goal is load and a sampling rate up to 18 kHz can be used. The goal is to use it to reduce noise in an analog telephone line.