972 resultados para Voice analysis
Resumo:
We propose a study of the mathematical properties of voice as an audio signal -- This work includes signals in which the channel conditions are not ideal for emotion recognition -- Multiresolution analysis- discrete wavelet transform – was performed through the use of Daubechies Wavelet Family (Db1-Haar, Db6, Db8, Db10) allowing the decomposition of the initial audio signal into sets of coefficients on which a set of features was extracted and analyzed statistically in order to differentiate emotional states -- ANNs proved to be a system that allows an appropriate classification of such states -- This study shows that the extracted features using wavelet decomposition are enough to analyze and extract emotional content in audio signals presenting a high accuracy rate in classification of emotional states without the need to use other kinds of classical frequency-time features -- Accordingly, this paper seeks to characterize mathematically the six basic emotions in humans: boredom, disgust, happiness, anxiety, anger and sadness, also included the neutrality, for a total of seven states to identify
Resumo:
We propose a novel analysis alternative, based on two Fourier Transforms for emotion recognition from speech -- Fourier analysis allows for display and synthesizes different signals, in terms of power spectral density distributions -- A spectrogram of the voice signal is obtained performing a short time Fourier Transform with Gaussian windows, this spectrogram portraits frequency related features, such as vocal tract resonances and quasi-periodic excitations during voiced sounds -- Emotions induce such characteristics in speech, which become apparent in spectrogram time-frequency distributions -- Later, the signal time-frequency representation from spectrogram is considered an image, and processed through a 2-dimensional Fourier Transform in order to perform the spatial Fourier analysis from it -- Finally features related with emotions in voiced speech are extracted and presented
Resumo:
The evolution of cellular systems towards third generation (3G) or IMT-2000 seems to have a tendency to use W-CDMA as the standard access method, as ETSI decisions have showed. However, there is a question about the improvements in capacity and the wellness of this access method. One of the aspects that worry developers and researchers planning the third generation is the extended use of the Internet and more and more bandwidth hungry applications. This work shows the performance of a W-CDMA system simulated in a PC using cover maps generated with DC-Cell, a GIS based planning tool developed by the Technical University of Valencia, Spain. The maps are exported to MATLAB and used in the model. The system used consists of several microcells in a downtown area. We analyse the interference from users in the same cell and in adjacent cells and the effect in the system, assuming perfect control for each cell. The traffic generated by the simulator is voice and data. This model allows us to work with coverage that is more accurate and is a good approach to analyse the multiple access interference (MAI) problem in microcellular systems with irregular coverage. Finally, we compare the results obtained, with the performance of a similar system using TDMA.
Resumo:
Objetivo: Compreender o conhecimento e o uso da voz por mulheres que cantam em coral e as repercussões para a promoção da saúde. Métodos: Realizou-se estudo qualitativo, de dezembro de 2011 a fevereiro de 2012, com 13 mulheres de 23 a 66 anos, membros de um coral de uma universidade, em Fortaleza, Ceará, Brasil. Coletaram-se os dados através de entrevista semiestruturada. Aplicou-se a análise temática para organizar os resultados em categorias, analisando-as à luz do interacionismo simbólico. Resultados: Identificaram-se dois núcleos de sentido: conhecimento sobre voz e uso da voz. As coralistas definiram a voz como meio de comunicação, identidade pessoal e forma para expressar emoções. Elas não demonstraram conhecimento consistente sobre os aspectos anatômicos e fisiológicos da voz, mas as definições apresentadas mostram que elas entendem que a voz permeia espaços pessoais, sociais e profissionais. A voz profissional e o envelhecimento destacaram-se no contexto do uso vocal. As participantes reconhecem que o conhecimento e o uso da voz podem ser aprimorados pelas atividades no coral, o que remete à promoção da saúde. Conclusão: As coralistas apresentam conhecimento limitado sobre a saúde vocal, porém, compreendem os efeitos benéficos do coral sobre sua saúde, ampliando a compreensão sobre a voz; isso estimula a adoção de hábitos saudáveis e de medidas preventivas, o que favorece o uso vocal.
Resumo:
Voice acoustic analysis is becoming more and more usefúl in diagnosis of voice disorders or laryngological pathologies. The facility to record a voice sigiial is an advantage over other invasive techniques. This paper presents the statistical analyzes ofa set of voice parameters like jitter, shimmer and HNR over a 4 groups of subjects vvith dysphonia, fünctional dysphonia, hyperfünctional dysphonia, and psychogenic dysphonia and a control group. No statistical signifícance differences over pathologic groups were found but clear tendencies can be seen between pathologic and control group. The tendencies indicates this parameters as a good features to be used in an intelligent diagnosis system, moreover the jitter and shimmer parameters measured over different tones and vowels.
Resumo:
Background: Long-term exposure to infrasound and low frequency noise (ILFN <500 Hz, including infrasound) can lead to the development of vibroacoustic disease (VAD). VAD is a systemic pathology characterized by the abnormal growth of extracellular matrices in the absence of inflammatory processes, namely of collagen and elastin, both of which are abundant in the basement membrane zone of the vocal folds. ILFN-exposed workers include pilots, cabin crewmembers, restaurant workers, ship machinists and, in previous studies, even though they did not present vocal symptoms, ILFN-exposed workers had significant different voice acoustic patterns (perturbation and temporal measures) when compared with normative population. Study Aims: The present study investigates the effects of age and years of occupational ILFN-exposure on voice acoustic parameters of 37 cabin crewmembers: 12 males and 25 females. Specifically, the goals of this study are to: 1) Verify if acoustic parameters change over the age and years of ILFN-exposure and 2) Determine if there is any interaction between age and years of ILFNexposure on voice acoustic parameters of crewmembers. Materials and Methods: Spoken phonatory tasks were recorded with a C420III PP AKG head-worn microphone and a DA-P1 Tascam DAT. Acoustic analyses were performed using KayPENTAX Computer Speech Lab and Multi-Dimensional Voice Program. Acoustic parameters included speaking fundamental frequency, perturbation measures (jitter, shimmer and harmonicto- noise ratio), temporal measures (maximum phonation time and s/z ratio) and voice tremor frequency. Results: One-way ANOVA analysis revealed that as the number of ILFN-exposure years increased male cabin crewmembers presented significant different shimmer values of /i/ as well as tremor frequency of /u/. Females presented significantly different jitter % of /i, a, O/ (p <0.05). Lastly, Two-way ANOVA analysis revealed that for females, there was a significant interaction between age and occupational ILFN-exposure for voice acoustic parameters, namely for jitter’s mean for /a, O/ and shimmer’s (%) mean for /a, i/ (p <0.05). Discussion and Conclusion: These perturbation measure patterns may be indicative of histological changes within the vocal folds as a result of ILFN-exposure. The results of this study suggest that voice acoustic analysis may be an important tool for confirming ILFN-induced health effects.
Resumo:
Background: Vibroacoustic disease (VAD) is a systematic pathology characterized by the abnormal growth of extra-cellular matrices in the absence of infl ammatory processes, namely collagen and elastin, both of which are abundant in the basement membrane zone of the vocal folds. VAD can develop due to long-term exposure to infrasound and low-frequency noise (ILFN, <500 Hz). Mendes et al. (2006, 2008 and 2012) revealed that ILFN-exposed males and females presented an increased fundamental frequency (F0), decreased jitter %, and reduced maximum phonation frequency range, when compared with normative data. Temporal measures of maximum phonation time and S/Z ratio were generally reduced. Study Aims: Herein, the same voice acoustic parameters of 48 males, 36 airline pilots and 12 cabin crewmembers (age range 25-60 years) were studied, and the effects and interaction of age and years of ILFN exposure were investigated within those parameters. ILFN-exposure time (i.e. years of professional activity) ranged from 3.5 to 36 years. Materials and Methods: Spoken and sung phonatory tasks were recorded with a DA-P1 Tascam DAT and a C420III PP AKG head-worn microphone, positioned at 3 cm from the mouth. Acoustic analyses were performed using KayPENTAX Computer Speech Lab and Multi-Dimensional Voice Program. Results: Results revealed that even though pilots and cabin crewmembers were exposed to occupational environments with distinct (ILFN-rich) acoustical frequency distributions and sound pressure levels, differences in the vocal acoustic parameters were not evident. Analyzing data from both professional groups (N = 48) revealed that F0 increased signifi cantly with the number of years of professional activity. Conclusion: These results strongly suggest that the number of years of professional activity (i.e. total ILFN exposure time) had a signifi cant effect on F0. Furthermore, they may refl ect the histological changes specifi cally observed on the vocal folds of ILFN-exposed professionals.
Resumo:
We present an advanced method to achieve natural modifications when applying a pitch shifting process to singing voice by modifying the spectral envelope of the audio ex- cerpt. To this end, an all-pole spectral envelope model has been selected to describe the global variations of the spectral envelope with the changes of the pitch. We performed a pitch shifting process of some sustained vowels with the envelope processing and without it, and compared both by means of a survey open to volunteers in our website.
Resumo:
This paper analyses the relation of feminine voice performance in the years of radio age and the way the brazilians singers sings today. The goal is to analyze enunciative traces of a singular subjectivity anchored in the singing voice. The paper focus the moment, since the years of 1980, when the feminine voice no longer sounds like the singers of the gold radio time. In this period, to display a dramatic mark in the voice was the production conditions of the singing woman. In the area of the French school of discourse analysis, this paper is a part of a larger research in progress. We intend to describe the certain mode of feminine subjectivity acting in the voice as an act of enonciation.
Resumo:
Inserted in the perspective of literary studies, this paper proposes an analysis of the “Cartas Portuguesas” (Portuguese Letters), a work attributed to Mariana Alcoforado, assuming that this work is constituted within the Lusitanian literature as an important formative element of the imaginary loving Portuguese female voice. Through the study, it is possible to identify the fact that the letters are prefaced, stylistically or thematically, by the songs of love and of friend, and succeeded by works such as “Livro de Sóror Saudade” (Book of Longing Sóror), of Florbela Espanca, and “Novas Cartas Portuguesas” (New Portuguese Letters) by Maria Isabel Barreno, Maria Velho da Costa and Maria Teresa Horta.
Resumo:
This article aims to analyze the book Bufo & Spallanzani (1985), of the Brazilian writer Ruben Fonseca, in order to observe how it approaches and/or departs from some characteristics attributed to the Detective Novel, particularly in light of Tzvetan Todorov's theory. For that, this reading turns to the study of the multifaceted figure of the narrator, who is, at the same time, writer and murderer. Therefore, it aims to clarify what are the implications, aesthetic and/or otherwise, of the voice given to the killer-writer, as well as what is the role of the detective in the condiction of a secondary character.
Resumo:
Raman spectroscopy of formamide-intercalated kaolinites treated using controlled-rate thermal analysis technology (CRTA), allowing the separation of adsorbed formamide from intercalated formamide in formamide-intercalated kaolinites, is reported. The Raman spectra of the CRTA-treated formamide-intercalated kaolinites are significantly different from those of the intercalated kaolinites, which display a combination of both intercalated and adsorbed formamide. An intense band is observed at 3629 cm-1, attributed to the inner surface hydroxyls hydrogen bonded to the formamide. Broad bands are observed at 3600 and 3639 cm-1, assigned to the inner surface hydroxyls, which are hydrogen bonded to the adsorbed water molecules. The hydroxyl-stretching band of the inner hydroxyl is observed at 3621 cm-1 in the Raman spectra of the CRTA-treated formamide-intercalated kaolinites. The results of thermal analysis show that the amount of intercalated formamide between the kaolinite layers is independent of the presence of water. Significant differences are observed in the CO stretching region between the adsorbed and intercalated formamide.
Resumo:
Diffusion equations that use time fractional derivatives are attractive because they describe a wealth of problems involving non-Markovian Random walks. The time fractional diffusion equation (TFDE) is obtained from the standard diffusion equation by replacing the first-order time derivative with a fractional derivative of order α ∈ (0, 1). Developing numerical methods for solving fractional partial differential equations is a new research field and the theoretical analysis of the numerical methods associated with them is not fully developed. In this paper an explicit conservative difference approximation (ECDA) for TFDE is proposed. We give a detailed analysis for this ECDA and generate discrete models of random walk suitable for simulating random variables whose spatial probability density evolves in time according to this fractional diffusion equation. The stability and convergence of the ECDA for TFDE in a bounded domain are discussed. Finally, some numerical examples are presented to show the application of the present technique.