13 results for Photo-voice

at Universidad Politécnica de Madrid


Relevance:

20.00% 20.00%

Publisher:

Abstract:

Voice biometry is classically based mainly on the parameterization and patterning of speech features. The present approach is based instead on the characterization of phonation (glottal) features. The intention is to reduce the intra-speaker variability due to the 'text'. The study of larynx biomechanics shows that the glottal correlates constitute a family of second-order Gaussian wavelets. The methodology relies on the extraction of glottal correlates (the glottal source), which are parameterized using wavelet techniques. Classification and pattern matching were carried out using Gaussian Mixture Models. Speaker data from a balanced database and from NIST SRE HASR2 were used in verification experiments. Preliminary results are given and discussed.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

The dramatic impact of degenerative neurological pathologies on quality of life is a growing concern. It is well known that many neurological diseases leave a fingerprint in voice and speech production. Many techniques have been designed for the detection, diagnosis, and monitoring of neurological disease. Most of them are costly or difficult to extend to primary care medical services. This paper shows how some neurological diseases can be traced at the level of phonation. The detection procedure would be based on a simple voice test. The availability of advanced tools and methodologies for monitoring the organic pathology of voice would facilitate the adoption of these tests. The paper hypothesizes that some of the underlying mechanisms affecting voice production produce measurable correlates in vocal fold biomechanics. A general description is given of the methodological foundations of a voice analysis system that can estimate correlates of neurological disease. Some case studies are presented to illustrate the possibilities of the methodology for monitoring neurological diseases by voice.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

This study assessed the applicability of a ferrous oxalate mediated photo-Fenton pretreatment for indigo-dyed wastewaters, in order to produce an effluent biodegradable enough to be sent on to conventional biological processes. The photochemical treatment was performed with ferrous oxalate and hydrogen peroxide in a Compound Parabolic Concentrator (CPC) under batch operation conditions. The reaction was studied at natural pH conditions (5–6) with indigo concentrations in the range of 6.67–33.33 mg L⁻¹, using a fixed oxalate-to-iron mass ratio (C₂O₄²⁻/Fe²⁺ = 35) and assessing the system's biodegradability at low (257 mg L⁻¹) and high (1280 mg L⁻¹) H₂O₂ concentrations. To seek the optimal conditions for the treatment of indigo-dyed wastewaters, an experimental design consisting of a statistical response surface approach was carried out. This analysis revealed that the best removal efficiencies for Total Organic Carbon (TOC) were obtained at low peroxide doses. In general, it was observed that after 20 kJ L⁻¹, almost every treated effluent increased its biodegradability from a BOD₅/COD value of 0.4. This increase in biodegradability was confirmed by the presence of short-chain carboxylic acids as intermediate products and by the mineralization of organic nitrogen into nitrate. Finally, an overall decrease in the LC₅₀ for Artemia salina indicated a successful detoxification of the effluent.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

GaN and InGaN nanocolumns of various compositions are studied by room-temperature photoluminescence (PL) under different ambient conditions. GaN nanocolumns exhibit a reversible quenching upon exposure to air under constant UV excitation, following a t^(−1/2) time dependence and resulting in a total reduction of intensity by 85–90%, as compared to PL measured in vacuum, with no spectral change. This effect is not observed when exposing the samples to pure nitrogen. We attribute this effect to photoabsorption and photodesorption of oxygen that modifies the surface potential bending. InGaN nanocolumns, under the same experimental conditions, do not show the same quenching features: the high-energy part of the broad PL line is not modified by exposure to air, whereas a lower-energy part, which does quench by 80–90%, can now be distinguished. We discuss the different behaviors in terms of carrier localization and possible composition or strain gradients in the InGaN nanocolumns.
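
As a rough illustration of how a t^(−1/2) time dependence could be checked, the sketch below fits the slope of log(intensity) against log(time) by ordinary least squares. It assumes the quenched PL intensity itself scales as a power of time, which the abstract does not state in full. The function name and the data are hypothetical, and the data are synthetic rather than measurements from the paper.

```python
import math


def fit_power_law_exponent(times, intensities):
    """Least-squares slope of log(intensity) versus log(time)."""
    xs = [math.log(t) for t in times]
    ys = [math.log(i) for i in intensities]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx


# Synthetic, illustrative data only: an ideal t^(-1/2) decay.
times = [1.0, 2.0, 4.0, 8.0, 16.0]
intensities = [t ** -0.5 for t in times]
exponent = fit_power_law_exponent(times, intensities)
```

On real data the fitted exponent would only approximate −0.5, and the fit range would need to exclude any initial transient or saturation plateau.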

Relevance:

20.00% 20.00%

Publisher:

Abstract:

We have analyzed the increase of the sheet conductance (ΔG□) under spectral illumination in high-dose Ti-implanted Si samples subsequently processed by pulsed-laser melting. Samples with Ti concentration clearly above the insulator-metal transition limit show a remarkably high ΔG□, even higher than that measured in a silicon reference sample. This increase in the ΔG□ magnitude is contrary to the classic understanding of the action of recombination centers and supports the lifetime recovery predicted for concentrations of deep levels above the insulator-metal transition.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

The use of nonlinear analysis techniques in automatic voice pathology detection systems has gained popularity because such techniques can deal with the underlying nonlinear phenomena. In this respect, characterization using nonlinear analysis typically employs the classical Correlation Dimension and the largest Lyapunov Exponent, as well as some regularity quantifiers that measure the predictability of the system. Regularity features mostly depend heavily on the correct choice of certain parameters. One of them, the delay time τ, is usually fixed at 1. Nonetheless, it has been stated that a unit τ cannot avoid linear correlation in the time series and hence may not correctly capture system nonlinearities. The present work therefore studies the influence of the τ parameter on the estimation of regularity features. Three τ estimates are considered: the baseline value 1; a τ based on the Average Automutual Information criterion; and a τ chosen from the embedding window. Test results obtained for pathological voice suggest that improved accuracy might be obtained by using a τ value different from 1, as it accounts for the underlying nonlinearities of the voice signal.
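
To make the role of the delay time concrete, here is a minimal sketch of time-delay embedding, the step in which the delay parameter (tau in the code) enters before any regularity feature is computed. The function name, the embedding dimension, and the toy series are assumptions for illustration; the abstract does not describe the authors' implementation, and the selection of the delay by Average Automutual Information or by the embedding window is not shown.

```python
def delay_embed(series, dim, tau):
    """Return delay vectors (x[i], x[i + tau], ..., x[i + (dim - 1) * tau])."""
    if dim < 1 or tau < 1:
        raise ValueError("dim and tau must be positive integers")
    span = (dim - 1) * tau  # samples covered beyond the first coordinate
    return [
        tuple(series[i + k * tau] for k in range(dim))
        for i in range(len(series) - span)
    ]


signal = [0, 1, 2, 3, 4, 5]  # toy series, not voice data
unit_delay = delay_embed(signal, dim=2, tau=1)
wider_delay = delay_embed(signal, dim=2, tau=2)
```

With a delay of 1 the two coordinates of each vector are adjacent samples, which in a densely sampled signal are strongly linearly correlated. A larger delay spreads the coordinates apart at the cost of fewer vectors.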

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Current text-to-speech systems are developed using studio-recorded speech in a neutral style or based on acted emotions. However, the proliferation of media sharing sites would allow developing a new generation of speech-based systems which could cope with spontaneous and styled speech. This paper proposes an architecture to deal with realistic recordings and carries out some experiments on unsupervised speaker diarization. In order to maximize the speaker purity of the clusters while keeping a high speaker coverage, the paper evaluates the F-measure of a diarization module, achieving high scores (>85%) especially when the clusters are longer than 30 seconds, even for the more spontaneous and expressive styles (such as talk shows or sports).
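
The trade-off between speaker purity and speaker coverage can be summarized in a single score. The sketch below assumes the F-measure is the harmonic mean of the two, which is the usual definition but is not spelled out in the abstract. The function name and the input values are hypothetical.

```python
def f_measure(purity, coverage):
    """Harmonic mean of speaker purity and speaker coverage, both in [0, 1]."""
    if not (0.0 <= purity <= 1.0 and 0.0 <= coverage <= 1.0):
        raise ValueError("purity and coverage must lie in [0, 1]")
    if purity + coverage == 0.0:
        return 0.0  # avoid division by zero when both terms vanish
    return 2.0 * purity * coverage / (purity + coverage)


# Hypothetical values, not results from the paper.
score = f_measure(0.9, 0.8)
```

Because the harmonic mean is pulled toward the smaller of its two inputs, a system cannot reach a high score by maximizing purity alone while covering little of each speaker's speech.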

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Teaching the proper use of the singing voice involves a great deal of knowledge of musical performance as well as of objective estimation techniques involving the use of air, muscles, room and body acoustics, and the tuning of an instrument as fine as the human voice. Although subjective evaluation and training is a very delicate task to be carried out only by expert singers, biomedical engineering may help by contributing well-founded methodologies developed for the study of voice pathology. The present work is a preliminary, exploratory study describing the performance of a student singer in a regular classroom from the point of view of vocal fold biomechanics. Estimates of biomechanical parameters obtained from the singing voice are given and their potential use is discussed.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

A case study of vocal fold paralysis treatment is described with the help of the voice quality analysis application BioMet®Phon. The case is that of a 40-year-old female patient who was diagnosed with vocal fold paralysis following a cardiopulmonary intervention that required intubation for 8 days and a subsequent tracheotomy for 15 days. The patient presented breathy and asthenic phonation, and dysphagia. Six main examinations were conducted over the full year that the treatment lasted, consisting of periodic reviews including video-endostroboscopy, voice analysis, and breathing function monitoring. The phoniatric treatment included 20 sessions of vocal rehabilitation, followed by an intracordal infiltration with Radiesse 8 months after the rehabilitation treatment started, followed by 6 further rehabilitation sessions. The videoendoscopy and the voice quality analysis indicate a substantial improvement in vocal function, with recovery in all the measures estimated (jitter, shimmer, mucosal wave contents, glottal closure, harmonic contents, and biomechanical function analysis). The paper reports the procedure followed and the results obtained by comparing the longitudinal progression of the treatment, illustrating the utility of voice quality analysis tools in speech therapy.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Teaching the proper use of the singing voice involves a great deal of knowledge of musical performance as well as of objective estimation techniques involving the use of air, muscles, room and body acoustics, and the tuning of an instrument as fine as the human voice. Although subjective evaluation and training is a very delicate task to be carried out only by expert singers, biomedical engineering may help by contributing well-founded methodologies developed for the study of voice pathology. The present work is a preliminary, exploratory study describing the performance of a student singer in a regular classroom from the point of view of vocal fold biomechanics. Estimates of biomechanical parameters obtained from the singing voice are given and their use in the classroom is discussed.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Voice therapies for muscle tension dysphonia in Germany need to be made more effective by applying intensive, manualized procedures and standardized assessment protocols. The same holds true for therapies for disturbed singers' voices. According to a Cochrane review on the effectiveness of therapies for functional dysphonia, neither direct nor indirect voice therapies alone are effective, but combinations of both elements are (Ruotsalainen et al., 2007).

Relevance:

20.00% 20.00%

Publisher:

Abstract:

The universal use of speech synthesis in different applications would require new voices to be developed easily, with little manual intervention. Given the amount of multimedia data available on the Internet and in the media, an interesting goal is to develop tools and methods for automatically building multi-style voices from them. A previous work outlined a methodology for building this kind of tool and presented preliminary experiments with a multi-style database. In this paper we investigate this task further and propose several improvements based on selecting the appropriate number of initial speakers, using or not using noise-reduction filters, using F0, and using a music detection algorithm. We show that the best system, which uses a music detection algorithm, reduces the precision error by 22.36% relative on the development set and by 39.64% relative on the test set compared with the baseline system, without degrading the figure of merit. The average precision on the test set is 90.62%, ranging from 76.18% for news reports to 99.93% for weather reports.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Acoustic parameters are frequently used to assess the presence of pathologies in the human voice. Many of them have proved useful, but in some cases their results could be optimized by selecting appropriate working margins. In this study two indices, CIL and RALA, obtained from Modulation Spectra, are described and tuned using different frame lengths and frequency ranges to maximize the AUC in normal-versus-pathological voice detection. After the tuning process, the AUC reaches 0.96 and 0.95 for CIL and RALA respectively, an improvement of 16% and 12% respectively over the typical tuning based only on frame length selection.
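
For readers unfamiliar with the metric being maximized, the sketch below computes the AUC as the probability that a randomly chosen pathological voice receives a higher index value than a randomly chosen normal voice, counting ties as one half. It does not implement CIL or RALA. The function name and the scores are hypothetical, and it assumes that higher index values indicate pathology.

```python
def auc(pathological_scores, normal_scores):
    """Area under the ROC curve via pairwise comparison of the two groups."""
    if not pathological_scores or not normal_scores:
        raise ValueError("both groups need at least one score")
    wins = 0.0
    for p in pathological_scores:
        for n in normal_scores:
            if p > n:
                wins += 1.0   # pair ranked correctly
            elif p == n:
                wins += 0.5   # tie counts as half
    return wins / (len(pathological_scores) * len(normal_scores))


# Hypothetical index values, not data from the study.
example_auc = auc([0.9, 0.8, 0.4], [0.5, 0.3, 0.2])
```

Tuning frame length and frequency range, as the abstract describes, would amount to recomputing the index under each setting and keeping the setting with the highest value of this quantity.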