913 resultados para Speech genre
Resumo:
We present a new method for the enhancement of speech. The method is designed for scenarios in which targeted speaker enrollment as well as system training within the typical noise environment are feasible. The proposed procedure is fundamentally different from most conventional and state-of-the-art denoising approaches. Instead of filtering a distorted signal we are resynthesizing a new “clean” signal based on its likely characteristics. These characteristics are estimated from the distorted signal. A successful implementation of the proposed method is presented. Experiments were performed in a scenario with roughly one hour of clean speech training data. Our results show that the proposed method compares very favorably to other state-of-the-art systems in both objective and subjective speech quality assessments. Potential applications for the proposed method include jet cockpit communication systems and offline methods for the restoration of audio recordings.
Resumo:
This study aimed to assess speech perception and communication skills in adolescents between ages 8 and 18 that received cochlear implants for pre- and peri-lingual deafness.
Resumo:
Boris Pasternak’s poemy are acutely self-conscious of their place in the epic tradition. Lieutenant Schmidt (LS) represents one attempt at exploring the parameters of the poema itself as the poet makes a “difficult” transition from “lyric thinking” to “the epic.” In this article I examine this transition against a contemporaneous example in the genre, Tsvetaeva’s Poema of the End (PE). In LS, structural elements of the poema are counterposed to those of PE. While PE amplifies the individual voice, LS muffles what is personal for the sake of the public voice. While PE is atemporal, LS is historical. While PE unfolds on symbolic planes, with elements of plot kept to a bare minimum (a single moment of separation), LS is a plot-driven account based on concrete, documentary material. Finally, while PE is an “overgrown lyric”—representing the “lyric thinking” that Pasternak hopes to transcend— LS is an exploration of the possibilities that a more traditional model of the poema can offer. Although in the present analysis I draw on several theories of poetic genres, this is by no means an exhaustive study of epic versus lyric forms of poetry. Instead, my analysis focuses on those structural and thematic features of the poema that the poets themselves perceived as central to their texts. Pasternak, for his part, develops the structure and thematics of his poema in ways that are inspired by PE, but also, as we will see, in more significant ways, contrast with it.
Resumo:
A new idea for waveform coding using vector quantisation (VQ) is introduced. This idea makes it possible to deal with codevectors much larger than before for a fixed bit per sample rate. Also a solution to the matching problem (inherent in the present context) in the &-norm describing a measure of neamess is presented. The overall computational complexity of this solution is O(n3 log, n). Sample results are presented to demonstrate the advantage of using this technique in the context of coding of speech waveforms.
Resumo:
We present a new approach for corpus-based speech enhancement that significantly improves over a method published by Xiao and Nickel in 2010. Corpus-based enhancement systems do not merely filter an incoming noisy signal, but resynthesize its speech content via an inventory of pre-recorded clean signals. The goal of the procedure is to perceptually improve the sound of speech signals in background noise. The proposed new method modifies Xiao's method in four significant ways. Firstly, it employs a Gaussian mixture model (GMM) instead of a vector quantizer in the phoneme recognition front-end. Secondly, the state decoding of the recognition stage is supported with an uncertainty modeling technique. With the GMM and the uncertainty modeling it is possible to eliminate the need for noise dependent system training. Thirdly, the post-processing of the original method via sinusoidal modeling is replaced with a powerful cepstral smoothing operation. And lastly, due to the improvements of these modifications, it is possible to extend the operational bandwidth of the procedure from 4 kHz to 8 kHz. The performance of the proposed method was evaluated across different noise types and different signal-to-noise ratios. The new method was able to significantly outperform traditional methods, including the one by Xiao and Nickel, in terms of PESQ scores and other objective quality measures. Results of subjective CMOS tests over a smaller set of test samples support our claims.
Resumo:
Arts speech therapy (AST) is a therapeutic method within complementary medicine and has been practiced for decades for various medical conditions. It comprises listening and the recitation of different forms of speech exercises under the guidance of a licensed speech therapist. The aim of our study was to noninvasively investigate whether different types of recitation influence hemodynamics and oxygenation in the brain and skeletal leg muscle using near-infrared spectroscopy (NIRS). Seventeen healthy volunteers (eight men and nine women, mean age ± standard deviation 35.6 ± 12.7 years) were enrolled in the study. Each subject was measured three times on different days with the different types of recitation: hexameter, alliteration, and prose verse. Before, during, and after recitation, relative concentration changes of oxyhemoglobin (Δ[O2Hb]), deoxyhemoglobin (Δ[HHb]), total hemoglobin (Δ[tHb]), and tissue oxygenation saturation (StO2) were measured in the brain and skeletal leg muscle using a NIRS device. The study was performed with a randomized crossover design. Significant concentration changes were found during recitation of all verses, with mainly a decrease in Δ[O2Hb] and ΔStO2 in the brain, and an increase in Δ[O2Hb] and Δ[tHb] in the leg muscle during recitation. After the recitations, significant changes were mainly increases of Δ[HHb] and Δ[tHb] in the calf muscle. The Mayer wave spectral power (MWP) was also significantly affected, i.e., mainly the MWP of the Δ[O2Hb] and Δ[tHb] increased in the brain during recitation of hexameter and prose verse. The changes in MWP were also significantly different between hexameter and alliteration, and hexameter and prose. Possible physiological explanations for these changes are discussed. A probable reason is a different effect of recitations on the sympathetic nervous system. In conclusion, these changes show that AST has relevant effects on the hemodynamics and oxygenation of the brain and muscle.