Digital Library

39 results for "Voiced or unvoiced classification" in the Cambridge University Engineering Department Publications Database

Continuous F0 in the source-excitation generation for HMM-based TTS: Do we need voiced/unvoiced classification?

Relevance: 100.00%

Abstract:

Most HMM-based TTS systems use a hard voiced/unvoiced classification to produce a discontinuous F0 signal, which is used to generate the source excitation. When a mixed source excitation is used, this decision can be based on two different sources of information: the state-specific MSD-prior of the F0 models, and/or the frame-specific features generated by the aperiodicity model. This paper examines the meaning of these variables in the synthesis process, their interaction, and how they affect the perceived quality of the generated speech. The results of several perceptual experiments show that when using mixed excitation, subjects consistently prefer samples with very few or no false unvoiced errors, whereas a reduction in the rate of false voiced errors does not produce any perceptual improvement. This suggests that rather than using any form of hard voiced/unvoiced classification, e.g., the MSD-prior, it is better for synthesis to use a continuous F0 signal and rely on the frame-level soft voiced/unvoiced decision of the aperiodicity model. © 2011 IEEE.
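As an illustration of the contrast the abstract draws, the sketch below builds a mixed excitation from a continuous F0 contour and a per-frame aperiodicity weight, then repeats the process with the same weights thresholded into a hard voiced/unvoiced decision. It is a minimal toy under stated assumptions, not the paper's method: the function `mixed_excitation`, the single full-band aperiodicity weight per frame, the 0.5 threshold, and all parameter values are hypothetical.

```python
import math
import random


def mixed_excitation(f0_hz, aperiodicity, sample_rate=16000, frame_len=80, seed=0):
    """Build an excitation signal from per-frame F0 and aperiodicity.

    f0_hz        -- continuous F0 contour, one positive value per frame (assumed)
    aperiodicity -- per-frame noise weight in [0, 1]; 0 means fully periodic
    """
    rng = random.Random(seed)
    phase = 0.0  # fraction of the current pitch period already elapsed
    samples = []
    for f0, ap in zip(f0_hz, aperiodicity):
        for _ in range(frame_len):
            phase += f0 / sample_rate
            pulse = 0.0
            if phase >= 1.0:
                phase -= 1.0
                # Scale so the pulse train has roughly unit power.
                pulse = math.sqrt(sample_rate / f0)
            noise = rng.gauss(0.0, 1.0)
            # Soft decision: blend periodic and noise components per frame.
            samples.append((1.0 - ap) * pulse + ap * noise)
    return samples


# Ten frames of a flat 110 Hz contour; frame 5 is only weakly voiced.
f0 = [110.0] * 10
soft_ap = [0.1, 0.1, 0.1, 0.1, 0.1, 0.6, 0.1, 0.1, 0.1, 0.1]

# Hard classification (hypothetical 0.5 threshold): frame 5 is labelled
# unvoiced, so any pitch pulse falling inside it is discarded.
hard_ap = [1.0 if ap > 0.5 else 0.0 for ap in soft_ap]

soft_signal = mixed_excitation(f0, soft_ap)
hard_signal = mixed_excitation(f0, hard_ap)
```

With the soft weights, the weakly voiced frame keeps an attenuated pitch pulse mixed with noise; with the thresholded weights, the same frame becomes pure noise, which is the kind of false unvoiced error the listeners in the study penalised.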

Adaptive weighting of microphone arrays for distant-talking F0 and voiced/unvoiced estimation

Relevance: 50.00%

Joint modelling of voicing label and continuous F0 for HMM based speech synthesis

Relevance: 40.00%

Abstract:

Fundamental frequency, or F0, is critical for high-quality HMM-based speech synthesis. Traditionally, F0 values are considered to depend on a binary voicing decision such that they are continuous in voiced regions and undefined in unvoiced regions. The multi-space distribution HMM (MSDHMM) has been used for modelling the discontinuous F0. Recently, a continuous F0 modelling framework has been proposed and shown to be effective, where continuous F0 observations are assumed to always exist and voicing labels are explicitly modelled by an independent stream. In this paper, a refined continuous F0 modelling approach is proposed. Here, F0 values are assumed to be dependent on voicing labels and both are jointly modelled in a single stream. Due to the enforced dependency, the new method can effectively reduce the voicing classification error. Subjective listening tests also demonstrate that the new approach can yield significant improvements in the naturalness of the synthesised speech. A dynamic random unvoiced F0 generation method is also investigated. Experiments show that it has a significant effect on the quality of synthesised speech. © 2011 IEEE.

Regular Positive-Real Functions and the Classification of Transformerless Series-Parallel Networks

Relevance: 30.00%

Probabilistic graphical models for semi-supervised traffic classification

Relevance: 30.00%

Abstract:

Traffic classification using machine learning continues to be an active research area. The majority of work in this area uses off-the-shelf machine learning tools and treats them as black-box classifiers. This approach turns all the modelling complexity into a feature selection problem. In this paper, we build a problem-specific solution to the traffic classification problem by designing a custom probabilistic graphical model. Graphical models are a modular framework to design classifiers which incorporate domain-specific knowledge. More specifically, our solution introduces semi-supervised learning which means we learn from both labelled and unlabelled traffic flows. We show that our solution performs competitively compared to previous approaches while using less data and simpler features. Copyright © 2010 ACM.

On the classification of series-parallel electrical and mechanical networks

Relevance: 30.00%

Spoken language understanding from unaligned data using discriminative classification models

Relevance: 30.00%

Probabilistic modelling of F0 in unvoiced regions in HMM-based speech synthesis

Relevance: 30.00%

Exploring spaces of system architectures using constraint-based classification and Euler diagrams

Relevance: 30.00%

Outlier robust Gaussian process classification

Relevance: 30.00%

Component classification: a change perspective

Relevance: 30.00%

A classification of uncertainty for early product and system design

Relevance: 30.00%

An empirical classification of visual methods for management: results of picture sorting experiments with managers and students

Relevance: 30.00%

Tensor canonical correlation analysis for action classification

Relevance: 30.00%

Semantic annotation to support automatic taxonomy classification

Relevance: 30.00%
