Digital Library

28 results for seminar-based training in Cambridge University Engineering Department Publications Database

Kernelized log linear models for continuous speech recognition

Relevance: 80.00%

Abstract:

Large margin criteria and discriminative models are two effective improvements for HMM-based speech recognition. This paper proposes a large margin trained log linear model with kernels for continuous speech recognition (CSR). To avoid explicit computation in the high-dimensional feature space and to achieve nonlinear decision boundaries, a kernel-based training and decoding framework is proposed. To make the system robust to noise, a kernel adaptation scheme is also presented. Previous work in this area is extended in two directions. First, most kernels for CSR focus on measuring the similarity between two observation sequences, whereas the proposed joint kernels define a similarity between two observation-label sequence pairs at the sentence level. Second, this paper addresses how to efficiently employ kernels in large margin training and decoding with lattices. To the best of our knowledge, this is the first attempt at using large margin kernel-based log linear models for CSR. The model is evaluated on a noise-corrupted continuous digit task: AURORA 2.0. © 2013 IEEE.

Training a parametric-based logF0 model with the minimum generation error criterion

Relevance: 40.00%

Trajectory training considering global variance for HMM-based speech synthesis

Relevance: 40.00%

Training a real-world POMDP-based dialog system

Relevance: 40.00%

The training algorithm based on variational approximation for separable 2D-HMM

Relevance: 40.00%

Lattice-based discriminative training for large vocabulary speech recognition

Relevance: 40.00%

Context Adaptive Training with Factorized Decision Trees for HMM-Based Speech Synthesis

Relevance: 40.00%

Context adaptive training with factorized decision trees for HMM-based statistical parametric speech synthesis

Relevance: 40.00%

Factor analysis based VTS discriminative adaptive training

Relevance: 40.00%

Abstract:

Vector Taylor Series (VTS) model-based compensation is a powerful approach for noise-robust speech recognition. An important extension to this approach is VTS adaptive training (VAT), which allows canonical models to be estimated on diverse noise-degraded training data. These canonical models can be estimated using EM-based approaches, allowing simple extensions to discriminative VAT (DVAT). However, to ensure a diagonal corrupted speech covariance matrix, the Jacobian (loading matrix) relating the noise and clean speech is diagonalised. In this work, an approach for yielding optimal diagonal loading matrices, based on minimising the expected KL-divergence between the diagonal loading matrix and "correct" distributions, is proposed. The performance of DVAT using the standard and optimal diagonalisation was evaluated on both in-car collected data and the Aurora4 task. © 2012 IEEE.

Speech factorization for HMM-TTS based on cluster adaptive training

Relevance: 40.00%

Model-based approaches to adaptive training in reverberant environments

Relevance: 40.00%

Developing a process for formulating a postponement-based supply chain strategy

Relevance: 30.00%

What the communicated-based models of design reveal and conceal

Relevance: 30.00%

A knowledge-based approach to evaluating manufacturing technology investments

Relevance: 30.00%

DES/MILES simulations with differential equation based wall distances

Relevance: 30.00%
