Digital Library

72 results for "nationalism and language" in the Cambridge University Engineering Department Publications Database.

Dasher - A data entry interface using continuous gestures and language models

Relevance: 100.00%

Abstract:

Existing devices for communicating information to computers are bulky, slow to use, or unreliable. Dasher is a new interface incorporating language modelling and driven by continuous two-dimensional gestures, e.g. a mouse, touchscreen, or eye-tracker. Tests have shown that this device can be used to enter text at a rate of up to 34 words per minute, compared with typical ten-finger keyboard typing of 40-60 words per minute. Although the interface is slower than a conventional keyboard, it is small and simple, and could be used on personal data assistants and by motion-impaired computer users.

Erratum: Language modelling for Russian and English using words and classes (Computer Speech and Language (2003) 17 (87-104))

Relevance: 100.00%

Statistical parametric speech synthesis based on speaker and language factorization

Relevance: 100.00%

Abstract:

An increasingly common scenario in building speech synthesis and recognition systems is training on inhomogeneous data. This paper proposes a new framework for estimating hidden Markov models on data containing both multiple speakers and multiple languages. The proposed framework, speaker and language factorization, attempts to factorize speaker-/language-specific characteristics in the data and then model them using separate transforms. Language-specific factors in the data are represented by transforms based on cluster mean interpolation with cluster-dependent decision trees. Acoustic variations caused by speaker characteristics are handled by transforms based on constrained maximum-likelihood linear regression. Experimental results on statistical parametric speech synthesis show that the proposed framework enables data from multiple speakers in different languages to be used to: train a synthesis system; synthesize speech in a language using speaker characteristics estimated in a different language; and adapt to a new language. © 2012 IEEE.

Hidden Markov models in speech and language processing

Relevance: 100.00%

Corpus-based methods in language and speech processing

Relevance: 100.00%

Language modelling for Russian and English using words and classes

Relevance: 100.00%

Use of contexts in language model interpolation and adaptation

Relevance: 100.00%

Abstract:

Language models (LMs) are often constructed by building multiple individual component models that are combined using context independent interpolation weights. By tuning these weights, using either perplexity or discriminative approaches, it is possible to adapt LMs to a particular task. This paper investigates the use of context dependent weighting in both interpolation and test-time adaptation of language models. Depending on the previous word contexts, a discrete history weighting function is used to adjust the contribution from each component model. As this dramatically increases the number of parameters to estimate, robust weight estimation schemes are required. Several approaches are described in this paper. The first approach is based on MAP estimation where interpolation weights of lower order contexts are used as smoothing priors. The second approach uses training data to ensure robust estimation of LM interpolation weights. This can also serve as a smoothing prior for MAP adaptation. A normalized perplexity metric is proposed to handle the bias of the standard perplexity criterion to corpus size. A range of schemes to combine weight information obtained from training data and test data hypotheses are also proposed to improve robustness during context dependent LM adaptation. In addition, a minimum Bayes' risk (MBR) based discriminative training scheme is also proposed. An efficient weighted finite state transducer (WFST) decoding algorithm for context dependent interpolation is also presented. The proposed technique was evaluated using a state-of-the-art Mandarin Chinese broadcast speech transcription task. Character error rate (CER) reductions up to 7.3% relative were obtained as well as consistent perplexity improvements. © 2012 Elsevier Ltd. All rights reserved.
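
The idea of context dependent interpolation weights smoothed by a prior can be illustrated with a small sketch. This is not the paper's implementation: the function names, the `tau` prior strength, and the Dirichlet-style MAP update are assumptions chosen for illustration, and a single context independent weight vector stands in for the lower order context prior mentioned in the abstract.

```python
from collections import defaultdict


def interpolate(component_probs, weights):
    """Linear interpolation: sum over m of weight_m * P_m(w | h)."""
    return sum(lam * p for lam, p in zip(weights, component_probs))


def map_context_weights(events, prior_weights, tau=10.0, iterations=20):
    """Estimate one weight vector per history context with EM.

    events: list of (context, [P_1(w|h), ..., P_M(w|h)]) pairs, where each
        inner list holds the component model probabilities of the observed
        word (assumed strictly positive for at least one component).
    prior_weights: context independent weights (summing to 1) used as the
        smoothing prior; tau controls how strongly they are trusted.
    """
    num_models = len(prior_weights)
    weights = defaultdict(lambda: list(prior_weights))
    for _ in range(iterations):
        # E-step: accumulate each component's posterior share per context.
        counts = defaultdict(lambda: [0.0] * num_models)
        for context, probs in events:
            lam = weights[context]
            total = interpolate(probs, lam)
            for m in range(num_models):
                counts[context][m] += lam[m] * probs[m] / total
        # M-step (hypothetical MAP form): shrink counts towards the prior.
        for context, c in counts.items():
            denom = sum(c) + tau
            weights[context] = [
                (c[m] + tau * prior_weights[m]) / denom
                for m in range(num_models)
            ]
    return dict(weights)


if __name__ == "__main__":
    # Toy data: component 0 predicts well after "the", component 1 after "of".
    toy_events = [("the", [0.9, 0.1])] * 50 + [("of", [0.1, 0.9])] * 50
    learned = map_context_weights(toy_events, [0.5, 0.5], tau=1.0)
    for context, lam in learned.items():
        print(context, [round(x, 3) for x in lam])
```

In this sketch, contexts with many observations move away from the prior, while rarely seen contexts stay close to it, which is the robustness property the abstract attributes to using smoothing priors.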

Speech understanding and spoken dialogue systems

Relevance: 90.00%

Acoustic source localisation and tracking using track before detect

Relevance: 90.00%

Combining derivative and parametric kernels for speaker verification

Relevance: 90.00%

HMM word and phrase alignment for statistical machine translation

Relevance: 90.00%

Sparse linear regression with structured priors and application to denoising of musical audio

Relevance: 90.00%

Bayesian adaptive inference and adaptive training

Relevance: 90.00%

Transformation streams and the HMM error model

Relevance: 90.00%

Variable-length category n-gram language models

Relevance: 90.00%
