Biblioteca Digital

932 resultados para Speech and pioneering sports Colima

Decision tree design and applications in speech processing

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The design and operation of the minimum cost classifier, where the total cost is the sum of the measurement cost and the classification cost, is computationally complex. Noting the difficulties associated with this approach, decision tree design directly from a set of labelled samples is proposed in this paper. The feature space is first partitioned to transform the problem to one of discrete features. The resulting problem is solved by a dynamic programming algorithm over an explicitly ordered state space of all outcomes of all feature subsets. The solution procedure is very general and is applicable to any minimum cost pattern classification problem in which each feature has a finite number of outcomes. These techniques are applied to (i) voiced, unvoiced, and silence classification of speech, and (ii) spoken vowel recognition. The resulting decision trees are operationally very efficient and yield attractive classification accuracies.

Speaker verification based on the fusion of speech acoustics and inverted articulatory signals

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We propose apractical, feature-level and score-level fusion approach by combining acoustic and estimated articulatory information for both text independent and text dependent speaker verification. From a practical point of view, we study how to improve speaker verification performance by combining dynamic articulatory information with the conventional acoustic features. On text independent speaker verification, we find that concatenating articulatory features obtained from measured speech production data with conventional Mel-frequency cepstral coefficients (MFCCs) improves the performance dramatically. However, since directly measuring articulatory data is not feasible in many real world applications, we also experiment with estimated articulatory features obtained through acoustic-to-articulatory inversion. We explore both feature level and score level fusion methods and find that the overall system performance is significantly enhanced even with estimated articulatory features. Such a performance boost could be due to the inter-speaker variation information embedded in the estimated articulatory features. Since the dynamics of articulation contain important information, we included inverted articulatory trajectories in text dependent speaker verification. We demonstrate that the articulatory constraints introduced by inverted articulatory features help to reject wrong password trials and improve the performance after score level fusion. We evaluate the proposed methods on the X-ray Microbeam database and the RSR 2015 database, respectively, for the aforementioned two tasks. Experimental results show that we achieve more than 15% relative equal error rate reduction for both speaker verification tasks. (C) 2015 Elsevier Ltd. All rights reserved.

Training and adapting MLP features for Arabic speech recognition

Relevância:

100.00% 100.00%

Publicador:

Lattice segmentation and support vector machines for large vocabulary continuous speech recognition

Relevância:

100.00% 100.00%

Publicador:

Generating and evaluating segmentations for automatic speech recognition of conversational telephone speech

Relevância:

100.00% 100.00%

Publicador:

Large vocabulary speech recognition for read and broadcast Czech

Relevância:

100.00% 100.00%

Publicador:

Speech understanding and spoken dialogue systems

Relevância:

100.00% 100.00%

Publicador:

Corpus-based methods in language and speech processing

Relevância:

100.00% 100.00%

Publicador:

Experiments in speaker normalisation and adaptation for large vocabulary speech recognition

Relevância:

100.00% 100.00%

Publicador:

Continuous speech recognition in noise using spectral subtraction and HMM adaptation

Relevância:

100.00% 100.00%

Publicador:

The auditory processing and recognition of speech

Relevância:

100.00% 100.00%

Publicador:

Particle methods for Bayesian modeling and enhancement of speech signals

Relevância:

100.00% 100.00%

Publicador:

Speech recognition evaluation: a review of the U.S. CSR and LVCSR programmes

Relevância:

100.00% 100.00%

Publicador:

Robust speech recognition in additive and convolutional noise using parallel model combination

Relevância:

100.00% 100.00%

Publicador:

A comparison of the Boltzmann machine and the back propagation network as recognizers of static speech patterns

Relevância:

100.00% 100.00%

Publicador:

«
1
2
...
6
7
8
9
10
11
12
...
62
63
»