Biblioteca Digital

989 resultados para continuous speech

Detection of the closure-burst transitions of stops and affricates in continuous speech using the plosion index

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Automatic and accurate detection of the closure-burst transition events of stops and affricates serves many applications in speech processing. A temporal measure named the plosion index is proposed to detect such events, which are characterized by an abrupt increase in energy. Using the maxima of the pitch-synchronous normalized cross correlation as an additional temporal feature, a rule-based algorithm is designed that aims at selecting only those events associated with the closure-burst transitions of stops and affricates. The performance of the algorithm, characterized by receiver operating characteristic curves and temporal accuracy, is evaluated using the labeled closure-burst transitions of stops and affricates of the entire TIMIT test and training databases. The robustness of the algorithm is studied with respect to global white and babble noise as well as local noise using the TIMIT test set and on telephone quality speech using the NTIMIT test set. For these experiments, the proposed algorithm, which does not require explicit statistical training and is based on two one-dimensional temporal measures, gives a performance comparable to or better than the state-of-the-art methods. In addition, to test the scalability, the algorithm is applied on the Buckeye conversational speech corpus and databases of two Indian languages. (C) 2014 Acoustical Society of America.

Estimation of voice-onset time in continuous speech using temporal measures

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This paper proposes an automatic acoustic-phonetic method for estimating voice-onset time of stops. This method requires neither transcription of the utterance nor training of a classifier. It makes use of the plosion index for the automatic detection of burst onsets of stops. Having detected the burst onset, the onset of the voicing following the burst is detected using the epochal information and a temporal measure named the maximum weighted inner product. For validation, several experiments are carried out on the entire TIMIT database and two of the CMU Arctic corpora. The performance of the proposed method compares well with three state-of-the-art techniques. (C) 2014 Acoustical Society of America

Minimum Bayes risk estimation and decoding in large vocabulary continuous speech recognition

Relevância:

100.00% 100.00%

Publicador:

Lattice segmentation and support vector machines for large vocabulary continuous speech recognition

Relevância:

100.00% 100.00%

Publicador:

Adaptation of precision matrix models on large vocabulary continuous speech recognition

Relevância:

100.00% 100.00%

Publicador:

Temporally varying model parameters for large vocabulary continuous speech recognition

Relevância:

100.00% 100.00%

Publicador:

Precision matrix modelling for large vocabulary continuous speech recognition

Relevância:

100.00% 100.00%

Publicador:

Basis superposition precision matrix modeling for large vocabulary continuous speech recognition

Relevância:

100.00% 100.00%

Publicador:

Minimum Bayes risk estimation and decoding in large vocabulary continuous speech recognition

Relevância:

100.00% 100.00%

Publicador:

Pinched lattice minimum Bayes risk discriminative training for large vocabulary continuous speech recognition

Relevância:

100.00% 100.00%

Publicador:

Support vector machines for segmental minimum Bayes risk decoding of continuous speech

Relevância:

100.00% 100.00%

Publicador:

On large vocabulary continuous speech recognition of highly inflectional language - Czech

Relevância:

100.00% 100.00%

Publicador:

Improved discriminative training techniques for large vocabulary continuous speech recognition

Relevância:

100.00% 100.00%

Publicador:

Statistical modelling in continuous speech recognition (CSR)

Relevância:

100.00% 100.00%

Publicador:

Modelling sub-phone insertions and deletions in continuous speech recognition

Relevância:

100.00% 100.00%

Publicador:

«
1
2
3
4
5
6
7
8
...
65
66
»