Biblioteca Digital

96 resultados para Decoding Speech Prosody

em Queensland University of Technology - ePrints Archive

The effect of language models on phonetic decoding for spoken term detection

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Spoken term detection (STD) popularly involves performing word or sub-word level speech recognition and indexing the result. This work challenges the assumption that improved speech recognition accuracy implies better indexing for STD. Using an index derived from phone lattices, this paper examines the effect of language model selection on the relationship between phone recognition accuracy and STD accuracy. Results suggest that language models usually improve phone recognition accuracy but their inclusion does not always translate to improved STD accuracy. The findings suggest that using phone recognition accuracy to measure the quality of an STD index can be problematic, and highlight the need for an alternative that is more closely aligned with the goals of the specific detection task.

Spoken term detection using fast phonetic decoding

Relevância:

30.00% 30.00%

Publicador:

Resumo:

While spoken term detection (STD) systems based on word indices provide good accuracy, there are several practical applications where it is infeasible or too costly to employ an LVCSR engine. An STD system is presented, which is designed to incorporate a fast phonetic decoding front-end and be robust to decoding errors whilst still allowing for rapid search speeds. This goal is achieved through mono-phone open-loop decoding coupled with fast hierarchical phone lattice search. Results demonstrate that an STD system that is designed with the constraint of a fast and simple phonetic decoding front-end requires a compromise to be made between search speed and search accuracy.

Adaptive Fusion of Speech and Lip Information for Robust Speaker identification

Relevância:

20.00% 20.00%

Publicador:

Application of the Trended Hidden Markov Model to Speech Synthesis

Relevância:

20.00% 20.00%

Publicador:

Speech Enhancement by Formant Sharpening in the Cepstral Domain

Relevância:

20.00% 20.00%

Publicador:

Automatic Speech Segmentation with HMM

Relevância:

20.00% 20.00%

Publicador:

Multilingual Phone Clustering for Recognition of Spontaneous Indonesian Speech Utilising Pronunciation Modelling Techniques

Relevância:

20.00% 20.00%

Publicador:

Characterising Learners: Speech-language Difficulties and ESL

Relevância:

20.00% 20.00%

Publicador:

Using a Free-Parts Representation for Visual Speech Recognition

Relevância:

20.00% 20.00%

Publicador:

A Hybrid LP-Harmonics Model for Low Bit-Rate Speech Compression with Natural Quality

Relevância:

20.00% 20.00%

Publicador:

Adaptive Parameter Compensation for Robust Hands-Free Speech Recognition Using a Dual Beamforming Microphone Array

Relevância:

20.00% 20.00%

Publicador:

An investigation of HMM classifier combination ctrategies for improved audio-visual speech recognition

Relevância:

20.00% 20.00%

Publicador:

Pseudo-Syntactic Language Modelling for Disfluent Speech Recognition

Relevância:

20.00% 20.00%

Publicador:

Cross-Lingual Pronunciation Modelling for Indonesian Speech

Relevância:

20.00% 20.00%

Publicador:

Trainable Speech Synthesis with Trended Hidden Markov Models

Relevância:

20.00% 20.00%

Publicador:

«
1
2
3
4
5
6
7
»