Biblioteca Digital

108 resultados para Speech journalistic unified

Speech Recognition with unknown partial feature corruption - a review of the union model

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This paper provides a summary of our studies on robust speech recognition based on a new statistical approach – the probabilistic union model. We consider speech recognition given that part of the acoustic features may be corrupted by noise. The union model is a method for basing the recognition on the clean part of the features, thereby reducing the effect of the noise on recognition. To this end, the union model is similar to the missing feature method. However, the two methods achieve this end through different routes. The missing feature method usually requires the identity of the noisy data for noise removal, while the union model combines the local features based on the union of random events, to reduce the dependence of the model on information about the noise. We previously investigated the applications of the union model to speech recognition involving unknown partial corruption in frequency band, in time duration, and in feature streams. Additionally, a combination of the union model with conventional noise-reduction techniques was studied, as a means of dealing with a mixture of known or trainable noise and unknown unexpected noise. In this paper, a unified review, in the context of dealing with unknown partial feature corruption, is provided into each of these applications, giving the appropriate theory and implementation algorithms, along with an experimental evaluation.

Development of a Java-based Unified and Flexible Natural Language Discourse System

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This paper outlines the design and development of a Java-based, unified and flexible natural language dialogue system that enables users to interact using natural language, e.g. speech. A number of software development issues are considered with the aim of designing an architecture that enables different discourse components to be readily and flexibly combined in a manner that permits information to be easily shared. Use of XML schemas assists this component interaction. The paper describes how a range of Java language features were employed to support the development of the architecture, providing an illustration of how a modern programming language makes tractable the development of a complex dialogue system.

Irish English Speech Acquisition

Relevância:

20.00% 20.00%

Publicador:

Transcribing speech: The segmental and prosodic layers

Relevância:

20.00% 20.00%

Publicador:

Can patients with chronic schizophrenia express emotion? A speech analysis

Relevância:

20.00% 20.00%

Publicador:

Speech in the Process of Becoming Bored

Relevância:

20.00% 20.00%

Publicador:

Acoustic Correlates of Emotion Dimensions in View of Speech Synthesis

Relevância:

20.00% 20.00%

Publicador:

Emotional speech: towards a new generation of databases

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Research on speech and emotion is moving from a period of exploratory research into one where there is a prospect of substantial applications, notably in human-computer interaction. Progress in the area relies heavily on the development of appropriate databases. This paper addresses the issues that need to be considered in developing databases of emotional speech, and shows how the challenge of developing apropriate databases is being addressed in three major recent projects - the Belfast project, the Reading-Leeds project and the CREST-ESP project. From these and other studies the paper draws together the tools and methods that have been developed, addresses the problems that arise and indicates the future directions for the development of emotional speech databases.

Subband Correlation and Robust Speech Recognition

Relevância:

20.00% 20.00%

Publicador:

Robust Speech recognition using probabilistic union models

Relevância:

20.00% 20.00%

Publicador:

Noise compensation for speech recognition with arbitray additive noise

Relevância:

20.00% 20.00%

Publicador:

Modelling Sub-Band Correlation For Noise-Robust Speech Recognition

Relevância:

20.00% 20.00%

Publicador:

A New Posterior Based Audio-Visual Integration Method for Robust Speech Recognition

Relevância:

20.00% 20.00%

Publicador:

Describing the emotional states that are expressed in speech.

Relevância:

20.00% 20.00%

Publicador:

Union: a new approach for combining sub-band observations for noisy speech recognition

Relevância:

20.00% 20.00%

Publicador:

«
1
2
3
4
5
6
7
8
»