Biblioteca Digital

986 resultados para speaker dependencies

Robust speaker verification via fusion of speech and lip modalities

Relevância:

70.00% 70.00%

Publicador:

Resumo:

This paper investigates the use of lip information, in conjunction with speech information, for robust speaker verification in the presence of background noise. It has been previously shown in our own work, and in the work of others, that features extracted from a speaker's moving lips hold speaker dependencies which are complementary with speech features. We demonstrate that the fusion of lip and speech information allows for a highly robust speaker verification system which outperforms the performance of either sub-system. We present a new technique for determining the weighting to be applied to each modality so as to optimize the performance of the fused system. Given a correct weighting, lip information is shown to be highly effective for reducing the false acceptance and false rejection error rates in the presence of background noise

The use of speech and lip modalities for robust speaker verification under adverse conditions

Relevância:

70.00% 70.00%

Publicador:

Resumo:

Investigates the use of lip information, in conjunction with speech information, for robust speaker verification in the presence of background noise. We have previously shown (Int. Conf. on Acoustics, Speech and Signal Proc., vol. 6, pp. 3693-3696, May 1998) that features extracted from a speaker's moving lips hold speaker dependencies which are complementary with speech features. We demonstrate that the fusion of lip and speech information allows for a highly robust speaker verification system which outperforms either subsystem individually. We present a new technique for determining the weighting to be applied to each modality so as to optimize the performance of the fused system. Given a correct weighting, lip information is shown to be highly effective for reducing the false acceptance and false rejection error rates in the presence of background noise

Modeling long-range dependencies in speech data for text-independent speaker recognition

Relevância:

40.00% 40.00%

Publicador:

Adaptive Fusion of Speech and Lip Information for Robust Speaker identification

Relevância:

20.00% 20.00%

Publicador:

Unsupervised Evaluation of Speaker Verification Systems

Relevância:

20.00% 20.00%

Publicador:

Speaker identification using higher order spectral phase features and their effectiveness vis-a-vis Mel-Cepstral features

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The effectiveness of higher-order spectral (HOS) phase features in speaker recognition is investigated by comparison with Mel Cepstral features on the same speech data. HOS phase features retain phase information from the Fourier spectrum unlikeMel–frequency Cepstral coefficients (MFCC). Gaussian mixture models are constructed from Mel– Cepstral features and HOS features, respectively, for the same data from various speakers in the Switchboard telephone Speech Corpus. Feature clusters, model parameters and classification performance are analyzed. HOS phase features on their own provide a correct identification rate of about 97% on the chosen subset of the corpus. This is the same level of accuracy as provided by MFCCs. Cluster plots and model parameters are compared to show that HOS phase features can provide complementary information to better discriminate between speakers.

Phonetic and Lexical Speaker Recognition in Reduced Training Scenarios

Relevância:

20.00% 20.00%

Publicador:

Revisiting Carl Bildt's Impostor: Would a Speaker Verification System Foil Him?

Relevância:

20.00% 20.00%

Publicador:

Frame-Weighted Bayes Factor Scoring for Speaker Verification

Relevância:

20.00% 20.00%

Publicador:

Robust Speaker Recognition Using Microphone Arrays

Relevância:

20.00% 20.00%

Publicador:

Dependence of GMM Adaptation on Feature Post-Processing for Speaker Recognition

Relevância:

20.00% 20.00%

Publicador:

Speaker Verification Using Hidden Markov Models in a Multilingual Text-Constrained Framework

Relevância:

20.00% 20.00%

Publicador:

The QUT NIST 2004 Speaker Verification System: A Fused Acoustic and High Level Approach

Relevância:

20.00% 20.00%

Publicador:

A Study on Standard and Iterative Map Adaptation for Speaker Recognition

Relevância:

20.00% 20.00%

Publicador:

Combination strategies for a factor analysis phone-conditioned speaker verification system

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This work aims to take advantage of recent developments in joint factor analysis (JFA) in the context of a phonetically conditioned GMM speaker verification system. Previous work has shown performance advantages through phonetic conditioning, but this has not been shown to date with the JFA framework. Our focus is particularly on strategies for combining the phone-conditioned systems. We show that the classic fusion of the scores is suboptimal when using multiple GMM systems. We investigate several combination strategies in the model space, and demonstrate improvement over score-level combination as well as over a non-phonetic baseline system. This work was conducted during the 2008 CLSP Workshop at Johns Hopkins University.

«
1
2
3
4
5
6
7
8
...
65
66
»