Digital Library

914 results for robust speech recognition

A Hybrid LP-Harmonics Model for Low Bit-Rate Speech Compression with Natural Quality

Speaker identification using higher order spectral phase features and their effectiveness vis-a-vis Mel-Cepstral features

Abstract:

The effectiveness of higher-order spectral (HOS) phase features in speaker recognition is investigated by comparison with Mel-Cepstral features on the same speech data. Unlike Mel-frequency cepstral coefficients (MFCCs), HOS phase features retain phase information from the Fourier spectrum. Gaussian mixture models are constructed from Mel-Cepstral features and HOS features, respectively, for the same data from various speakers in the Switchboard telephone Speech Corpus. Feature clusters, model parameters and classification performance are analyzed. HOS phase features on their own provide a correct identification rate of about 97% on the chosen subset of the corpus, the same level of accuracy as provided by MFCCs. Cluster plots and model parameters are compared to show that HOS phase features can provide complementary information to better discriminate between speakers.

Textual Analysis for Script Recognition

Infra-red pupil detection for use in a face recognition system

Abstract:

This paper presents a new method of eye localisation and face segmentation for use in a face recognition system. Using two near-infrared light sources, we show that the face can be coarsely segmented and the eyes accurately located, which increases the accuracy of the face localisation and improves the overall speed of the system. The system is able to locate both eyes within 25% of the eye-to-eye distance in over 96% of test cases.

Robustness to expression variations in fractal-based face recognition

An application of fractal image-set coding in facial recognition

Abstract:

Faces are complex patterns that often differ in only subtle ways. Face recognition algorithms have difficulty in coping with differences in lighting, cameras, pose, expression, etc. We propose a novel approach for facial recognition based on a new feature extraction method called fractal image-set encoding. This feature extraction method is a specialized fractal image coding technique that makes fractal codes more suitable for object and face recognition. A fractal code of a gray-scale image can be divided into two parts: geometrical parameters and luminance parameters. We show that fractal codes for an image are not unique and that we can change the set of fractal parameters without significant change in the quality of the reconstructed image. Fractal image-set coding keeps geometrical parameters the same for all images in the database. Differences between images are captured in the non-geometrical, or luminance, parameters, which are faster to compute. Results on a subset of the XM2VTS database are presented.

2D-3D Face Recognition Based on PCA and Feature Modelling

Abstract:

Hybrid face recognition, which uses image (2D) and structural (3D) information, has explored the fusion of Nearest Neighbour classifiers. This paper examines the effectiveness of feature modelling for each individual modality, 2D and 3D. Furthermore, it is demonstrated that the fusion of feature modelling techniques for the 2D and 3D modalities yields performance improvements over the individual classifiers. By fusing the feature modelling classifiers for each modality with equal weights, the average Equal Error Rate improves from 12.60% for the 2D classifier and 12.10% for the 3D classifier to 7.38% for the Hybrid 2D+3D classifier.
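
The equal-weight fusion mentioned in this abstract can be illustrated with a minimal sketch. The abstract does not say how the two classifiers are combined, so the following assumes score-level fusion with min-max normalisation; the function names and the example scores are hypothetical and are not taken from the paper.

```python
from typing import List, Sequence


def min_max_normalise(scores: Sequence[float]) -> List[float]:
    """Rescale scores to [0, 1] so the two modalities are comparable."""
    lo, hi = min(scores), max(scores)
    if hi == lo:
        # All scores identical: no ranking information, map to zero.
        return [0.0 for _ in scores]
    return [(s - lo) / (hi - lo) for s in scores]


def fuse_equal_weights(scores_2d: Sequence[float],
                       scores_3d: Sequence[float]) -> List[float]:
    """Average the normalised 2D and 3D scores with equal weights (assumed scheme)."""
    if len(scores_2d) != len(scores_3d):
        raise ValueError("both modalities must score the same candidates")
    n2 = min_max_normalise(scores_2d)
    n3 = min_max_normalise(scores_3d)
    return [0.5 * a + 0.5 * b for a, b in zip(n2, n3)]


# Hypothetical similarity scores for three enrolled identities
# (higher means a better match); the two modalities use different scales.
fused = fuse_equal_weights([0.2, 0.9, 0.4], [10.0, 30.0, 50.0])
best = max(range(len(fused)), key=fused.__getitem__)
```

Here the 2D classifier prefers the second identity and the 3D classifier prefers the third; after normalisation and averaging, the second identity has the highest fused score. Normalising first matters because raw scores from different modalities are generally not on the same scale.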

Cross-Lingual Pronunciation Modelling for Indonesian Speech

Trainable Speech Synthesis with Trended Hidden Markov Models

Cross-Language Acoustic Model Refinement for the Indonesian Language

Basis Pursuit Feature Based Neural Network Pattern Recognition of Rolling Bearing Faults

Matching Pursuit Features Based Neural Network Pattern Recognition of Rolling Bearing Faults

Robust Controller Design of Networked Control Systems with Nonlinear Uncertainties

Abstract:

We address the robust stabilization problem for networked control systems with nonlinear uncertainties and packet losses by modelling such systems as a class of uncertain switched systems. Based on the theory of switched Lyapunov functions, we derive robust stabilization conditions for state feedback and design packet-loss-dependent controllers by solving a set of matrix inequalities. A numerical example and simulations demonstrate the effectiveness of the proposed design method.

Subfractals: A New Concept for Fractal Image Coding and Recognition

Recognition of logo images using invariants defined from higher-order spectra

