100 resultados para Speaker Recognition
Resumo:
In this paper, we firstly give the nature of 'hypersausages', study its structure and training of the network, then discuss the nature of it by way of experimenting with ORL face database, and finally, verify its unsurpassable advantages compared with other means.
Resumo:
We investigate the use of independent component analysis (ICA) for speech feature extraction in digits speech recognition systems. We observe that this may be true for recognition tasks based on Geometrical Learning with little training data. In contrast to image processing, phase information is not essential for digits speech recognition. We therefore propose a new scheme that shows how the phase sensitivity can be removed by using an analytical description of the ICA-adapted basis functions. Furthermore, since the basis functions are not shift invariant, we extend the method to include a frequency-based ICA stage that removes redundant time shift information. The digits speech recognition results show promising accuracy. Experiments show that the method based on ICA and Geometrical Learning outperforms HMM in a different number of training samples.
Resumo:
In this paper, we presents HyperSausage Neuron based on the High-Dimension Space(HDS), and proposes a new algorithm for speaker independent continuous digit speech recognition. At last, compared to HMM-based method, the recognition rate of HyperSausage Neuron method is higher than that of in HMM-based method.
Resumo:
National Natural Science Foundation of China 60753001
Resumo:
Pen-based user interface has become a hot research field in recent years. Pen gesture plays an important role in Pen-based user interfaces. But it’s difficult for UI designers to design, and for users to learn and use. In this purpose, we performed a research on user-centered design and recognition pen gestures. We performed a survey of 100 pen gestures in twelve famous pen-bases systems to find problems of pen gestures currently used. And we conducted a questionnaire to evaluate the matching degree between commands and pen gestures to discover the characteristics that a good pen gestures should have. Then cognition theories were applied to analyze the advantages of those characteristics in helping improving the learnability of pen gestures. From these, we analyzed the pen gesture recognition effect and presented some improvements on features selection in recognition algorithm of pen gestures. Finally we used a couple of psychology experiments to evaluate twelve pen gestures designed based on the research. It shows those gestures is better for user to learn and use. Research results of this paper can be used for designer as a primary principle to design pen gestures in pen-based systems.
Resumo:
In recognition-based user interface, users’ satisfaction is determined not only by recognition accuracy but also by effort to correct recognition errors. In this paper, we introduce a crossmodal error correction technique, which allows users to correct errors of Chinese handwriting recognition by speech. The focus of the paper is a multimodal fusion algorithm supporting the crossmodal error correction. By fusing handwriting and speech recognition, the algorithm can correct errors in both character extraction and recognition of handwriting. The experimental result indicates that the algorithm is effective and efficient. Moreover, the evaluation also shows the correction technique can help users to correct errors in handwriting recognition more efficiently than the other two error correction techniques.
Resumo:
A new method of face recognition, based on Biomimetic Pattern Recognition and Multi-Weights Neuron Network, had been proposed. A model for face recognition that is based on Biomimetic Pattern Recognition had been discussed, and a new method of facial feature extraction also had been introduced. The results of experiments with BPR and K-Nearest Neighbor Rules showed that the method based on BPR can eliminate the error recognition of the samples of the types that not be trained, the correct rate is also enhanced.
Resumo:
A new theoretical model of Pattern Recognition principles was proposed, which is based on "matter cognition" instead of "matter classification" in traditional statistical Pattern Recognition. This new model is closer to the function of human being, rather than traditional statistical Pattern Recognition using "optimal separating" as its main principle. So the new model of Pattern Recognition is called the Biomimetic Pattern Recognition (BPR)(1). Its mathematical basis is placed on topological analysis of the sample set in the high dimensional feature space. Therefore, it is also called the Topological Pattern Recognition (TPR). The fundamental idea of this model is based on the fact of the continuity in the feature space of any one of the certain kinds of samples. We experimented with the Biomimetic Pattern Recognition (BPR) by using artificial neural networks, which act through covering the high dimensional geometrical distribution of the sample set in the feature space. Onmidirectionally cognitive tests were done on various kinds of animal and vehicle models of rather similar shapes. For the total 8800 tests, the correct recognition rate is 99.87%. The rejection rate is 0.13% and on the condition of zero error rates, the correct rate of BPR was much better than that of RBF-SVM.
Resumo:
An improved BP algorithm for pattern recognition is proposed in this paper. By a function substitution for error measure, it resolves the inconsistency of BP algorithm for pattern recognition problems, i.e. the quadratic error is not sensitive to whether the training pattern is recognized correctly or not. Trained by this new method, the computer simulation result shows that the convergence speed is increased to treble and performance of the network is better than conventional BP algorithm with momentum and adaptive step size.
Resumo:
A new model of pattern recognition principles-Biomimetic Pattern Recognition, which is based on "matter cognition" instead of "matter classification", has been proposed. As a important means realizing Biomimetic Pattern Recognition, the mathematical model and analyzing method of ANN get breakthrough: a novel all-purpose mathematical model has been advanced, which can simulate all kinds of neuron architecture, including RBF and BP models. As the same time this model has been realized using hardware; the high-dimension space geometry method, a new means to analyzing ANN, has been researched.
Resumo:
The transient state (as the defined point where no enantioseparation is obtained in a dual chiral selector system) of chiral recognition of aminoglutethimide in a binary mixture of neutral cyclodextrins (CDs) was studied by capillary electrophoresis (CE). The following three dual selector systems were used: alpha-cyclodextrin (alpha-CD) and beta-cyclodextrin (beta-CD); alpha-CD and heptakis(di-O-methyl-beta-cyclodextrin) (DM-beta-CD); alpha-CD and heptakis(tri-O-methyl-beta-cyclodextrin) (TM-beta-CD). The S-(-) enantiomer of the analyte was more strongly retained in the presence of either alpha-CD or TM-beta-CD at pH 2.5, 100 mM phosphate buffer, while the R-(+) enantiomer was more strongly retained in the presence of either P-CD or DM-P-CD. In the more simple case, the elution order is invariably kept if the enantiomers have the same elution order in either one of the two hosts of the binary mixture. In contrast, the elution order may be switched by varying the concentration ratio of two hosts that produce opposite elution order for this particular analyte. In such a dual selector system, the enantioselectivity will disappear at the transient state at a certain ratio of host,:host, Moreover, the migration times of the two enantiomers with host, alone (diluted in buffer) is approximately equal to the migration times at the corresponding concentration of host, alone (diluted in buffer), where the ratio of concentrations of host,:host, is the same as in the binary mixture at the transient state. As found by nuclear magnetic resonance experiments, the analyte is forming a 1:1 complex with either one of the CDs applied. From this finding, a theoretical model based on the mobility difference of the two enantiomers was derived that was used to simulate the transient state. (C) 2000 Elsevier Science B.V. All rights reserved.