33 resultados para Visual Speaker Recognition, Visual Speech Recognition, Cascading Appearance-Based Features

em Chinese Academy of Sciences Institutional Repositories Grid Portal


Relevância:

100.00% 100.00%

Publicador:

Resumo:

In speaker-independent speech recognition, the disadvantage of the most diffused technology (HMMs, or Hidden Markov models) is not only the need of many more training samples, but also long train time requirement. This paper describes the use of Biomimetic pattern recognition (BPR) in recognizing some mandarin continuous speech in a speaker-independent manner. A speech database was developed for the course of study. The vocabulary of the database consists of 15 Chinese dish's names, the length of each name is 4 Chinese words. Neural networks (NNs) based on Multi-weight neuron (MWN) model are used to train and recognize the speech sounds. The number of MWN was investigated to achieve the optimal performance of the NNs-based BPR. This system, which is based on BPR and can carry out real time recognition reaches a recognition rate of 98.14% for the first option and 99.81% for the first two options to the persons from different provinces of China speaking common Chinese speech. Experiments were also carried on to evaluate Continuous density hidden Markov models (CDHMM), Dynamic time warping (DTW) and BPR for speech recognition. The Experiment results show that BPR outperforms CDHMM and DTW especially in the cases of samples of a finite size.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We studied the application of Biomimetic Pattern Recognition to speaker recognition. A speaker recognition neural network using network matching degree as criterion is proposed. It has been used in the system of text-dependent speaker recognition. Experimental results show that good effect could be obtained even with lesser samples. Furthermore, the misrecognition caused by untrained speakers occurring in testing could be controlled effectively. In addition, the basic idea "cognition" of Biomimetic Pattern Recognition results in no requirement of retraining the old system for enrolling new speakers.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We studied the application of Biomimetic Pattern Recognition to speaker recognition. A speaker recognition neural network using network matching degree as criterion is proposed. It has been used in the system of text-dependent speaker recognition. Experimental results show that good effect could be obtained even with lesser samples. Furthermore, the misrecognition caused by untrained speakers occurring in testing could be controlled effectively. In addition, the basic idea "cognition" of Biomimetic Pattern Recognition results in no requirement of retraining the old system for enrolling new speakers.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In this paper, a new classifier of speaker identification has been proposed, which is based on Biomimetic pattern recognition (BPR). Distinguished from traditional speaker recognition methods, such as DWT, HMM, GMM, SVM and so on, the proposed classifier is constructed by some finite sub-space which is reasonable covering of the points in high dimensional space according to distributing characteristic of speech feature points. It has been used in the system of speaker identification. Experiment results show that better effect could be obtained especially with lesser samples. Furthermore, the proposed classifier employs a much simpler modeling structure as compared to the GMM. In addition, the basic idea "cognition" of Biomimetic pattern recognition (BPR) results in no requirement of retraining the old system for enrolling new speakers.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Based on biomimetic pattern recognition theory, we proposed a novel speaker-independent continuous speech keyword-spotting algorithm. Without endpoint detection and division, we can get the minimum distance curve between continuous speech samples and every keyword-training net through the dynamic searching to the feature-extracted continuous speech. Then we can count the number of the keywords by investigating the vale-value and the numbers of the vales in the curve. Experiments of small vocabulary continuous speech with various speaking rate have got good recognition results and proved the validity of the algorithm.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

A new model of pattern recognition principles-Biomimetic Pattern Recognition, which is based on "matter cognition" instead of "matter classification", has been proposed. As a important means realizing Biomimetic Pattern Recognition, the mathematical model and analyzing method of ANN get breakthrough: a novel all-purpose mathematical model has been advanced, which can simulate all kinds of neuron architecture, including RBF and BP models. As the same time this model has been realized using hardware; the high-dimension space geometry method, a new means to analyzing ANN, has been researched.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

According to the research results reported in the past decades, it is well acknowledged that face recognition is not a trivial task. With the development of electronic devices, we are gradually revealing the secret of object recognition in the primate's visual cortex. Therefore, it is time to reconsider face recognition by using biologically inspired features. In this paper, we represent face images by utilizing the C1 units, which correspond to complex cells in the visual cortex, and pool over S1 units by using a maximum operation to reserve only the maximum response of each local area of S1 units. The new representation is termed C1 Face. Because C1 Face is naturally a third-order tensor (or a three dimensional array), we propose three-way discriminative locality alignment (TWDLA), an extension of the discriminative locality alignment, which is a top-level discriminate manifold learning-based subspace learning algorithm. TWDLA has the following advantages: (1) it takes third-order tensors as input directly so the structure information can be well preserved; (2) it models the local geometry over every modality of the input tensors so the spatial relations of input tensors within a class can be preserved; (3) it maximizes the margin between a tensor and tensors from other classes over each modality so it performs well for recognition tasks and (4) it has no under sampling problem. Extensive experiments on YALE and FERET datasets show (1) the proposed C1Face representation can better represent face images than raw pixels and (2) TWDLA can duly preserve both the local geometry and the discriminative information over every modality for recognition.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

A new method of face recognition, based on Biomimetic Pattern Recognition and Multi-Weights Neuron Network, had been proposed. A model for face recognition that is based on Biomimetic Pattern Recognition had been discussed, and a new method of facial feature extraction also had been introduced. The results of experiments with BPR and K-Nearest Neighbor Rules showed that the method based on BPR can eliminate the error recognition of the samples of the types that not be trained, the correct rate is also enhanced.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

A new model of pattern recognition principles-Biomimetic Pattern Recognition, which is based on "matter cognition" instead of "matter classification", has been proposed. As a important means realizing Biomimetic Pattern Recognition, the mathematical model and analyzing method of ANN get breakthrough: a novel all-purpose mathematical model has been advanced, which can simulate all kinds of neuron architecture, including RBF and BP models. As the same time this model has been realized using hardware; the high-dimension space geometry method, a new means to analyzing ANN, has been researched.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

草图符号的自适应学习中,不同用户的训练样本数量可能不同。保持在不同样本数量下良好的学习效果成为需要解决的一个重要问题.提出一种自适应的草图符号识别方法,该方法采用与训练样本个数相关的分类器组合策略将模板匹配方法和SVM统计分类方法进行了高效组合.它通过利用支持小样本学习的模板匹配方法和支持大量样本学习的SVM方法,并同时利用草图符号中的在线信息和离线信息,实现了不同样本个数下自适应的符号学习和识别.基于该方法,文中设计并实现了支持自适应识别的草图符号组件.最后,利用扩展的PIBGToolkit开发出原型系统IdeaNote.评估表明,该方法可以在24类草图符号分别使用1到20个训练样本时具有较高的识别正确率和较好的时间性能.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

对视觉伺服进行了综述性的介绍,系统地介绍了机器人视觉伺服控制的发展历史以及现状·从控制模型给出了视觉伺服控制系统的分类·针对两种最基本的分类方式基于位置的视觉伺服和基于图像的视觉伺服进行了重点介绍·对于视觉系统和图像特征的选取问题进行了讨论,此外还对视觉伺服系统的动态过程进行了分析,指出视觉系统的延时是目前伺服控制的研究所面临的最大问题·对未来视觉伺服研究的方向进行了总结·

Relevância:

100.00% 100.00%

Publicador:

Resumo:

要测量出一组特征点分别在两个空间坐标系下的坐标 ,就可以求解两个空间目标间的位姿关系 .实现上述目标位姿测量方法的前提条件是要保证该组特征点在不同坐标系下 ,其位置关系相同 ,但计算误差的存在却破坏了这种固定的位置关系 .为此 ,提出了两种基于模型的三维视觉方法——基于模型的单目视觉和基于模型的双目视觉 ,前者从视觉计算的物理意义入手 ,通过简单的约束迭代求解实现模型约束 ;后者则将简单的约束最小二乘法和基于模型的单目视觉方法融合在一起来实现模型约束 .引入模型约束后 ,单目视觉方法可以达到很高的测量精度 .而基于模型的双目视觉较传统的无模型立体视觉方法位移精度提高有限 ,但姿态精度提高很多

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In this paper we present a robust face location system based on human vision simulations to automatically locate faces in color static images. Our method is divided into four stages. In the first stage we use a gauss low-pass filter to remove the fine information of images, which is useless in the initial stage of human vision. During the second and the third stages, our technique approximately detects the image regions, which may contain faces. During the fourth stage, the existence of faces in the selected regions is verified. Having combined the advantages of Bottom-Up Feature Based Methods and Appearance-Based Methods, our algorithm performs well in various images, including those with highly complex backgrounds.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

本文设计与实现了一种基于TMS320DM642的香烟小包装外观质量检测系统,详细阐述了该系统的硬件构成、软件流程、检测算法以及针对DSP处理器进行的系统优化。系统以TMS320DM642处理器为核心建立硬件平台,通过摄像头获取香烟小包装图像,采用改进的模板匹配算法对当前图像进行质量检测,最终在监视器上显示检测结果并将检测结果送执行单元进行处理。实验结果表明基于TMS320DM642的香烟小包装检测系统,检测效果快速、准确、有效,应用前景广泛。

Relevância:

100.00% 100.00%

Publicador:

Resumo:

香烟小包装在线实时检测系统是一种烟草行业产品包装检测设备,具有广阔的应用前景。在现代生产过程中,生产速度越来越快,对产品质量的要求也越来越高。烟草企业在香烟的生产过程中,从烟叶制丝到卷、接、包装都已经实现了自动化。香烟小包装的外观质量,反映了烟厂的技术装备水平,涉及到企业的形象、信誉问题,同时,有质量缺陷的香烟小包装被市场反馈回企业,也会带来企业成本的增加。 随着计算机软件、硬件技术的发展,以及机器视觉理论的完善,采用机器视觉的方法来检测香烟小包装的外观质量,已经开始应用。机器视觉在检测方面具有先天优势检测速度快、分辨能力高、规范化程度高、可重复性好。采用现代机器视觉技术来进行香烟小包装外观质量的检测,可以大大降低检验人员的劳动强度,提高产品的质量,减少烟厂的人力成本和管理成本,改善企业形象。 本文分析了国内外烟包包装质量检测的许多方法,设计了一套基于DSP的香烟小包装外观质量检测系统,可以对香烟小包装进行实时检测,达到实时剔除有包装质量缺陷的香烟小包装的目的。 从机器视觉的角度出发,本文阐述了视觉检测系统的工作原理、总体机构及系统的工作流程,同时对比各种硬件特性,进行了光源、传感器、相机、镜头及DSP芯片的选型,着重介绍了图像处理算法,尤其是本系统中用到的图像配准、模板匹配以及各种缺陷的识别进行了详细的描述,并给出了程序在DSP中的优化方法。 本文对设计的系统进行了一系列的实验和测试,结果表明,本系统具有速度快,总体检测效果好,稳定性好的特点,可以达到香烟小包装实时检测的要求。