17 resultados para SUPPORT VECTOR MACHINES
em Chinese Academy of Sciences Institutional Repositories Grid Portal
Resumo:
Studies on learning problems from geometry perspective have attracted an ever increasing attention in machine learning, leaded by achievements on information geometry. This paper proposes a different geometrical learning from the perspective of high-dimensional descriptive geometry. Geometrical properties of high-dimensional structures underlying a set of samples are learned via successive projections from the higher dimension to the lower dimension until two-dimensional Euclidean plane, under guidance of the established properties and theorems in high-dimensional descriptive geometry. Specifically, we introduce a hyper sausage like geometry shape for learning samples and provides a geometrical learning algorithm for specifying the hyper sausage shapes, which is then applied to biomimetic pattern recognition. Experimental results are presented to show that the proposed approach outperforms three types of support vector machines with either a three degree polynomial kernel or a radial basis function kernel, especially in the cases of high-dimensional samples of a finite size. (c) 2005 Elsevier B.V. All rights reserved.
Resumo:
目前支持向量机解决模式识别问题是广大学者研究的热点,样本的后验概率在模式识别中至关重要,但是传统的支持向量机技术不提供后验概率.针对这一问题进行了3个方面的研究:①在给出样本点后验概率的基础上,将大规模优化问题分解成最大似然函数和最大分类边界两个小规模优化问题;②给出了一种新的用后验概率修正最优分离超平面的方法,并且分析了该新方法的合理性;③用图像分类的3组实例说明本方法的有效性.
Resumo:
Univ SE Calif, Ctr Syst & Software Engn, ABB, Microsoft Res, IEEE, ACMSIGSOFT, N Carolina State Univ Comp Sci
Resumo:
The Gaussian process latent variable model (GP-LVM) has been identified to be an effective probabilistic approach for dimensionality reduction because it can obtain a low-dimensional manifold of a data set in an unsupervised fashion. Consequently, the GP-LVM is insufficient for supervised learning tasks (e. g., classification and regression) because it ignores the class label information for dimensionality reduction. In this paper, a supervised GP-LVM is developed for supervised learning tasks, and the maximum a posteriori algorithm is introduced to estimate positions of all samples in the latent variable space. We present experimental evidences suggesting that the supervised GP-LVM is able to use the class label information effectively, and thus, it outperforms the GP-LVM and the discriminative extension of the GP-LVM consistently. The comparison with some supervised classification methods, such as Gaussian process classification and support vector machines, is also given to illustrate the advantage of the proposed method.
Resumo:
提出一种基于支持向量机理论的车型分类器的设计方案。通过对实际车辆的图像采集、处理和分析,获取所需样本数据。采用有导师训练方法训练三个支持向量机识别器,使用测试样本对训练出的识别器进行性能测试。最后将三个识别器与表决器结合得到车型分类器。
Resumo:
杜鹃属(Rhododendron L.)是中国种子植物中最大的属,其现代分布和分化中心是我国西南部的横断山区和东喜马拉雅地区。我国西部、西南部的云南、四川、西藏等地共有杜鹃达450种,仅特有种就有约300种。对杜鹃属分布的深入研究是横断山区生物多样性保护不可缺少的重要部分。 由于物种分布与环境因子之间存在着紧密的联系,利用环境因子作为预测物种分布模型的变量是当前最普遍的建模思路。但是绝大多数物种分布预测模型都遇到了难以解决的“高维小样本”问题――模型在标本数据不足时无法给出合理的预测,或者模型无法处理大量的环境变量。机器学习领域的理论和实践已经证明,基于结构风险最小化原理的支持向量机(Support Vector Machine, SVM)算法非常适合“高维小样本”的分类问题。为了探索其应用在物种分布预测问题上的可能性,本文创新性的实现了基于SVM算法的物种分布预测系统。然后,本文以30个杜鹃属(Rhododendron L.)物种为检验对象,利用其标本数据和11个1km的栅格环境变量图层作为模型变量,预测其在中国的潜在分布区。本文通过全面的模型评估——专家评估,ROC (Receiver Operator Characteristic)曲线和曲线下方面积AUC (Area Under the Curve)——来比较模型的性能。试验结果表明,我们所实现的以SVM为核心的物种分布预测系统无论在计算速度还是预测效果上都远远优于当前广泛使用的GARP (Genetic Algorithm for Rule-Set Prediction)预测系统。 之后,本文进一步探讨了SVM预测系统预测效果与环境变量维数和标本点个数的关系。试验结果表明,对于只有少量标本点的物种SVM的预测结果仍然具有相当的合理性。由此可见, SVM预测系统很好的解决了以前众多模型无法克服的稀有种和标本点稀少的物种的潜在分布区模拟问题。同时本文发现大的环境维数(高维)对于物种潜在分布区的预测有着决定性的作用,因此模型处理高维问题的能力显得至关重要。 最后,我们使用中国所有可获取的杜鹃属标本数据,以及83个1km的栅格环境变量图层,对400种杜鹃属物种的潜在分布区进行预测。根据预测出来的物种潜在分布区,我们得到了中国杜鹃属物种潜在多样性分布格局,特有物种潜在多样性分布格局,濒危杜物种潜在的分布格局,各亚属物种潜在分布格局,以及不同生活型物种潜在多样性分布格局。这些分布区图不仅可以对杜鹃属起源研究提供分析验证的条件,还能为其引种、保护和新种的搜寻提供有利的空间依据。
Resumo:
Biomimetic pattern recogntion (BPR), which is based on "cognition" instead of "classification", is much closer to the function of human being. The basis of BPR is the Principle of homology-continuity (PHC), which means the difference between two samples of the same class must be gradually changed. The aim of BPR is to find an optimal covering in the feature space, which emphasizes the "similarity" among homologous group members, rather than "division" in traditional pattern recognition. Some applications of BPR are surveyed, in which the results of BPR are much better than the results of Support Vector Machine. A novel neuron model, Hyper sausage neuron (HSN), is shown as a kind of covering units in BPR. The mathematical description of HSN is given and the 2-dimensional discriminant boundary of HSN is shown. In two special cases, in which samples are distributed in a line segment and a circle, both the HSN networks and RBF networks are used for covering. The results show that HSN networks act better than RBF networks in generalization, especially for small sample set, which are consonant with the results of the applications of BPR. And a brief explanation of the HSN networks' advantages in covering general distributed samples is also given.
Resumo:
In this paper, a novel approach for mandarin speech emotion recognition, that is mandarin speech emotion recognition based on high dimensional geometry theory, is proposed. The human emotions are classified into 6 archetypal classes: fear, anger, happiness, sadness, surprise and disgust. According to the characteristics of these emotional speech signals, the amplitude, pitch frequency and formant are used as the feature parameters for speech emotion recognition. The new method called high dimensional geometry theory is applied for recognition. Compared with traditional GSVM model, the new method has some advantages. It is noted that this method has significant values for researches and applications henceforth.
Resumo:
准确的网络流量分类是众多网络研究工作的基础,也一直是网络测量领域的研究热点.近年来,利用机器学习方法处理流量分类问题成为了该领域一个新兴的研究方向.在目前研究中应用较多的是朴素贝叶斯(nave Bayes,NB)及其改进算法.这些方法具有实现简单、分类高效的特点.但该方法过分依赖于样本空间的分布,具有内在的不稳定性.因此,提出一种基于支持向量机(support vector machine,SVM)的流量分类方法.该方法利用非线性变换和结构风险最小化(structural risk minimization,SRM)原则将流量分类问题转化为二次寻优问题,具有良好的分类准确率和稳定性.在理论分析的基础上,通过在实际网络流集合上与朴素贝叶斯算法的对比实验,可以看出使用支持向量机方法处理流量分类问题,具有以下3个优势:1)网络流属性不必满足条件独立假设,无须进行属性过滤;2)能够在先验知识相对不足的情况下,仍保持较高的分类准确率;3)不依赖于样本空间的分布,具有较好的分类稳定性.
Resumo:
While Histograms of Oriented Gradients (HOG) plus Support Vector Machine (SVM) (HOG+SVM) is the most successful human detection algorithm, it is time-consuming. This paper proposes two ways to deal with this problem. One way is to reuse the features in blocks to construct the HOG features for intersecting detection windows. Another way is to utilize sub-cell based interpolation to efficiently compute the HOG features for each block. The combination of the two ways results in significant increase in detecting humans-more than five times better. To evaluate the proposed method, we have established a top-view human database. Experimental results on the top-view database and the well-known INRIA data set have demonstrated the effectiveness and efficiency of the proposed method. (C) 2010 Elsevier B.V. All rights reserved.
Resumo:
针对用于服务机器人的脑机接口系统中脑电信号模式识别精度不高,不能满足机器人多任务要求的问题,提出一种基于C-支持向量多分类机的多类复杂手操作EEG信号模式识别方法,并将其应用到复杂手操作的EEG信号模式识别试验中,实现一个4类复杂手操作的模式识别,实验结果表明,与之前用BP神经网络进行识别相比,识别率由85%提高到了90%。
Resumo:
成分分析(PCA)只能从2阶上去消除数据的相关性,传统支持向量机在解决多类问题时会出现分类的盲区问题,针对这两种情况,首先采用独立成分分析(ICA)方法解决了高阶上的数据相关性问题;同时在传统支持向量机中引入模糊隶属度函数,用模糊支持向量机解决了传统支持向量机在多类数据识别中的盲区问题。通过实验证明了该方法在人脸识别率上取得了显著提高。