27 resultados para k-Means algorithm

em Chinese Academy of Sciences Institutional Repositories Grid Portal


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Tianjin University of Technology

Relevância:

100.00% 100.00%

Publicador:

Resumo:

文本聚类在信息过滤,网页分类中有着很好的应用。但它面临数据量大,特征维度高的难点。由于K平均算法易于实现,对数据依赖度底,在文本聚类中得到应用。然而,传统K平均以及它的变种会产生有较大波动的聚类结果。因此对K平均算法进行了改进,通过优化聚类初始中心的选择,得到一种适合对文本数据聚类分析的改进算法。大量实验显示,该算法可以生成质量较高而且聚类质量波动性较小的结果。

Relevância:

100.00% 100.00%

Publicador:

Resumo:

完整性是数据质量的一个重要维度,由于数据本身固有的不确定性、采集的随机性及不准确性,导致现实应用中产生了大量具有如下特点的数据集:1)数据规模庞大;2)数据往往是不完整、不准确的.因此将大规模数据集分段到不同的数据窗口中处理是数据处理的重要方法,但缺失数据估算的相关研究大都忽视了数据集的特点和窗口的应用,而且回定大小的数据窗17容易造成算法的准确性和性能受窗口大小及窗口内数据值分布的影响.假设数据满足一定的领域相关的约束,首先提出了一种新的基于时间的动态自适应数据窗口检测算法,并基于此窗口提出了一种改进的模糊k-均值聚类算法来进行不完整数据的缺失数据估算.实验表明较之其他算法,不仅能更适应数据集的特点,具有较好的性能,而且能够保证准确性.

Relevância:

90.00% 90.00%

Publicador:

Resumo:

本文提出一种聚类引导搜索(cluster guide searching,CGS)的路径规划方法。采用基于最大最小距离的K均值聚类方法对样本进行离线聚类学习,学习结果以相似环境相似决策的知识形式进行存储。路径规划过程中,机器人在线整理环境信息,获得输入空间样本,通过与知识库匹配,检索到最近的类别,然后在该类别内部采用速度优先策略和方向优先策略交替的方式搜索输出空间。若知识不完备导致检索失败,可重启线性规划算法(linear programming,LP)进行在线路径规划,并更新聚类知识库。仿真结果表明该方法是一种有效的路径规划学习方法。

Relevância:

80.00% 80.00%

Publicador:

Resumo:

  语言文字的信息处理技术是人类社会现代化的重要成果。信息处理技术的水平已成为一个国家、一个民族发展进步的重要标志。改革开放以来,在实现汉语言文字信息处理技术的同时,我国少数民族语言文字信息处理技术也取得了巨大成就。从上世纪80、90年代起,藏语文信息处理工作取得突破性进展,然而在藏文的联机手写识别领域却仍处于起步阶段。   本文在总结了汉字识别和已有的藏文联机识别成果的基础上,在预处理过程、特征提取部分以及聚类部分进行了改进。在预处理过程中,使用了应用于图像处理中的数学形态学操作作为预处理中的去噪声方法,主要操作包括膨胀,腐蚀和细化;在特征提取阶段,在分析了原始特征值在后续计算中存在缺陷的基础上,本文对原始特征值进行了非线性变换,在网格方向特征提取时加入了网格的权值,从而增强了特征向量的区分能力;聚类使用的是k-means聚类方法,在聚类距离方面,我们在对比了各种距离测度方法后,提出了一种变型的欧氏距离计算方法,并将硬聚类方法改为模糊聚类方法,提高了算法的鲁棒性;为了提高系统区分相似字的能力,本文在借鉴了签名识别的方法后,实现了二级分类器,增强了其对细微差别的辨别能力。   实验结果表明,本文提出的改进方法对识别率有一定的提高,说明方法可行有效,适用于藏文的联机手写识别。

Relevância:

80.00% 80.00%

Publicador:

Resumo:

变形手势跟踪是基于视觉的人机交互研究中的一项重要内容.单摄像头条件下,提出一种新颖的变形手势实时跟踪方法.利用一组2D手势模型替代高维度的3D手模型.首先利用贝叶斯分类器对静态手势进行识别,然后对图像进行手指和指尖定位,通过将图像特征与识别结果进行匹配,实现了跟踪过程的自动初始化.提出将K-means聚类算法与粒子滤波相结合,用于解决多手指跟踪问题中手指互相干扰的问题.跟踪过程中进行跟踪状态检测,实现了自动恢复跟踪及手势模型更新.实验结果表明,该方法可以实现对变形手势快速、准确的连续跟踪,能够满足基于视觉的实时人机交互的要求.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

银行属于数据密集型企业,每天产生海量的业务数据,蕴含有大量有用的信息。如何利用这些数据,让其发挥巨大作用,成为提高银行竞争力的重要内容。当前,随着银行转变经营理念、信息技术的飞速发展,银行信息化已经进入一个全新的发展时期,数据仓库技术等新兴技术已成为银行数据分析和决策必不可少的重要工具。但目前,很多数据仓库应用显得功能不够实用,效果也没有预想的明显。为此,结合业务来深入研究数据仓库的使用机理,挖掘分析功能成为银行当前迫在眉睫的任务。 本文首先通过分析银行现有的实际业务需求,着重对数据仓库技术在银行业中的应用现状进行了详细表述,并根据数据仓库和数据挖掘技术的深入研究,结合银行的具体业务特点,设计与实现了商业银行分布式数据仓库系统;同时对分布式数据仓库系统的构建机理和实施步骤进行了详细描述。利用构件方式重新设计和实现了数据仓库中的重要部分——ETL系统,并加入ETL服务器的线程池缓冲区机制,实现系统性能的优化。 本文在建立数据仓库系统的基础上,利用联机分析处理OLAP技术和改进数据挖掘K-means聚类算法——X-means算法,实现了银行数据仓库系统的客户管理分析功能,取得了很好的应用效果,从而为银行数据仓库系统开发应用提供了可借鉴的操作思路。

Relevância:

80.00% 80.00%

Publicador:

Resumo:

针对视频监控系统,提出了一种改进的基于区域的运动目标分割方法。与传统方法相比,在运动检测阶段,结合时域差分和背景差分进行运动检测,并通过自适应方法进行背景更新;在差分图像二值化时,采用自适应阈值方法来代替传统的手工确定阈值法;对于区域分割,使用基于加权平方欧式距离的均值聚类算法代替传统的均值聚类算法。实验结果表明该改进方法比传统方法具有更好的实时性、鲁棒性和有效性。

Relevância:

80.00% 80.00%

Publicador:

Resumo:

从特殊票据———火车票票面字符的特点出发 ,将笔划复杂性指数与四周面积编码结合起来作为粗分类的分类特征 采用C -均值聚类算法进行预分类 最后生成分类特征库———分类字典 .得到了预期的分类效果 ,正确分类率达到 95 % .

Relevância:

80.00% 80.00%

Publicador:

Resumo:

模糊C-means算法在聚类分析中已得到了成功的应用,本文提出一种利用模糊C-means算法消除噪声的新方法。一般来说,图象中的噪声点就是其灰度值与其周围象素的灰度值之差超过某个门限值的点。根据这个事实,首先利用模糊C-means算法分类,再利用标准核函数检测出噪声点,然后将噪声点去掉。由于只修改噪声点处的象素灰度值,而对于其它象素的灰度值不予改变,所以本算法能够很好地保护细节和边缘。本方法每次处理3×3个点,而以往的方法只能每次处理一个点,所以本方法能提高运算速度。文中给出了利用本方法对实际图象处理的结果,并与梯度倒数权值法进行了定量的比较。

Relevância:

40.00% 40.00%

Publicador:

Resumo:

The quantum coherence control of a solid-state charge qubit is studied by using a suboptimal continuous feedback algorithm within the Bayesian feedback scheme. For the coherent Rabi oscillation, the present algorithm suggests a simple bang-bang control protocol, in which the control parameter is modulated between two values. For the coherence protection of the idle state, the present approach is applicable to arbitrary states, including those lying on the equator of the Bloch sphere which are out of control in the previous Markovian feedback scheme.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In this paper, we propose a novel three-dimensional imaging method by which the object is captured by a coded cameras array (CCA) and computationally reconstructed as a series of longitudinal layered surface images of the object. The distribution of cameras in array, named code pattern, is crucial for reconstructed images fidelity when the correlation decoding is used. We use DIRECT global optimization algorithm to design the code patterns that possess proper imaging property. We have conducted primary experiments to verify and test the performance of the proposed method with a simple discontinuous object and a small-scale CCA including nine cameras. After certain procedures such as capturing, photograph integrating, computational reconstructing and filtering, etc., we obtain reconstructed longitudinal layered surface images of the object with higher signal-to-noise ratio. The results of experiments show that the proposed method is feasible. It is a promising method to be used in fields such as remote sensing, machine vision, etc. (c) 2006 Elsevier GmbH. All rights reserved.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

When noises considerations are made, nonredundant arrays (NRAs) are endowed with many advantages which other arrays e.g., uniformly redundant arrays (URAs) do not possess in applications of coded aperture imaging. However, lower aperture opening ratio limits the applications of NRA in practice. In this paper, we present a computer searching method based on a global optimization algorithm named DIRECT to design NRAs. Compared with the existing NRAs e.g., Golay's NRAs, which are well known and widely used in various applications, NRAs found by our method have higher aperture opening ratio and auto correlation compression ratio. These advantages make our aperture arrays be very useful for practical applications especially for which of aperture size are limited. Here, we also present some aperture arrays we found. These aperture arrays have an interesting property that they belong to both NRA and URA. (C) 2006 Elsevier GmbH. All rights reserved.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The admixture of linear and circular photogalvanic effects and (CPGEs) in AlxGa1-xN/GaN heterostructures has been investigated quantitatively by near-infrared irradiation at room temperature. The spin-based photocurrent that the authors have observed solidly indicates the sizable spin-orbital interaction of the two-dimensional electron gas in the heterostructures. Further analysis shows consistency between studies by optical and magnetic (Shubnikov de-Haas) measurements on the spin-orbital coupling effects among different AlxGa1-xN/GaN heterostructures, indicating that the CPGE measurement is a good way to investigate the spin splitting and the spin polarization in semiconductors. (C) 2007 American Institute of Physics.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

One of the most important kinds of queries in Spatial Network Databases (SNDB) to support location-based services (LBS) is the shortest path query. Given an object in a network, e.g. a location of a car on a road network, and a set of objects of interests, e.g. hotels,gas station, and car, the shortest path query returns the shortest path from the query object to interested objects. The studies of shortest path query have two kinds of ways, online processing and preprocessing. The studies of preprocessing suppose that the interest objects are static. This paper proposes a shortest path algorithm with a set of index structures to support the situation of moving objects. This algorithm can transform a dynamic problem to a static problem. In this paper we focus on road networks. However, our algorithms do not use any domain specific information, and therefore can be applied to any network. This algorithm’s complexity is O(klog2 i), and traditional Dijkstra’s complexity is O((i + k)2).