A new indexing method for high dimensional dataset


Autoria(s): An, Jiyuan; Chen, Yi-Ping Phoebe; Xu, Qinying; Zhou, Xiaofang
Data(s)

01/01/2005

Resumo

Indexing high dimensional datasets has attracted extensive attention from many researchers in the last decade. Since R-tree type of index structures are known as suffering curse of dimensionality problems, Pyramid-tree type of index structures, which are based on the B-tree, have been proposed to break the curse of dimensionality. However, for high dimensional data, the number of pyramids is often insufficient to discriminate data points when the number of dimensions is high. Its effectiveness degrades dramatically with the increase of dimensionality. In this paper, we focus on one particular issue of curse of dimensionality; that is, the surface of a hypercube in a high dimensional space approaches 100% of the total hypercube volume when the number of dimensions approaches infinite. We propose a new indexing method based on the surface of dimensionality. We prove that the Pyramid tree technology is a special case of our method. The results of our experiments demonstrate clear priority of our novel method.<br />

Identificador

http://hdl.handle.net/10536/DRO/DU:30003066

Idioma(s)

eng

Publicador

Springer-Verlag

Relação

http://dro.deakin.edu.au/eserv/DU:30003066/n20050625.pdf

http://www.springerlink.com/content/rtyn11xj4l9yhf19/fulltext.pdf

Direitos

Springer-Verlag Berlin Heidelberg, 2005

Palavras-Chave #computer science #theory & methods
Tipo

Journal Article