A new indexing method for high dimensional dataset


Autoria(s): An, Jiyuan; Chen, Yi-Ping Phoebe; Xu, Qinying; Zhou, Xiaofang
Contribuinte(s)

Lizhu Zhou

Beng Chin Ooi

Xiaofeng Meng

Data(s)

01/01/2005

Resumo

Indexing high dimensional datasets has attracted extensive attention from many researchers in the last decade. Since R-tree type of index structures are known as suffering curse of dimensionality problems, Pyramid-tree type of index structures, which are based on the B-tree, have been proposed to break the curse of dimensionality. However, for high dimensional data, the number of pyramids is often insufficient to discriminate data points when the number of dimensions is high. Its effectiveness degrades dramatically with the increase of dimensionality. In this paper, we focus on one particular issue of curse of dimensionality; that is, the surface of a hypercube in a high dimensional space approaches 100% of the total hypercube volume when the number of dimensions approaches infinite. We propose a new indexing method based on the surface of dimensionality. We prove that the Pyramid tree technology is a special case of our method. The results of our experiments demonstrate clear priority of our novel method.

Identificador

http://espace.library.uq.edu.au/view/UQ:101946

Idioma(s)

eng

Publicador

Springer-Verlag Berlin

Palavras-Chave #E1 #280108 Database Management #700103 Information processing services
Tipo

Conference Paper