Efficient bulk-loading on dynamic metric access methods


Autoria(s): VESPA, Thiago G.; TRAINA JR., Caetano; TRAINA, Agma J.
Contribuinte(s)

UNIVERSIDADE DE SÃO PAULO

Data(s)

20/10/2012

20/10/2012

2010

Resumo

This paper presents a new technique and two algorithms to bulk-load data into multi-way dynamic metric access methods, based on the covering radius of representative elements employed to organize data in hierarchical data structures. The proposed algorithms are sample-based, and they always build a valid and height-balanced tree. We compare the proposed algorithm with existing ones, showing the behavior to bulk-load data into the Slim-tree metric access method. After having identified the worst case of our first algorithm, we describe adequate counteractions in an elegant way creating the second algorithm. Experiments performed to evaluate their performance show that our bulk-loading methods build trees faster than the sequential insertion method regarding construction time, and that it also significantly improves search performance. (C) 2009 Elsevier B.V. All rights reserved.

Identificador

INFORMATION SYSTEMS, v.35, n.5, Special Issue, p.557-569, 2010

0306-4379

http://producao.usp.br/handle/BDPI/28754

10.1016/j.is.2009.07.002

http://dx.doi.org/10.1016/j.is.2009.07.002

Idioma(s)

eng

Publicador

PERGAMON-ELSEVIER SCIENCE LTD

Relação

Information Systems

Direitos

restrictedAccess

Copyright PERGAMON-ELSEVIER SCIENCE LTD

Palavras-Chave #Information storage and retrieval #Data structures #Searching and sorting #SLIM-TREES #PERFORMANCE #Computer Science, Information Systems
Tipo

article

proceedings paper

publishedVersion