Reducing the bandwidth requirements of P2P keyword indexing


Autoria(s): Casey, John; Zhou, Wanlei
Data(s)

01/01/2005

Resumo

This paper describes the design and evaluation of a federated, peer-to-peer indexing system, which can be used to integrate the resources of local systems into a globally addressable index using a distributed hash table. The salient feature of the indexing systems design is the efficient dissemination of term-document indices using a combination of duplicate elimination, leaf set forwarding and conventional techniques such as aggressive index pruning, index compression, and batching. Together these indexing strategies help to reduce the number of RPC operations required to locate the nodes responsible for a section of the index, as well as the bandwidth utilization and the latency of the indexing service. Using empirical observation we evaluate the performance benefits of these cumulative optimizations and show that these design trade-offs can significantly improve indexing performance when using a distributed hash table. <br /><br />

Identificador

http://hdl.handle.net/10536/DRO/DU:30003129

Idioma(s)

eng

Publicador

Springer-Verlag

Relação

http://dro.deakin.edu.au/eserv/DU:30003129/n20050794.pdf

http://www.springerlink.com/content/5nx03kk7rlf6lubb/fulltext.pdf

Direitos

2005, Springer-Verlag

Tipo

Journal Article