Approximate Membership Localization within a Web-Based Join Framework


Autoria(s): Li, Zhixu; Sitbon, Laurianne; Wang, Liwei; Zhou, Xiaofang; Du, Xiaoyong
Data(s)

2010

Resumo

In this paper, we propose a search-based approach to join two tables in the absence of clean join attributes. Non-structured documents from the web are used to express the correlations between a given query and a reference list. To implement this approach, a major challenge we meet is how to efficiently determine the number of times and the locations of each clean reference from the reference list that is approximately mentioned in the retrieved documents. We formalize the Approximate Membership Localization (AML) problem and propose an efficient partial pruning algorithm to solve it. A study using real-word data sets demonstrates the effectiveness of our search-based approach, and the efficiency of our AML algorithm.

Identificador

http://eprints.qut.edu.au/45631/

Publicador

ACM

Relação

DOI:10.1145/1871437.1871611

Li, Zhixu, Sitbon, Laurianne, Wang, Liwei, Zhou, Xiaofang, & Du, Xiaoyong (2010) Approximate Membership Localization within a Web-Based Join Framework. In Proceedings of the 19th ACM International Conference on Information and Knowledge Management, ACM.

Direitos

ACM

Fonte

Faculty of Science and Technology

Palavras-Chave #080600 INFORMATION SYSTEMS
Tipo

Conference Paper