BOSTER : an efficient algorithm for mining frequent unordered induced subtrees


Autoria(s): Chowdhury, Israt J.; Nayak, Richi
Contribuinte(s)

Benatallah, Boualem

Bestavros, Azer

Vakali, Athena

Data(s)

2014

Resumo

Extracting frequent subtrees from the tree structured data has important applications in Web mining. In this paper, we introduce a novel canonical form for rooted labelled unordered trees called the balanced-optimal-search canonical form (BOCF) that can handle the isomorphism problem efficiently. Using BOCF, we define a tree structure guided scheme based enumeration approach that systematically enumerates only the valid subtrees. Finally, we present the balanced optimal search tree miner (BOSTER) algorithm based on BOCF and the proposed enumeration approach, for finding frequent induced subtrees from a database of labelled rooted unordered trees. Experiments on the real datasets compare the efficiency of BOSTER over the two state-of-the-art algorithms for mining induced unordered subtrees, HybridTreeMiner and UNI3. The results are encouraging.

Formato

application/pdf

Identificador

http://eprints.qut.edu.au/78881/

Publicador

Springer International Publishing

Relação

http://eprints.qut.edu.au/78881/4/78881.pdf

DOI:10.1007/978-3-319-11749-2_12

Chowdhury, Israt J. & Nayak, Richi (2014) BOSTER : an efficient algorithm for mining frequent unordered induced subtrees. Lecture Notes in Computer Science : Web Information Systems Engineering – WISE 2014, 8786, pp. 146-155.

Direitos

Copyright 2014 Springer International Publishing Switzerland

The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-319-11749-2_12

Fonte

School of Electrical Engineering & Computer Science; Science & Engineering Faculty

Palavras-Chave #080000 INFORMATION AND COMPUTING SCIENCES #080100 ARTIFICIAL INTELLIGENCE AND IMAGE PROCESSING #080109 Pattern Recognition and Data Mining #anzsrc Australian and New Zealand Standard Research Class #Web mining #Frequent subtrees #Labelled rooted unordered trees #Induced subtrees #Canonical form #Enumeration approach
Tipo

Journal Article