FreeS: A fast algorithm to discover frequent free subtrees using a novel canonical form
Contribuinte(s) |
Wang, Jianyong Cellary, Wojciech |
---|---|
Data(s) |
2015
|
Resumo |
Web data can often be represented in free tree form; however, free tree mining methods seldom exist. In this paper, a computationally fast algorithm FreeS is presented to discover all frequently occurring free subtrees in a database of labelled free trees. FreeS is designed using an optimal canonical form, BOCF that can uniquely represent free trees even during the presence of isomorphism. To avoid enumeration of false positive candidates, it utilises the enumeration approach based on a tree-structure guided scheme. This paper presents lemmas that introduce conditions to conform the generation of free tree candidates during enumeration. Empirical study using both real and synthetic datasets shows that FreeS is scalable and significantly outperforms (i.e. few orders of magnitude faster than) the state-of-the-art frequent free tree mining algorithms, HybridTreeMiner and FreeTreeMiner. |
Formato |
application/pdf |
Identificador | |
Publicador |
Springer |
Relação |
http://eprints.qut.edu.au/93069/8/93069.pdf DOI:10.1007/978-3-319-26190-4_9 Chowdhury, Israt J. & Nayak, Richi (2015) FreeS: A fast algorithm to discover frequent free subtrees using a novel canonical form. In Wang, Jianyong & Cellary, Wojciech (Eds.) Web Information Systems Engineering – WISE 2015, Springer, Miami, FL, pp. 123-137. |
Direitos |
2015 Springer International Publishing Switzerland |
Fonte |
School of Electrical Engineering & Computer Science; Science & Engineering Faculty |
Palavras-Chave | #Web data #Free tree #Canonical form #Enumeration approach #Graph mining |
Tipo |
Conference Paper |