A novel method for finding similarities between unordered trees using matrix data model


Autoria(s): Chowdhury, Israt J.; Nayak, Richi
Contribuinte(s)

Lin, Xuemin

Manolopoulos, Yannis

Srivastava, Divesh

Huang, Guangyan

Data(s)

01/10/2013

Resumo

Trees are capable of portraying the semi-structured data which is common in web domain. Finding similarities between trees is mandatory for several applications that deal with semi-structured data. Existing similarity methods examine a pair of trees by comparing through nodes and paths of two trees, and find the similarity between them. However, these methods provide unfavorable results for unordered tree data and result in yielding NP-hard or MAX-SNP hard complexity. In this paper, we present a novel method that encodes a tree with an optimal traversing approach first, and then, utilizes it to model the tree with its equivalent matrix representation for finding similarity between unordered trees efficiently. Empirical analysis shows that the proposed method is able to achieve high accuracy even on the large data sets.

Formato

application/pdf

Identificador

http://eprints.qut.edu.au/64797/

Publicador

Springer

Relação

http://eprints.qut.edu.au/64797/1/I.J.Chowdhury%28WISE-2013%29.pdf

DOI:10.1007/978-3-642-41230-1_35

Chowdhury, Israt J. & Nayak, Richi (2013) A novel method for finding similarities between unordered trees using matrix data model. Lecture Notes in Computer Science, 8180, pp. 421-430.

Direitos

Copyright 2013 Springer-Verlag Berlin Heidelberg

The final publication is available at link.springer.com

Fonte

School of Electrical Engineering & Computer Science; Science & Engineering Faculty

Palavras-Chave #080000 INFORMATION AND COMPUTING SCIENCES #080100 ARTIFICIAL INTELLIGENCE AND IMAGE PROCESSING #080109 Pattern Recognition and Data Mining #Semi-structured Data #Unordered Tree #Similarity Measure #Matrix Representation
Tipo

Journal Article