Robust clustering of multi-type relational data via a heterogeneous manifold ensemble


Autoria(s): Hou, Jun; Nayak, Richi
Data(s)

01/04/2015

Resumo

High-Order Co-Clustering (HOCC) methods have attracted high attention in recent years because of their ability to cluster multiple types of objects simultaneously using all available information. During the clustering process, HOCC methods exploit object co-occurrence information, i.e., inter-type relationships amongst different types of objects as well as object affinity information, i.e., intra-type relationships amongst the same types of objects. However, it is difficult to learn accurate intra-type relationships in the presence of noise and outliers. Existing HOCC methods consider the p nearest neighbours based on Euclidean distance for the intra-type relationships, which leads to incomplete and inaccurate intra-type relationships. In this paper, we propose a novel HOCC method that incorporates multiple subspace learning with a heterogeneous manifold ensemble to learn complete and accurate intra-type relationships. Multiple subspace learning reconstructs the similarity between any pair of objects that belong to the same subspace. The heterogeneous manifold ensemble is created based on two-types of intra-type relationships learnt using p-nearest-neighbour graph and multiple subspaces learning. Moreover, in order to make sure the robustness of clustering process, we introduce a sparse error matrix into matrix decomposition and develop a novel iterative algorithm. Empirical experiments show that the proposed method achieves improved results over the state-of-art HOCC methods for FScore and NMI.

Formato

application/pdf

Identificador

http://eprints.qut.edu.au/79575/

Relação

http://eprints.qut.edu.au/79575/1/ICDE15_research_469.pdf

Hou, Jun & Nayak, Richi (2015) Robust clustering of multi-type relational data via a heterogeneous manifold ensemble. In The 31st International Conference on Data Engineering (ICDE 2015), 13 - 17 April 2015, Soul, Korea.

Direitos

Copyright 2015 IEEE

Fonte

School of Electrical Engineering & Computer Science; Science & Engineering Faculty

Palavras-Chave #080109 Pattern Recognition and Data Mining #Clustering #Multi-type relational data #Multiple subspace learning #Heterogeneous manifold ensemble #Optimization
Tipo

Conference Paper