Probabilistic consensus clustering using evidence accumulation


Autoria(s): Lourenço, André; Bulo, Samuel Rota; Rebagliati, Nicola; Fred, Ana; Figueiredo, Mário; Pelillo, Marcello
Data(s)

25/02/2016

25/02/2016

01/01/2015

Resumo

Clustering ensemble methods produce a consensus partition of a set of data points by combining the results of a collection of base clustering algorithms. In the evidence accumulation clustering (EAC) paradigm, the clustering ensemble is transformed into a pairwise co-association matrix, thus avoiding the label correspondence problem, which is intrinsic to other clustering ensemble schemes. In this paper, we propose a consensus clustering approach based on the EAC paradigm, which is not limited to crisp partitions and fully exploits the nature of the co-association matrix. Our solution determines probabilistic assignments of data points to clusters by minimizing a Bregman divergence between the observed co-association frequencies and the corresponding co-occurrence probabilities expressed as functions of the unknown assignments. We additionally propose an optimization algorithm to find a solution under any double-convex Bregman divergence. Experiments on both synthetic and real benchmark data show the effectiveness of the proposed approach.

Identificador

LOURENÇO, André; [et al.] - Probabilistic consensus clustering using evidence accumulation. Machine Learning. ISSN. 0885-6125 Vol. 98, Nr. 1-2, SI, (2015), 331-357

0885-6125

1573-0565

http://hdl.handle.net/10400.21/5750

10.1007/s10994-013-5339-6

Idioma(s)

eng

Publicador

Springer

Relação

SI;

http://link.springer.com/article/10.1007%2Fs10994-013-5339-6

Direitos

closedAccess

Palavras-Chave #Consensus clustering #Evidence Accumulation #Ensemble clustering #Bregman divergence
Tipo

article