Quality indices for (practical) clustering evaluation


Autoria(s): CARDOSO, Margarida G. M. S.; CARVALHO, Andre Ponce de Leon F. de
Contribuinte(s)

UNIVERSIDADE DE SÃO PAULO

Data(s)

20/10/2012

20/10/2012

2009

Resumo

Clustering quality or validation indices allow the evaluation of the quality of clustering in order to support the selection of a specific partition or clustering structure in its natural unsupervised environment, where the real solution is unknown or not available. In this paper, we investigate the use of quality indices mostly based on the concepts of clusters` compactness and separation, for the evaluation of clustering results (partitions in particular). This work intends to offer a general perspective regarding the appropriate use of quality indices for the purpose of clustering evaluation. After presenting some commonly used indices, as well as indices recently proposed in the literature, key issues regarding the practical use of quality indices are addressed. A general methodological approach is presented which considers the identification of appropriate indices thresholds. This general approach is compared with the simple use of quality indices for evaluating a clustering solution.

Identificador

INTELLIGENT DATA ANALYSIS, v.13, n.5, p.725-740, 2009

1088-467X

http://producao.usp.br/handle/BDPI/28785

10.3233/IDA-2009-0390

http://dx.doi.org/10.3233/IDA-2009-0390

Idioma(s)

eng

Publicador

IOS PRESS

Relação

Intelligent Data Analysis

Direitos

restrictedAccess

Copyright IOS PRESS

Palavras-Chave #Cluster validation #validation indices #quality indices #clustering #GENE-EXPRESSION DATA #DATA SET #VALIDATION INDEX #MODEL #VALIDITY #NUMBER #Computer Science, Artificial Intelligence
Tipo

article

original article

publishedVersion