Quality indices for (practical) clustering evaluation
Contribuinte(s) |
UNIVERSIDADE DE SÃO PAULO |
---|---|
Data(s) |
20/10/2012
20/10/2012
2009
|
Resumo |
Clustering quality or validation indices allow the evaluation of the quality of clustering in order to support the selection of a specific partition or clustering structure in its natural unsupervised environment, where the real solution is unknown or not available. In this paper, we investigate the use of quality indices mostly based on the concepts of clusters` compactness and separation, for the evaluation of clustering results (partitions in particular). This work intends to offer a general perspective regarding the appropriate use of quality indices for the purpose of clustering evaluation. After presenting some commonly used indices, as well as indices recently proposed in the literature, key issues regarding the practical use of quality indices are addressed. A general methodological approach is presented which considers the identification of appropriate indices thresholds. This general approach is compared with the simple use of quality indices for evaluating a clustering solution. |
Identificador |
INTELLIGENT DATA ANALYSIS, v.13, n.5, p.725-740, 2009 1088-467X http://producao.usp.br/handle/BDPI/28785 10.3233/IDA-2009-0390 |
Idioma(s) |
eng |
Publicador |
IOS PRESS |
Relação |
Intelligent Data Analysis |
Direitos |
restrictedAccess Copyright IOS PRESS |
Palavras-Chave | #Cluster validation #validation indices #quality indices #clustering #GENE-EXPRESSION DATA #DATA SET #VALIDATION INDEX #MODEL #VALIDITY #NUMBER #Computer Science, Artificial Intelligence |
Tipo |
article original article publishedVersion |