Automatic aspect discrimination in data clustering


Autoria(s): Horta, Danilo; Campello, Ricardo José Gabrielli Barreto
Contribuinte(s)

UNIVERSIDADE DE SÃO PAULO

Data(s)

24/10/2013

24/10/2013

2012

Resumo

The attributes describing a data set may often be arranged in meaningful subsets, each of which corresponds to a different aspect of the data. An unsupervised algorithm (SCAD) that simultaneously performs fuzzy clustering and aspects weighting was proposed in the literature. However, SCAD may fail and halt given certain conditions. To fix this problem, its steps are modified and then reordered to reduce the number of parameters required to be set by the user. In this paper we prove that each step of the resulting algorithm, named ASCAD, globally minimizes its cost-function with respect to the argument being optimized. The asymptotic analysis of ASCAD leads to a time complexity which is the same as that of fuzzy c-means. A hard version of the algorithm and a novel validity criterion that considers aspect weights in order to estimate the number of clusters are also described. The proposed method is assessed over several artificial and real data sets.

CNPq

FAPESP

Identificador

PATTERN RECOGNITION, OXFORD, v. 45, n. 12, pp. 4370-4388, DEC, 2012

0031-3203

http://www.producao.usp.br/handle/BDPI/35875

10.1016/j.patcog.2012.05.011

http://dx.doi.org/10.1016/j.patcog.2012.05.011

Idioma(s)

eng

Publicador

ELSEVIER SCI LTD

OXFORD

Relação

PATTERN RECOGNITION

Direitos

restrictedAccess

Copyright ELSEVIER SCI LTD

Palavras-Chave #CLUSTERING #ASPECT DISCRIMINATION #ATTRIBUTE WEIGHTING #CLUSTER VALIDATION #FUZZY EXTENSION #RELATIONAL DATA #ALGORITHM #VALIDITY #CLASSIFICATION #AGGREGATION #VALIDATION #COMPLEXITY #CRITERION #INDEXES #COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE #ENGINEERING, ELECTRICAL & ELECTRONIC
Tipo

article

original article

publishedVersion