Enhancing the selection of a model-based clustering with external categorical variables


Autoria(s): Baudry, Jean-Patrick; Cardoso, Margarida G. M. S.; Celeux, Gilles; Amorim, Maria José de Pina da Cruz; Ferreira, Ana Sousa
Data(s)

25/08/2015

25/08/2015

01/06/2014

Resumo

In cluster analysis, it can be useful to interpret the partition built from the data in the light of external categorical variables which are not directly involved to cluster the data. An approach is proposed in the model-based clustering context to select a number of clusters which both fits the data well and takes advantage of the potential illustrative ability of the external variables. This approach makes use of the integrated joint likelihood of the data and the partitions at hand, namely the model-based partition and the partitions associated to the external variables. It is noteworthy that each mixture model is fitted by the maximum likelihood methodology to the data, excluding the external variables which are used to select a relevant mixture model only. Numerical experiments illustrate the promising behaviour of the derived criterion. © 2014 Springer-Verlag Berlin Heidelberg.

Identificador

BAUDRY, Jean-Patrick; [et al] – Enhancing the selection of a model-based clustering with external categorical variables. Advances in Data Analysis and Classification. ISSN: 1862-5347. (2014)

1862-5347

1862-5355

http://hdl.handle.net/10400.21/5005

10.1007/s11634-014-0177-3

Idioma(s)

eng

Publicador

Springer-Verlag Berlin Heidelberg

Relação

http://link.springer.com/article/10.1007%2Fs11634-014-0177-3

Direitos

closedAccess

Palavras-Chave #BIC #Categorical Variables #ICL #Mixed Type Variables Clustering #Mixture Models #Model-Based Clustering #Number of Clusters #Penalised Criteria
Tipo

article