Issues of robustness and high dimensionality in cluster analysis


Autoria(s): Basford, Kaye; McLachlan, Geoff; Bean, Richard
Contribuinte(s)

A. Rizzi

M. Vichi

Data(s)

01/01/2006

Resumo

Finite mixture models are being increasingly used to model the distributions of a wide variety of random phenomena. While normal mixture models are often used to cluster data sets of continuous multivariate data, a more robust clustering can be obtained by considering the t mixture model-based approach. Mixtures of factor analyzers enable model-based density estimation to be undertaken for high-dimensional data where the number of observations n is very large relative to their dimension p. As the approach using the multivariate normal family of distributions is sensitive to outliers, it is more robust to adopt the multivariate t family for the component error and factor distributions. The computational aspects associated with robustness and high dimensionality in these approaches to cluster analysis are discussed and illustrated.

Identificador

http://espace.library.uq.edu.au/view/UQ:104032

Idioma(s)

eng

Publicador

Physica-Verlag

Palavras-Chave #E1 #230203 Statistical Theory #230204 Applied Statistics #780101 Mathematical sciences
Tipo

Conference Paper