Issues of robustness and high dimensionality in cluster analysis
Contribuinte(s) |
A. Rizzi M. Vichi |
---|---|
Data(s) |
01/01/2006
|
Resumo |
Finite mixture models are being increasingly used to model the distributions of a wide variety of random phenomena. While normal mixture models are often used to cluster data sets of continuous multivariate data, a more robust clustering can be obtained by considering the t mixture model-based approach. Mixtures of factor analyzers enable model-based density estimation to be undertaken for high-dimensional data where the number of observations n is very large relative to their dimension p. As the approach using the multivariate normal family of distributions is sensitive to outliers, it is more robust to adopt the multivariate t family for the component error and factor distributions. The computational aspects associated with robustness and high dimensionality in these approaches to cluster analysis are discussed and illustrated. |
Identificador | |
Idioma(s) |
eng |
Publicador |
Physica-Verlag |
Palavras-Chave | #E1 #230203 Statistical Theory #230204 Applied Statistics #780101 Mathematical sciences |
Tipo |
Conference Paper |