Issues of robustness and high dimensionality in cluster analysis
| Contribuinte(s) |
A. Rizzi M. Vichi |
|---|---|
| Data(s) |
01/01/2006
|
| Resumo |
Finite mixture models are being increasingly used to model the distributions of a wide variety of random phenomena. While normal mixture models are often used to cluster data sets of continuous multivariate data, a more robust clustering can be obtained by considering the t mixture model-based approach. Mixtures of factor analyzers enable model-based density estimation to be undertaken for high-dimensional data where the number of observations n is very large relative to their dimension p. As the approach using the multivariate normal family of distributions is sensitive to outliers, it is more robust to adopt the multivariate t family for the component error and factor distributions. The computational aspects associated with robustness and high dimensionality in these approaches to cluster analysis are discussed and illustrated. |
| Identificador | |
| Idioma(s) |
eng |
| Publicador |
Physica-Verlag |
| Palavras-Chave | #E1 #230203 Statistical Theory #230204 Applied Statistics #780101 Mathematical sciences |
| Tipo |
Conference Paper |