Modelling High-Dimensional Data by Mixtures of Factor Analyzers


Autoria(s): McLachlan, G. J.; Peel, D.; Bean, R. W.
Contribuinte(s)

S. P. Azen

E. Kontoghiorghes

Data(s)

28/01/2003

Resumo

We focus on mixtures of factor analyzers from the perspective of a method for model-based density estimation from high-dimensional data, and hence for the clustering of such data. This approach enables a normal mixture model to be fitted to a sample of n data points of dimension p, where p is large relative to n. The number of free parameters is controlled through the dimension of the latent factor space. By working in this reduced space, it allows a model for each component-covariance matrix with complexity lying between that of the isotropic and full covariance structure models. We shall illustrate the use of mixtures of factor analyzers in a practical example that considers the clustering of cell lines on the basis of gene expressions from microarray experiments. (C) 2002 Elsevier Science B.V. All rights reserved.

Identificador

http://espace.library.uq.edu.au/view/UQ:64331

Idioma(s)

eng

Publicador

Elsevier Science

Palavras-Chave #Computer Science, Interdisciplinary Applications #Mathematics, Applied #Statistics & Probability #Mixture Modelling #Factor Analyzers #Em Algorithm #Clustering Analysis #Maximum-likelihood #Principal #C1 #230204 Applied Statistics #780101 Mathematical sciences #0104 Statistics
Tipo

Journal Article