A simple permutation test for clusteredness


Autoria(s): Greenacre, Michael
Contribuinte(s)

Universitat Pompeu Fabra. Departament d'Economia i Empresa

Data(s)

26/11/2012

Resumo

Hierarchical clustering is a popular method for finding structure in multivariate data,resulting in a binary tree constructed on the particular objects of the study, usually samplingunits. The user faces the decision where to cut the binary tree in order to determine the numberof clusters to interpret and there are various ad hoc rules for arriving at a decision. A simplepermutation test is presented that diagnoses whether non-random levels of clustering are presentin the set of objects and, if so, indicates the specific level at which the tree can be cut. The test isvalidated against random matrices to verify the type I error probability and a power study isperformed on data sets with known clusteredness to study the type II error.

Identificador

http://hdl.handle.net/10230/19856

Idioma(s)

eng

Direitos

L'accés als continguts d'aquest document queda condicionat a l'acceptació de les condicions d'ús establertes per la següent llicència Creative Commons

info:eu-repo/semantics/openAccess

<a href="http://creativecommons.org/licenses/by-nc-nd/3.0/es/">http://creativecommons.org/licenses/by-nc-nd/3.0/es/</a>

Palavras-Chave #Statistics, Econometrics and Quantitative Methods #hierarchical clustering #distance #permutation test
Tipo

info:eu-repo/semantics/workingPaper