Nonparametric Estimation of Data Dimensionality Prior to Data Compression: the case of the Human Development Index


Autoria(s): Canning, D.; French, Declan; Moore, M.
Data(s)

01/09/2013

Resumo

In many applications in applied statistics researchers reduce the complexity of a data set by combining a group of variables into a single measure using factor analysis or an index number. We argue that such compression loses information if the data actually has high dimensionality. We advocate the use of a non-parametric estimator, commonly used in physics (the Takens estimator), to estimate the correlation dimension of the data prior to compression. The advantage of this approach over traditional linear data compression approaches is that the data does not have to be linearized. Applying our ideas to the United Nations Human Development Index we find that the four variables that are used in its construction have dimension three and the index loses information.

Identificador

http://pure.qub.ac.uk/portal/en/publications/nonparametric-estimation-of-data-dimensionality-prior-to-data-compression-the-case-of-the-human-development-index(2e6ba4e5-466e-40f7-b3fc-b5f82ccc36ca).html

http://dx.doi.org/10.1080/02664763.2013.798629

Idioma(s)

eng

Direitos

info:eu-repo/semantics/restrictedAccess

Fonte

Canning , D , French , D & Moore , M 2013 , ' Nonparametric Estimation of Data Dimensionality Prior to Data Compression: the case of the Human Development Index ' Journal of Applied Statistics , vol 40 , no. 9 , pp. 1853-1863 . DOI: 10.1080/02664763.2013.798629

Palavras-Chave #development #well-being #dimension #measure #indicator #/dk/atira/pure/subjectarea/asjc/2600/2613 #Statistics and Probability #/dk/atira/pure/subjectarea/asjc/1800/1804 #Statistics, Probability and Uncertainty
Tipo

article