Biblioteca Digital

941 resultados para Multivariate Statistics

A simple permutation test for clusteredness

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Hierarchical clustering is a popular method for finding structure in multivariate data,resulting in a binary tree constructed on the particular objects of the study, usually samplingunits. The user faces the decision where to cut the binary tree in order to determine the numberof clusters to interpret and there are various ad hoc rules for arriving at a decision. A simplepermutation test is presented that diagnoses whether non-random levels of clustering are presentin the set of objects and, if so, indicates the specific level at which the tree can be cut. The test isvalidated against random matrices to verify the type I error probability and a power study isperformed on data sets with known clusteredness to study the type II error.

The impact of "early" nineteenth-century globalization on foreign trade in the Southern Cone: A study of British trade statistics

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper deals with the impact of "early" nineteenth-century globalization (c.1815-1860) on foreign trade in the Southern Cone (SC). Most of the evidence is drawn from bilateral trades between Britain and the SC, at a time when Britain was the main commercial partner of the new republics. The main conclusion drawn is that early globalization had a positive impact on foreign trade in the SC, and this was due to: improvements in the SC's terms of trade during this period; the SC's per capita consumption of textiles (the main manufacture traded on world markets at that time) increased substantially during this period, at a time when clothing was one of the main items of SC household budgets; British merchants brought with them capital, shipping, insurance, and also facilitated the formation of vast global networks, which further promoted the SC's exports to a wider range of outlets.

FSTAT (Version 1.2): A computer program to calculate F-statistics

Relevância:

20.00% 20.00%

Publicador:

Asymptotic robust inferences in the analysis of mean and covariance structures

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Structural equation models are widely used in economic, socialand behavioral studies to analyze linear interrelationships amongvariables, some of which may be unobservable or subject to measurementerror. Alternative estimation methods that exploit different distributionalassumptions are now available. The present paper deals with issues ofasymptotic statistical inferences, such as the evaluation of standarderrors of estimates and chi--square goodness--of--fit statistics,in the general context of mean and covariance structures. The emphasisis on drawing correct statistical inferences regardless of thedistribution of the data and the method of estimation employed. A(distribution--free) consistent estimate of $\Gamma$, the matrix ofasymptotic variances of the vector of sample second--order moments,will be used to compute robust standard errors and a robust chi--squaregoodness--of--fit squares. Simple modifications of the usual estimateof $\Gamma$ will also permit correct inferences in the case of multi--stage complex samples. We will also discuss the conditions under which,regardless of the distribution of the data, one can rely on the usual(non--robust) inferential statistics. Finally, a multivariate regressionmodel with errors--in--variables will be used to illustrate, by meansof simulated data, various theoretical aspects of the paper.

Geometric and long run aspects of Granger causality

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper extends multivariate Granger causality to take into account the subspacesalong which Granger causality occurs as well as long run Granger causality. The propertiesof these new notions of Granger causality, along with the requisite restrictions, are derivedand extensively studied for a wide variety of time series processes including linear invertibleprocess and VARMA. Using the proposed extensions, the paper demonstrates that: (i) meanreversion in L2 is an instance of long run Granger non-causality, (ii) cointegration is a specialcase of long run Granger non-causality along a subspace, (iii) controllability is a special caseof Granger causality, and finally (iv) linear rational expectations entail (possibly testable)Granger causality restriction along subspaces.

Unfolding a symmetric matrix

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Graphical displays which show inter--sample distances are importantfor the interpretation and presentation of multivariate data. Except whenthe displays are two--dimensional, however, they are often difficult tovisualize as a whole. A device, based on multidimensional unfolding, isdescribed for presenting some intrinsically high--dimensional displays infewer, usually two, dimensions. This goal is achieved by representing eachsample by a pair of points, say $R_i$ and $r_i$, so that a theoreticaldistance between the $i$-th and $j$-th samples is represented twice, onceby the distance between $R_i$ and $r_j$ and once by the distance between$R_j$ and $r_i$. Self--distances between $R_i$ and $r_i$ need not be zero.The mathematical conditions for unfolding to exhibit symmetry are established.Algorithms for finding approximate fits, not constrained to be symmetric,are discussed and some examples are given.

Combined analysis of grey matter voxel-based morphometry and white matter tract-based spatial statistics in late-life bipolar disorder.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Background: Previous magnetic resonance imaging (MRI) studies in young patients with bipolar disorder indicated the presence of grey matter concentration changes as well as microstructural alterations in white matter in various neocortical areas and the corpus callosum. Whether these structural changes are also present in elderly patients with bipolar disorder with long-lasting clinical evolution remains unclear. Methods: We performed a prospective MRI study of consecutive elderly, euthymic patients with bipolar disorder and healthy, elderly controls. We conducted a voxel-based morphometry (VBM) analysis and a tract-based spatial statistics (TBSS) analysis to assess fractional anisotropy and longitudinal, radial and mean diffusivity derived by diffusion tensor imaging (DTI). Results: We included 19 patients with bipolar disorder and 47 controls in our study. Fractional anisotropy was the most sensitive DTI marker and decreased significantly in the ventral part of the corpus callosum in patients with bipolar disorder. Longitudinal, radial and mean diffusivity showed no significant between-group differences. Grey matter concentration was reduced in patients with bipolar disorder in the right anterior insula, head of the caudate nucleus, nucleus accumbens, ventral putamen and frontal orbital cortex. Conversely, there was no grey matter concentration or fractional anisotropy increase in any brain region in patients with bipolar disorder compared with controls. Limitations: The major limitation of our study is the small number of patients with bipolar disorder. Conclusion: Our data document the concomitant presence of grey matter concentration decreases in the anterior limbic areas and the reduced fibre tract coherence in the corpus callosum of elderly patients with long-lasting bipolar disorder.

A U-shaped association between intensity of Internet use and adolescent health.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

OBJECTIVE: To examine the relationship between different Internet-use intensities and adolescent mental and somatic health. METHODS: Data were drawn from the 2002 Swiss Multicenter Adolescent Survey on Health, a nationally representative survey of adolescents aged 16 to 20 years in post-mandatory school. From a self-administered anonymous questionnaire, 3906 adolescent boys and 3305 girls were categorized into 4 groups according to their intensity of Internet use: heavy Internet users (HIUs; >2 hours/day), regular Internet users (RIUs; several days per week and <= 2 hours/day), occasional users (<= 1 hour/week), and non-Internet users (NIUs; no use in the previous month). Health factors examined were perceived health, depression, overweight, headaches and back pain, and insufficient sleep. RESULTS: In controlled multivariate analysis, using RIUs as a reference, HIUs of both genders were more likely to report higher depressive scores, whereas only male users were found at increased risk of overweight and female users at increased risk of insufficient sleep. Male NIUs and female NIUs and occasional users also were found at increased risk of higher depressive scores. Back-pain complaints were found predominantly among male NIUs. CONCLUSIONS: Our study provides evidence of a U-shaped relationship between intensity of Internet use and poorer mental health of adolescents. In addition, HIUs were confirmed at increased risk for somatic health problems. Thus, health professionals should be on the alert when caring for adolescents who report either heavy Internet use or very little/none. Also, they should consider regular Internet use as a normative behavior without major health consequence. Pediatrics 2011;127:e330-e335

Vital Statistics of Iowa in Brief, 2006

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This publication is an historical recording of the most requested statistics on vital events and is a source of information that can be used in further analysis.

Combining multivariate density forecasts using predictive criteria

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper combines multivariate density forecasts of output growth, inflationand interest rates from a suite of models. An out-of-sample weighting scheme based onthe predictive likelihood as proposed by Eklund and Karlsson (2005) and Andersson andKarlsson (2007) is used to combine the models. Three classes of models are considered: aBayesian vector autoregression (BVAR), a factor-augmented vector autoregression (FAVAR)and a medium-scale dynamic stochastic general equilibrium (DSGE) model. Using Australiandata, we find that, at short forecast horizons, the Bayesian VAR model is assignedthe most weight, while at intermediate and longer horizons the factor model is preferred.The DSGE model is assigned little weight at all horizons, a result that can be attributedto the DSGE model producing density forecasts that are very wide when compared withthe actual distribution of observations. While a density forecast evaluation exercise revealslittle formal evidence that the optimally combined densities are superior to those from thebest-performing individual model, or a simple equal-weighting scheme, this may be a resultof the short sample available.

Distributional equivalence and subcompositional coherence in the analysis of contingency tables, ratio-scale measurements and compositional data

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We consider two fundamental properties in the analysis of two-way tables of positive data: the principle of distributional equivalence, one of the cornerstones of correspondence analysis of contingency tables, and the principle of subcompositional coherence, which forms the basis of compositional data analysis. For an analysis to be subcompositionally coherent, it suffices to analyse the ratios of the data values. The usual approach to dimension reduction in compositional data analysis is to perform principal component analysis on the logarithms of ratios, but this method does not obey the principle of distributional equivalence. We show that by introducing weights for the rows and columns, the method achieves this desirable property. This weighted log-ratio analysis is theoretically equivalent to spectral mapping , a multivariate method developed almost 30 years ago for displaying ratio-scale data from biological activity spectra. The close relationship between spectral mapping and correspondence analysis is also explained, as well as their connection with association modelling. The weighted log-ratio methodology is applied here to frequency data in linguistics and to chemical compositional data in archaeology.

Multiple correspondence analysis of a subset of response categories

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In the analysis of multivariate categorical data, typically the analysis of questionnaire data, it is often advantageous, for substantive and technical reasons, to analyse a subset of response categories. In multiple correspondence analysis, where each category is coded as a column of an indicator matrix or row and column of Burt matrix, it is not correct to simply analyse the corresponding submatrix of data, since the whole geometric structure is different for the submatrix . A simple modification of the correspondence analysis algorithm allows the overall geometric structure of the complete data set to be retained while calculating the solution for the selected subset of points. This strategy is useful for analysing patterns of response amongst any subset of categories and relating these patterns to demographic factors, especially for studying patterns of particular responses such as missing and neutral responses. The methodology is illustrated using data from the International Social Survey Program on Family and Changing Gender Roles in 1994.

Principal curves and principal oriented points

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Principal curves have been defined Hastie and Stuetzle (JASA, 1989) assmooth curves passing through the middle of a multidimensional dataset. They are nonlinear generalizations of the first principalcomponent, a characterization of which is the basis for the principalcurves definition.In this paper we propose an alternative approach based on a differentproperty of principal components. Consider a point in the space wherea multivariate normal is defined and, for each hyperplane containingthat point, compute the total variance of the normal distributionconditioned to belong to that hyperplane. Choose now the hyperplaneminimizing this conditional total variance and look for thecorresponding conditional mean. The first principal component of theoriginal distribution passes by this conditional mean and it isorthogonal to that hyperplane. This property is easily generalized todata sets with nonlinear structure. Repeating the search from differentstarting points, many points analogous to conditional means are found.We call them principal oriented points. When a one-dimensional curveruns the set of these special points it is called principal curve oforiented points. Successive principal curves are recursively definedfrom a generalization of the total variance.

Biplots of fuzzy coded data

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A biplot, which is the multivariate generalization of the two-variable scatterplot, can be used to visualize the results of many multivariate techniques, especially those that are based on the singular value decomposition. We consider data sets consisting of continuous-scale measurements, their fuzzy coding and the biplots that visualize them, using a fuzzy version of multiple correspondence analysis. Of special interest is the way quality of fit of the biplot is measured, since it is well-known that regular (i.e., crisp) multiple correspondence analysis seriously under-estimates this measure. We show how the results of fuzzy multiple correspondence analysis can be defuzzified to obtain estimated values of the original data, and prove that this implies an orthogonal decomposition of variance. This permits a measure of fit to be calculated in the familiar form of a percentage of explained variance, which is directly comparable to the corresponding fit measure used in principal component analysis of the original data. The approach is motivated initially by its application to a simulated data set, showing how the fuzzy approach can lead to diagnosing nonlinear relationships, and finally it is applied to a real set of meteorological data.

Geographical deviations in foreign trade statistics: A study into European trade with Latin American Countries, 1925

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We have analyzed the spatial accuracy of European foreign trade statistics compared to Latin American. We have also included USA s data because of the importance of this country in Latin American trade. We have developed a method for mapping discrepancies between exporters and importers, trying to isolate systematic spatial deviations. Although our results don t allow a unique explanation, they present some interesting clues to the distribution channels in the Latin American Continent as well as some spatial deviations for statistics in individual countries. Connecting our results with the literature specialized in the accuracy of foreign trade statistics; we can revisit Morgernstern (1963) as well as Federico and Tena (1991). Morgernstern had had a really pessimistic view on the reliability of this statistic source, but his main alert was focused on the trade balances, not in gross export or import values. Federico and Tena (1991) have demonstrated howaccuracy increases by aggregation, geographical and of product at the same time. But they still have a pessimistic view with relation to distribution questions, remarking that perhaps it will be more accurate to use import sources in this latest case. We have stated that the data set coming from foreign trade statistics for a sample in 1925, being it exporters or importers, it s a valuable tool for geography of trade patterns, although in some specific cases it needs some spatial adjustments.

«
1
2
...
23
24
25
26
27
28
29
...
62
63
»