974 resultados para canonical correspondence analysis (CCA)


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Correspondence analysis has found extensive use in ecology, archeology, linguisticsand the social sciences as a method for visualizing the patterns of association in a table offrequencies or nonnegative ratio-scale data. Inherent to the method is the expression of the datain each row or each column relative to their respective totals, and it is these sets of relativevalues (called profiles) that are visualized. This relativization of the data makes perfect sensewhen the margins of the table represent samples from sub-populations of inherently differentsizes. But in some ecological applications sampling is performed on equal areas or equalvolumes so that the absolute levels of the observed occurrences may be of relevance, in whichcase relativization may not be required. In this paper we define the correspondence analysis ofthe raw unrelativized data and discuss its properties, comparing this new method to regularcorrespondence analysis and to a related variant of non-symmetric correspondence analysis.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The generalization of simple correspondence analysis, for two categorical variables, to multiple correspondence analysis where they may be three or more variables, is not straighforward, both from a mathematical and computational point of view. In this paper we detail the exact computational steps involved in performing a multiple correspondence analysis, including the special aspects of adjusting the principal inertias to correct the percentages of inertia, supplementary points and subset analysis. Furthermore, we give the algorithm for joint correspondence analysis where the cross-tabulations of all unique pairs of variables are analysed jointly. The code in the R language for every step of the computations is given, as well as the results of each computation.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In the analysis of multivariate categorical data, typically the analysis of questionnaire data, it is often advantageous, for substantive and technical reasons, to analyse a subset of response categories. In multiple correspondence analysis, where each category is coded as a column of an indicator matrix or row and column of Burt matrix, it is not correct to simply analyse the corresponding submatrix of data, since the whole geometric structure is different for the submatrix . A simple modification of the correspondence analysis algorithm allows the overall geometric structure of the complete data set to be retained while calculating the solution for the selected subset of points. This strategy is useful for analysing patterns of response amongst any subset of categories and relating these patterns to demographic factors, especially for studying patterns of particular responses such as missing and neutral responses. The methodology is illustrated using data from the International Social Survey Program on Family and Changing Gender Roles in 1994.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

A Method is offered that makes it possible to apply generalized canonicalcorrelations analysis (CANCOR) to two or more matrices of different row and column order. The new method optimizes the generalized canonical correlationanalysis objective by considering only the observed values. This is achieved byemploying selection matrices. We present and discuss fit measures to assessthe quality of the solutions. In a simulation study we assess the performance of our new method and compare it to an existing procedure called GENCOM,proposed by Green and Carroll. We find that our new method outperforms the GENCOM algorithm both with respect to model fit and recovery of the truestructure. Moreover, as our new method does not require any type of iteration itis easier to implement and requires less computation. We illustrate the methodby means of an example concerning the relative positions of the political parties inthe Netherlands based on provincial data.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Correspondence analysis is introduced in the brand associationliterature as an alternative tool to measure dominance, for theparticular case of free choice data. The method is also used to analysedifferences, or asymmetries, between brand-attribute associations whereattributes are associated with evoked brands, and brand-attributeassociations where brands are associated with the attributes. Anapplication to a sample of deodorants is used to illustrate the proposedmethodology.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The generalization of simple (two-variable) correspondence analysis to more than two categorical variables, commonly referred to as multiple correspondence analysis, is neither obvious nor well-defined. We present two alternative ways of generalizing correspondence analysis, one based on the quantification of the variables and intercorrelation relationships, and the other based on the geometric ideas of simple correspondence analysis. We propose a version of multiple correspondence analysis, with adjusted principal inertias, as the method of choice for the geometric definition, since it contains simple correspondence analysis as an exact special case, which is not the situation of the standard generalizations. We also clarify the issue of supplementary point representation and the properties of joint correspondence analysis, a method that visualizes all two-way relationships between the variables. The methodology is illustrated using data on attitudes to science from the International Social Survey Program on Environment in 1993.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The case of two transition tables is considered, that is two squareasymmetric matrices of frequencies where the rows and columns of thematrices are the same objects observed at three different timepoints. Different ways of visualizing the tables, either separatelyor jointly, are examined. We generalize an existing idea where asquare matrix is descomposed into symmetric and skew-symmetric partsto two matrices, leading to a decomposition into four components: (1)average symmetric, (2) average skew-symmetric, (3) symmetricdifference from average, and (4) skew-symmetric difference fromaverage. The method is illustrated with an artificial example and anexample using real data from a study of changing values over threegenerations.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Dual scaling of a subjects-by-objects table of dominance data (preferences,paired comparisons and successive categories data) has been contrasted with correspondence analysis, as if the two techniques were somehow different. In this note we show that dual scaling of dominance data is equivalent to the correspondence analysis of a table which is doubled with respect to subjects. We also show that the results of both methods can be recovered from a principal components analysis of the undoubled dominance table which is centred with respect to subject means.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

This paper presents findings from a study investigating a firm s ethical practices along the value chain. In so doing we attempt to better understand potential relationships between a firm s ethical stance with its customers and those of its suppliers within a supply chain and identify particular sectoral and cultural influences that might impinge on this. Drawing upon a database comprising of 667 industrial firms from 27 different countries, we found that ethical practices begin with the firm s relationship with its customers, the characteristics of which then influence the ethical stance with the firm s suppliers within the supply chain. Importantly, market structure along with some key cultural characteristics were also found to exert significant influence on the implementation of ethical policies in these firms.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

It is shown how correspondence analysis may be applied to a subset of response categories from a questionnaire survey, for example the subset of undecided responses or the subset of responses for a particular category. The idea is to maintain the original relative frequencies of the categories and not re-express them relative to totals within the subset, as would normally be done in a regular correspondence analysis of the subset. Furthermore, the masses and chi-square metric assigned to the data subset are the same as those in the correspondence analysis of the whole data set. This variant of the method, called Subset Correspondence Analysis, is illustrated on data from the ISSP survey on Family and Changing Gender Roles.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The objective of this work was to elevate gradient effect on diversity of Collembola, in a temperate forest on the northeast slope of Iztaccíhuatl Volcano, Mexico. Four expeditions were organized from November 2003 to August 2004, at four altitudes (2,753, 3,015, 3,250 and 3,687 m a.s.l.). In each site, air temperature, CO2 concentration, humidity, and terrain inclination were measured. The influence of abiotic factors on faunal composition was evaluated, at the four collecting sites, with canonical correspondence analyses (CCA). A total of 24,028 specimens were obtained, representing 12 families, 44 genera and 76 species. Mesaphorura phlorae, Proisotoma ca. tenella and Parisotoma ca. notabilis were the most abundant species. The highest diversity and evenness were recorded at 3,250 m (H' = 2.85; J' = 0.73). Canonical analyses axes 1 and 2 of the CCA explained 67.4% of the variance in species composition, with CO2 and altitude best explaining axis 1, while slope and humidity were better correlated to axis 2. The results showed that CO2 is an important factor to explain Collembola species assemblage, together with slope and humidity.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We compare correspondance análisis to the logratio approach based on compositional data. We also compare correspondance análisis and an alternative approach using Hellinger distance, for representing categorical data in a contingency table. We propose a coefficient which globally measures the similarity between these approaches. This coefficient can be decomposed into several components, one component for each principal dimension, indicating the contribution of the dimensions to the difference between the two representations. These three methods of representation can produce quite similar results. One illustrative example is given

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Starting with logratio biplots for compositional data, which are based on the principle of subcompositional coherence, and then adding weights, as in correspondence analysis, we rediscover Lewi's spectral map and many connections to analyses of two-way tables of non-negative data. Thanks to the weighting, the method also achieves the property of distributional equivalence

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The response of zooplankton assemblages to variations in the water quality of four man-made lakes, caused by eutrophication and siltation, was investigated by means of canonical correspondence analysis. Monte Carlo simulations using the CCA eingenvalues as test statistics revealed that changes in zooplankton species composition along the environmental gradients of trophic state and abiogenic turbidity were highly significant. The species Brachionus calyciflorus, Thermocyclops sp. and Argyrodiaptomus sp. were good indicators of eutrophic conditions while the species Brachionus dolabratus, Keratella tropica and Hexarthra sp. were good indicators of high turbidity due to suspended sediments. The rotifer genus Brachionus was the most species-rich taxon, comprising five species which were associated with different environmental conditions. Therefore, we tested whether this genus alone could potentially be a better biological indicator of these environmental gradients than the entire zooplankton assemblages or any other random set of five species. The ordination results show that the five Brachionus species alone did not explain better the observed pattern of environmental variation than most random sets of five species. Therefore, this genus could not be selected as a target taxon for more intensive environmental monitoring as has been previously suggested by Attayde and Bozelli (1998). Overall, our results show that changes in the water quality of man-made lakes in a tropical semi-arid region have significant effects on the structure of zooplankton assemblages that can potentially affect the functioning of these ecosystems