42 resultados para probabilistic principal component analysis (probabilistic PCA)
Resumo:
We consider the joint visualization of two matrices which have common rowsand columns, for example multivariate data observed at two time pointsor split accord-ing to a dichotomous variable. Methods of interest includeprincipal components analysis for interval-scaled data, or correspondenceanalysis for frequency data or ratio-scaled variables on commensuratescales. A simple result in matrix algebra shows that by setting up thematrices in a particular block format, matrix sum and difference componentscan be visualized. The case when we have more than two matrices is alsodiscussed and the methodology is applied to data from the InternationalSocial Survey Program.
Resumo:
Dual scaling of a subjects-by-objects table of dominance data (preferences,paired comparisons and successive categories data) has been contrasted with correspondence analysis, as if the two techniques were somehow different. In this note we show that dual scaling of dominance data is equivalent to the correspondence analysis of a table which is doubled with respect to subjects. We also show that the results of both methods can be recovered from a principal components analysis of the undoubled dominance table which is centred with respect to subject means.
Resumo:
The correlation between the species composition of pasture communities and soil properties in Plana de Vic has been studied using two multivariate methods, Correspondence Analysis (CA) for the vegetation data and Principal Component Analysis (PCA) for the soil data. To analyse the pastures, we took 144 vegetation relevés (comprising 201 species) that have been classified into 10 phytocoenological communities elsewhere. Most of these communities are almost entirely built up by perennials, ranging from xerophilous, clearly Mediterranean, to mesophilous, related to medium-European pastures, but a few occurring in shallow soils are dominated by therophytes. As for the soil properties, we analysed texture, pH, depth, bulk density, organic matter, C/N ratio and the carbonates content of 25 samples, correspondingto representative relevés of the communities studied.
Resumo:
We performed a spatiotemporal analysis of a network of 21 Scots pine (Pinus sylvestris) ring-width chronologies in northern Fennoscandia by means of chronology statistics and multivariate analyses. Chronologies are located on both sides (western and eastern) of the Scandes Mountains (67°N-70°N, 15°E-29°E). Growth relationships with temperature, precipitation, and North Atlantic Oscillation (NAO) indices were calculated for the period 1880-1991. We also assessed their temporal stability. Current July temperature and, to a lesser degree, May precipitation are the main growth limiting factors in the whole area of study. However, Principal Component Analysis (PCA) and mean interseries correlation revealed differences in radial growth between both sides of the Scandes Mountains, attributed to the Oceanic-Continental climatic gradient in the area. The gradient signal is temporally variable and has strengthened during the second half of the 20th century. Northern Fennoscandia Scots pine growth is positively related to early winter NAO indices previous to the growth season and to late spring NAO. NAO/growth relationships are unstable and have dropped in the second half of the 20th century. Moreover, they are noncontinuous through the range of NAO values: for early winter, only positive NAO indices enhance tree growth in the next growing season, while negative NAO does not. For spring, only negative NAO is correlated with radial growth.
Resumo:
In this present work, we are proposing a characteristics reduction system for a facial biometric identification system, using transformed domains such as discrete cosine transformed (DCT) and discrete wavelets transformed (DWT) as parameterization; and Support Vector Machines (SVM) and Neural Network (NN) as classifiers. The size reduction has been done with Principal Component Analysis (PCA) and with Independent Component Analysis (ICA). This system presents a similar success results for both DWT-SVM system and DWT-PCA-SVM system, about 98%. The computational load is improved on training mode due to the decreasing of input’s size and less complexity of the classifier.
Resumo:
Objective: To compare lower incisor dentoalveolar compensation and mandible symphysis morphology among Class I and Class III malocclusion patients with different facial vertical skeletal patterns. Materials and Methods: Lower incisor extrusion and inclination, as well as buccal (LA) and lingual (LP) cortex depth, and mandibular symphysis height (LH) were measured in 107 lateral cephalometric x-rays of adult patients without prior orthodontic treatment. In addition, malocclusion type (Class I or III) and facial vertical skeletal pattern were considered. Through a principal component analysis (PCA) related variables were reduced. Simple regression equation and multivariate analyses of variance were also used. Results: Incisor mandibular plane angle (P < .001) and extrusion (P = .03) values showed significant differences between the sagittal malocclusion groups. Variations in the mandibular plane have a negative correlation with LA (Class I P = .03 and Class III P = .01) and a positive correlation with LH (Class I P = .01 and Class III P = .02) in both groups. Within the Class III group, there was a negative correlation between the mandibular plane and LP (P = .02). PCA showed that the tendency toward a long face causes the symphysis to elongate and narrow. In Class III, alveolar narrowing is also found in normal faces. Conclusions: Vertical facial pattern is a significant factor in mandibular symphysis alveolar morphology and lower incisor positioning, both for Class I and Class III patients. Short-faced Class III patients have a widened alveolar bone. However, for long-faced and normal-faced Class III, natural compensation elongates the symphysis and influences lower incisor position.
Resumo:
In recent years there has been growing interest in composite indicators as an efficient tool of analysis and a method of prioritizing policies. This paper presents a composite index of intermediary determinants of child health using a multivariate statistical approach. The index shows how specific determinants of child health vary across Colombian departments (administrative subdivisions). We used data collected from the 2010 Colombian Demographic and Health Survey (DHS) for 32 departments and the capital city, Bogotá. Adapting the conceptual framework of Commission on Social Determinants of Health (CSDH), five dimensions related to child health are represented in the index: material circumstances, behavioural factors, psychosocial factors, biological factors and the health system. In order to generate the weight of the variables, and taking into account the discrete nature of the data, principal component analysis (PCA) using polychoric correlations was employed in constructing the index. From this method five principal components were selected. The index was estimated using a weighted average of the retained components. A hierarchical cluster analysis was also carried out. The results show that the biggest differences in intermediary determinants of child health are associated with health care before and during delivery.
Resumo:
This paper presents a composite index of early childhood health using a multivariate statistical approach. The index shows how child health varies across Colombian departments, -administrative subdivisions-. In recent years there has been growing interest in composite indicators as an efficient analysis tool and a way of prioritizing policies. These indicators not only enable multi-dimensional phenomena to be simplified but also make it easier to measure, visualize, monitor and compare a country’s performance in particular issues. We used data collected from the Colombian Demographic and Health Survey, DHS, for 32 departments and the capital city, Bogotá, in 2005 and 2010. The variables included in the index provide a measure of three dimensions related to child health: health status, health determinants and the health system. In order to generate the weight of the variables and take into account the discrete nature of the data, we employed a principal component analysis, PCA, using polychoric correlation. From this method, five principal components were selected. The index was estimated using a weighted average of the components retained. A hierarchical cluster analysis was also carried out. We observed that the departments ranking in the lowest positions are located on the Colombian periphery. They are departments with low per capita incomes and they present critical social indicators. The results suggest that the regional disparities in child health may be associated with differences in parental characteristics, household conditions and economic development levels, which makes clear the importance of context in the study of child health in Colombia.
Resumo:
This study presents a catalogue of synoptic patterns of torrential rainfall in northeast of the Iberian Peninsula (IP). These circulation patterns were obtained by applying a T-mode Principal Component Analysis (PCA) to a daily data grid (NCEP/NCAR reanalysis) at sea level pressure (SLP). The analysis made use of 304 days which recorded >100 mm in one or more stations in provinces of Barcelona, Girona and Tarragona (coastland area of Catalonia) throughout the 1950-2005 period. The catalogue comprises 7 circulation patterns showing a great variety of atmospheric conditions and seasonal or monthly distribution. Likewise, we computed the mean index value of the Western Mediterranean Oscillation index (WeMOi) for the synoptic patterns obtained by averaging all days grouped in each pattern. The results showed a clear association between the negative values of this teleconnection index and torrential rainfall in northeast of the IP. We therefore put forward the WeMO as an essential tool for forecasting heavy rainfall in northeast of Spain
Resumo:
Psychometric analysis of the AF5 multidimensional scale of self-concept in a sample of adolescents and adults in Catalonia. The aim of this study is to carry out a psychometric study of the AF5 scale in a sample of 4.825 Catalan subjects from 11 to 63 years-old. They are students from secondary compulsory education (ESO), from high school, middle-level vocational training (CFGM) and from the university. Using a principal component analysis (PCA) the theoretical validity of the components is established and the reliability of the instrument is also analyzed. Differential analyses are performed by gender and normative group using a 2 6 factorial design. The normative group variable includes the different levels classifi ed into 6 sub-groups: university, post-compulsory secondary education (high school and CFGM), 4th of ESO, 3rd of ESO, 2nd of ESO and 1st of ESO. The results indicate that the reliability of the Catalan version of the scale is similar to the original scale. The factorial structure also fi ts with the original model established beforehand. Signifi cant differences by normative group in the four components of self-concept explored (social, family, academic/occupational and physical) are observed. By gender, signifi cant differences appear in the component of physical self-concept, academic and social but not in the family component
Resumo:
Three multivariate statistical tools (principal component analysis, factor analysis, analysis discriminant) have been tested to characterize and model the sags registered in distribution substations. Those models use several features to represent the magnitude, duration and unbalanced grade of sags. They have been obtained from voltage and current waveforms. The techniques are tested and compared using 69 registers of sags. The advantages and drawbacks of each technique are listed
Resumo:
A statistical method for classification of sags their origin downstream or upstream from the recording point is proposed in this work. The goal is to obtain a statistical model using the sag waveforms useful to characterise one type of sags and to discriminate them from the other type. This model is built on the basis of multi-way principal component analysis an later used to project the available registers in a new space with lower dimension. Thus, a case base of diagnosed sags is built in the projection space. Finally classification is done by comparing new sags against the existing in the case base. Similarity is defined in the projection space using a combination of distances to recover the nearest neighbours to the new sag. Finally the method assigns the origin of the new sag according to the origin of their neighbours
Resumo:
The work presented in this paper belongs to the power quality knowledge area and deals with the voltage sags in power transmission and distribution systems. Propagating throughout the power network, voltage sags can cause plenty of problems for domestic and industrial loads that can financially cost a lot. To impose penalties to responsible party and to improve monitoring and mitigation strategies, sags must be located in the power network. With such a worthwhile objective, this paper comes up with a new method for associating a sag waveform with its origin in transmission and distribution networks. It solves this problem through developing hybrid methods which hire multiway principal component analysis (MPCA) as a dimension reduction tool. MPCA reexpresses sag waveforms in a new subspace just in a few scores. We train some well-known classifiers with these scores and exploit them for classification of future sags. The capabilities of the proposed method for dimension reduction and classification are examined using the real data gathered from three substations in Catalonia, Spain. The obtained classification rates certify the goodness and powerfulness of the developed hybrid methods as brand-new tools for sag classification
Resumo:
Rho GTPases are conformational switches that control a wide variety of signaling pathways critical for eukaryotic cell development and proliferation. They represent attractive targets for drug design as their aberrant function and deregulated activity is associated with many human diseases including cancer. Extensive high-resolution structures (.100) and recent mutagenesis studies have laid the foundation for the design of new structure-based chemotherapeutic strategies. Although the inhibition of Rho signaling with drug-like compounds is an active area of current research, very little attention has been devoted to directly inhibiting Rho by targeting potential allosteric non-nucleotide binding sites. By avoiding the nucleotide binding site, compounds may minimize the potential for undesirable off-target interactions with other ubiquitous GTP and ATP binding proteins. Here we describe the application of molecular dynamics simulations, principal component analysis, sequence conservation analysis, and ensemble small-molecule fragment mapping to provide an extensive mapping of potential small-molecule binding pockets on Rho family members. Characterized sites include novel pockets in the vicinity of the conformationaly responsive switch regions as well as distal sites that appear to be related to the conformations of the nucleotide binding region. Furthermore the use of accelerated molecular dynamics simulation, an advanced sampling method that extends the accessible time-scale of conventional simulations, is found to enhance the characterization of novel binding sites when conformational changes are important for the protein mechanism.
Resumo:
Els avenços en tècniques de genotipat de polimorfismes genètics a gran escala estan liderant una revolució en el camp de l’epidemiologia genètica i la genètica de poblacions humanes. La informació aportada per aquestes tècniques ha evidenciat l’existència d’estructuracions poblacionals que poden augmentar l’error en els estudis d’associació a escala genòmica (GWAS, genome-wide association studies). Estudis recents han demostrat la presència d’aquestes estructuracions a nivell interregional i intrarregional a Europa. El present projecte ha avaluat el grau d’estructuració genètica en poblacions de la Península Ibèrica i altres regions del sudoest europeu (Itàlia i França) per quantificar l’impacte que aquesta potencial estructuració pot tenir en el disseny d’estudis d’associació GWAS i reconstruir la història demogràfica de les poblacions de la Mediterrània. Per aconseguir aquests objectius, s’han analitzat mostres de DNA de 770 individus de 26 poblacions de la Península Ibèrica, França, Itàlia i d’altres països de la Mediterrània. Aquestes mostres van ser genotipades per 240000 SNPs utilitzant l’array 250K StyI d’Affymetrix en el marc d’aquest projecte o mitjançant altres arrays d’Affymetrix en els projectes internacionals HapMap i POPRES. S’han realitzat anàlisis estadístiques incloent anàlisis de components principals, Fst, identitat per descendència, desequilibri de lligament, barreres genètiques, etc. Aquests resultats han permés construir un marc de referència de la variabilitat en aquesta regió, avaluar el seu impacte en estudis d’associació i proposar mesures per evitar l’increment de qualsevol tipus d’error (tipus I i II) en estudis nacionals i internacionals. A més, també han permés reconstruir la història de les poblacions humanes de la Mediterrània així com analitzar les seves relacions demogràfiques. Donada la duració limitada d’aquesta acció (24 mesos, d’octubre de 2010 a setembre de 2012), els resultats d’aquest projecte es troben actualment en fase de redacció i conduiran a diverses publicacions en revistes internacionals i a la preparació de comunicacions a congressos.