928 resultados para improved principal components analysis (IPCA) algorithm
Resumo:
This paper presents a composite index of early childhood health using a multivariate statistical approach. The index shows how child health varies across Colombian departments, -administrative subdivisions-. In recent years there has been growing interest in composite indicators as an efficient analysis tool and a way of prioritizing policies. These indicators not only enable multi-dimensional phenomena to be simplified but also make it easier to measure, visualize, monitor and compare a country’s performance in particular issues. We used data collected from the Colombian Demographic and Health Survey, DHS, for 32 departments and the capital city, Bogotá, in 2005 and 2010. The variables included in the index provide a measure of three dimensions related to child health: health status, health determinants and the health system. In order to generate the weight of the variables and take into account the discrete nature of the data, we employed a principal component analysis, PCA, using polychoric correlation. From this method, five principal components were selected. The index was estimated using a weighted average of the components retained. A hierarchical cluster analysis was also carried out. We observed that the departments ranking in the lowest positions are located on the Colombian periphery. They are departments with low per capita incomes and they present critical social indicators. The results suggest that the regional disparities in child health may be associated with differences in parental characteristics, household conditions and economic development levels, which makes clear the importance of context in the study of child health in Colombia.
Resumo:
In recent years, new analytical tools have allowed researchers to extract historical information contained in molecular data, which has fundamentally transformed our understanding of processes ruling biological invasions. However, the use of these new analytical tools has been largely restricted to studies of terrestrial organisms despite the growing recognition that the sea contains ecosystems that are amongst the most heavily affected by biological invasions, and that marine invasion histories are often remarkably complex. Here, we studied the routes of invasion and colonisation histories of an invasive marine invertebrate Microcosmus squamiger (Ascidiacea) using microsatellite loci, mitochondrial DNA sequence data and 11 worldwide populations. Discriminant analysis of principal components, clustering methods and approximate Bayesian computation (ABC) methods showed that the most likely source of the introduced populations was a single admixture event that involved populations from two genetically differentiated ancestral regions - the western and eastern coasts of Australia. The ABC analyses revealed that colonisation of the introduced range of M. squamiger consisted of a series of non-independent introductions along the coastlines of Africa, North America and Europe. Furthermore, we inferred that the sequence of colonisation across continents was in line with historical taxonomic records - first the Mediterranean Sea and South Africa from an unsampled ancestral population, followed by sequential introductions in California and, more recently, the NE Atlantic Ocean. We revealed the most likely invasion history for world populations of M. squamiger, which is broadly characterized by the presence of multiple ancestral sources and non-independent introductions within the introduced range. The results presented here illustrate the complexity of marine invasion routes and identify a cause-effect relationship between human-mediated transport and the success of widespread marine non-indigenous species, which benefit from stepping-stone invasions and admixture processes involving different sources for the spread and expansion of their range.
Resumo:
Hoitotyön laatu - lasten näkökulma Tämän kolmivaiheisen tutkimuksen tarkoituksena oli kuvailla lasten odotuksia ja arviointeja lasten hoitotyön laadusta sekä kehittää mittari kouluikäisille sairaalassa oleville lapsille laadun arviointiin. Perimmäisenä tavoitteena oli lasten hoitotyön laadun kehittäminen sairaalassa. Ensimmäisessä vaiheessa 20 alle kouluikäistä (4-6v) sekä 20 kouluikäistä (7-11v) lasta kuvailivat odotuksiaan lasten hoitotyön laadusta. Aineisto kerättiin haastattelulla ja lasten piirustusten avulla, sekä analysoitiin sisällön analyysilla. Lasten odotukset lasten hoitotyön laadusta kohdistuivat hoitajaan, hoitotyön toimintoihin ja ympäristöön, fyysinen ympäristö korostui piirustuksissa. Ensimmäisen vaiheen tulosten, aikaisemman kirjallisuuden sekä Leino-Kilven “HYVÄ HOITO” mittarin pohjalta kehitettiin “Lasten Hoidon Laatu Sairaalassa” (LHLS) mittari ja testattiin sen psykometrisiä ominaisuuksia tutkimuksen toisessa vaiheessa. Mittaria kehitettiin ja testattiin kolmen vaiheen kautta. Aluksi asiantuntijapaneeli (n=7) arvioi mittarin sisältöä. Seuraavaksi mittari esitestattiin kahdesti kouluikäisillä sairaalassa olevilla lapsilla (n=41 ja n=16), samassa vaiheessa myös viiden lastenosaston hoitajat (n=19) yhdessä arvioivat mittarin sisältöä sekä 8 lasta. Lopuksi mittaria testattiin kouluikäisillä lapsilla (n=388) sairaalassa sekä hoitajat (n=198) arvioivat mittarin sisällön validiteettia. Mittarin kehittämisen aikana päälaatuluokkien: hoitajan ominaisuudet, hoitotyön toiminnot ja hoitotyön ympäristö Cronbachin alfa kertoimet paranivat. Pääkomponentti analyysi tuki mittarin hoitotyön toimintojen ja ympäristön alaluokkien teoreettista rakennetta. Kolmannessa vaiheessa “Lasten Hoidon Laatu Sairaalassa” (LHLS III, versio neljä) mittarilla kerättiin aineisto Suomen yliopistosairaaloiden lastenosastoilta kouluikäisiltä 7-11 -vuotiailta lapsilta (n=388). Mittarin lopussa lapsia pyydettiin lisäksi kuvailemaan kivointa ja ikävintä kokemustaan sairaalahoidon aikana lauseen täydennystehtävänä. Aineisto analysoitiin tilastollisesti sekä sisällön analyysilla. Lapset arvioivat fyysisen hoitoympäristön, hoitajien inhimillisyyden ja luotettavuuden sekä huolenpidon ja vuorovaikutustoiminnot kiitettäviksi. Lapset arvioivat hoitajien viihdyttämistoiminnot kaikkein alhaisimmiksi. Lapsen ikä ja sairaalantulotapa olivat yhteydessä lasten saamaan tiedon määrään. Lasten kivoimmat kokemukset liittyivät ihmisiin ja heidän ominaisuuksiinsa, toimintoihin, ympäristöön sekä lopputuloksiin. Ikävimmät kokemukset liittyivät potilaana oloon, tuntemuksiin sairauden oireista sekä erossaoloon, hoitotyön fyysisiin toimintoihin sekä ympäristöön. Tutkimuksen tulokset osoittavat lasten olevan kykeneviä arvioimaan omaa hoitoaan ja heidän näkökulmansa tulisi nähdä osana koko laadun kehittämisprosessia parannettaessa laatua käytännössä todella lapsilähtöisemmällä lähestymistavalla. “Lasten Hoidon Laatu Sairaalassa” (LHLS) mittari on mahdollinen väline saada tietoa lasten arvioinneista lasten hoitotyön laadusta, mutta mittarin testaamista tulisi jatkaa tulevaisuudessa
Resumo:
A continuous random variable is expanded as a sum of a sequence of uncorrelated random variables. These variables are principal dimensions in continuous scaling on a distance function, as an extension of classic scaling on a distance matrix. For a particular distance, these dimensions are principal components. Then some properties are studied and an inequality is obtained. Diagonal expansions are considered from the same continuous scaling point of view, by means of the chi-square distance. The geometric dimension of a bivariate distribution is defined and illustrated with copulas. It is shown that the dimension can have the power of continuum.
Resumo:
Water quality was monitored at the upper course of the Rio das Velhas, a major tributary of the São Francisco basin located in the state of Minas Gerais, over an extension of 108 km from its source up to the limits with the Sabara district. Monitoring was done at 37 different sites over a period of 2 years (2003-2004) for 39 parameters. Multivariate statistical techniques were applied to interpret the large water-quality data set and to establish an optimal long-term monitoring network. Cluster analysis separated the sampling sites into groups of similarity, and also indicated the stations investigated for correlation and recommended to be removed from the monitoring network. Principal component analysis identified four components, which are responsible for the data structure explaining 80% of the total variance of the data. The principal parameters are characterized as due to mining activities and domestic sewage. Significant data reduction was achieved.
Resumo:
The psychometric properties of the Personal Wellbeing Index are analyzed on a Spanish and Portuguese adolescent sample. We test the reliability of the scale using Cronbach’s alpha. And complementarily we analyze the item-total correlations in the different wellbeing domains included. We execute an exploratory factor analysis (principal components) and a multigroup Confirmatory Factor Analysis (CFA). The results show that Cronbach’s alpha is 0.79 for the Chilean version and in the Brazilian version is 0.78 confirming adequate levels of reliability found in previous studies. Correlations between fields of well-being shows values ranging between 0.224 and 0.496 for Chile and from 0.24 to 0.46 for Brazil. The results are similar to those obtained in other countries. The monofactorial structure of the scale is cinfirmed, also the adjustment to the scale structure to the data of the two samples and the comparability of means of global indices. The results suggest the existence of other well-being domains that had not been considered in the original proposal of the scale
Resumo:
The uncertainty of any analytical determination depends on analysis and sampling. Uncertainty arising from sampling is usually not controlled and methods for its evaluation are still little known. Pierre Gy’s sampling theory is currently the most complete theory about samplingwhich also takes the design of the sampling equipment into account. Guides dealing with the practical issues of sampling also exist, published by international organizations such as EURACHEM, IUPAC (International Union of Pure and Applied Chemistry) and ISO (International Organization for Standardization). In this work Gy’s sampling theory was applied to several cases, including the analysis of chromite concentration estimated on SEM (Scanning Electron Microscope) images and estimation of the total uncertainty of a drug dissolution procedure. The results clearly show that Gy’s sampling theory can be utilized in both of the above-mentioned cases and that the uncertainties achieved are reliable. Variographic experiments introduced in Gy’s sampling theory are beneficially applied in analyzing the uncertainty of auto-correlated data sets such as industrial process data and environmental discharges. The periodic behaviour of these kinds of processes can be observed by variographic analysis as well as with fast Fourier transformation and auto-correlation functions. With variographic analysis, the uncertainties are estimated as a function of the sampling interval. This is advantageous when environmental data or process data are analyzed as it can be easily estimated how the sampling interval is affecting the overall uncertainty. If the sampling frequency is too high, unnecessary resources will be used. On the other hand, if a frequency is too low, the uncertainty of the determination may be unacceptably high. Variographic methods can also be utilized to estimate the uncertainty of spectral data produced by modern instruments. Since spectral data are multivariate, methods such as Principal Component Analysis (PCA) are needed when the data are analyzed. Optimization of a sampling plan increases the reliability of the analytical process which might at the end have beneficial effects on the economics of chemical analysis,
Resumo:
Psychometric analysis of the AF5 multidimensional scale of self-concept in a sample of adolescents and adults in Catalonia. The aim of this study is to carry out a psychometric study of the AF5 scale in a sample of 4.825 Catalan subjects from 11 to 63 years-old. They are students from secondary compulsory education (ESO), from high school, middle-level vocational training (CFGM) and from the university. Using a principal component analysis (PCA) the theoretical validity of the components is established and the reliability of the instrument is also analyzed. Differential analyses are performed by gender and normative group using a 2 6 factorial design. The normative group variable includes the different levels classifi ed into 6 sub-groups: university, post-compulsory secondary education (high school and CFGM), 4th of ESO, 3rd of ESO, 2nd of ESO and 1st of ESO. The results indicate that the reliability of the Catalan version of the scale is similar to the original scale. The factorial structure also fi ts with the original model established beforehand. Signifi cant differences by normative group in the four components of self-concept explored (social, family, academic/occupational and physical) are observed. By gender, signifi cant differences appear in the component of physical self-concept, academic and social but not in the family component
Resumo:
In this work, the volatile chromatographic profiles of roasted Arabica coffees, previously analyzed for their sensorial attributes, were explored by principal component analysis. The volatile extraction technique used was the solid phase microextraction. The correlation optimized warping algorithm was used to align the gas chromatographic profiles. Fifty four compounds were found to be related to the sensorial attributes investigated. The volatiles pyrrole, 1-methyl-pyrrole, cyclopentanone, dihydro-2-methyl-3-furanone, furfural, 2-ethyl-5-methyl-pyrazine, 2-etenyl-n-methyl-pyrazine, 5-methyl-2-propionyl-furan compounds were important for the differentiation of coffee beverage according to the flavour, cleanliness and overall quality. Two figures of merit, sensitivity and specificity (or selectivity), were used to interpret the sensory attributes studied.
Resumo:
Raw measurement data does not always immediately convey useful information, but applying mathematical statistical analysis tools into measurement data can improve the situation. Data analysis can offer benefits like acquiring meaningful insight from the dataset, basing critical decisions on the findings, and ruling out human bias through proper statistical treatment. In this thesis we analyze data from an industrial mineral processing plant with the aim of studying the possibility of forecasting the quality of the final product, given by one variable, with a model based on the other variables. For the study mathematical tools like Qlucore Omics Explorer (QOE) and Sparse Bayesian regression (SB) are used. Later on, linear regression is used to build a model based on a subset of variables that seem to have most significant weights in the SB model. The results obtained from QOE show that the variable representing the desired final product does not correlate with other variables. For SB and linear regression, the results show that both SB and linear regression models built on 1-day averaged data seriously underestimate the variance of true data, whereas the two models built on 1-month averaged data are reliable and able to explain a larger proportion of variability in the available data, making them suitable for prediction purposes. However, it is concluded that no single model can fit well the whole available dataset and therefore, it is proposed for future work to make piecewise non linear regression models if the same available dataset is used, or the plant to provide another dataset that should be collected in a more systematic fashion than the present data for further analysis.
Resumo:
The hydroalcoholic extracts prepared from standard leaves of Maytenus ilicifolia and commercial samples of espinheira-santa were evaluated qualitatively (fingerprinting) and quantitatively. In this paper, fingerprinting chromatogram coupled with Principal Component Analysis (PCA) is described for the metabolomic analysis of standard and commercial espinheira-santa samples. The epicatechin standard was used as an external standard for the development and validation of a quantitative method for the analysis in herbal medicines using a photo diode array detector. This method has been applied for quantification of epicatechin in commercialized herbal medicines sold as espinheira-santa in Brazil and in the standard sample of M. ilicifolia.
Resumo:
This study optimized and validated a method to perform chemical speciation of inorganic arsenic in water samples collected under the Monitoring Program of the Port of Rio Grande-RS in July and October 2010 from the Laguna dos Patos Estuary (RS, Brazil). The flow injection hydride generation atomic absorption spectrometry technique was employed, allowing quantification of As3+ and As5+ present in estuarine water samples. Data interpretation for results generated using the improved method for analyzing water samples collected from Laguna dos Patos Estuary was done by main components analysis.
Resumo:
The application of automated correlation optimized warping (ACOW) to the correction of retention time shift in the chromatographic fingerprints of Radix Puerariae thomsonii (RPT) was investigated. Twenty-seven samples were extracted from 9 batches of RPT products. The fingerprints of the 27 samples were established by the HPLC method. Because there is a retention time shift in the established fingerprints, the quality of these samples cannot be correctly evaluated by using similarity estimation and principal component analysis (PCA). Thus, the ACOW method was used to align these fingerprints. In the ACOW procedure, the warping parameters, which have a significant influence on the alignment result, were optimized by an automated algorithm. After correcting the retention time shift, the quality of these RPT samples was correctly evaluated by similarity estimation and PCA. It is demonstrated that ACOW is a practical method for aligning the chromatographic fingerprints of RPT. The combination of ACOW, similarity estimation, and PCA is shown to be a promising method for evaluating the quality of Traditional Chinese Medicine.
Resumo:
The calyxes of Hibiscus sabdariffa are used in traditional medicine around the world. However, quality assurance protocols and chemical variability have not been previously analyzed. In the present study, chemical characterization of a set of samples of H. sabdariffa calyxes commercialized in Colombia was accomplished with the aim to explore the chemical variability among them. Chemometrics-based analyses on the data obtained from the HPLC-UV-DAD-derived profiles were then performed. Thus, the pre-processed single-wavelength data were subjected to principal component analysis (PCA). The PCA-derived results evidenced different groups which were well-correlated to the corresponding total phenolic and total anthocyanin contents. Multi-wavelength chromatographic (HPLC-UV-DAD surfaces) data were additionally examined via parallel factor analysis (PARAFAC) as data reduction method and the obtained loadings were subsequently submitted to PCA and orthogonal partial least squares discriminant analysis (OPLS-DA). Results were thus consistent with those from single-wavelength data. PCA loadings were employed to determine those chemical components responsible for the data variance and OPLS-DA model, constructed from PARAFAC loadings, and indicated differentiation according total anthocyanin contents among samples. The present chemometric analysis therefore demonstrated to be an excellent tool for differentiation of H. sabdariffacalyxes according to their chemical composition.
Resumo:
This work is devoted to the analysis of signal variation of the Cross-Direction and Machine-Direction measurements from paper web. The data that we possess comes from the real paper machine. Goal of the work is to reconstruct the basis weight structure of the paper and to predict its behaviour to the future. The resulting synthetic data is needed for simulation of paper web. The main idea that we used for describing the basis weight variation in the Cross-Direction is Empirical Orthogonal Functions (EOF) algorithm, which is closely related to Principal Component Analysis (PCA) method. Signal forecasting in time is based on Time-Series analysis. Two principal mathematical procedures that we used in the work are Autoregressive-Moving Average (ARMA) modelling and Ornstein–Uhlenbeck (OU) process.