951 results for principal component analysis (PCA)
Abstract:
The objective of this work is to demonstrate the efficient use of Principal Component Analysis (PCA) as a method to pre-process the original multivariate data, which are rewritten in a new matrix with the principal components sorted by their accumulated variance. An Artificial Neural Network (ANN) with the backpropagation algorithm is trained using, as input, this pre-processed data set derived from PCA, which represents 90.02% of the accumulated variance of the original data. The training goal is to model Dissolved Oxygen using information from other physical and chemical parameters. The water samples used in the experiments were gathered from the Paraíba do Sul River in São Paulo State, Brazil. The smallest Mean Square Error (MSE) is used to compare the results of the different architectures and choose the best one. Using this method reduced the input data by more than 20%, which contributed directly to shortening the time and computational effort of ANN training.
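The pre-processing step described in this abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function name `pca_reduce`, the standardization of the columns, the 0.90 threshold, and the synthetic data are all assumptions made for the example, and NumPy is assumed to be available.

```python
import numpy as np

def pca_reduce(X, target=0.90):
    """Project X onto the fewest principal components whose cumulative
    explained variance reaches `target` (hypothetical helper)."""
    X = np.asarray(X, dtype=float)
    # Standardize columns so parameters with different units are comparable.
    Z = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)
    # The covariance of standardized data is the correlation matrix;
    # it is symmetric, so eigh is the appropriate solver.
    eigvals, eigvecs = np.linalg.eigh(np.cov(Z, rowvar=False))
    order = np.argsort(eigvals)[::-1]  # sort components by descending variance
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    cumulative = np.cumsum(eigvals) / eigvals.sum()
    # First index at which the cumulative share reaches the target, plus one.
    k = int(np.searchsorted(cumulative, target) + 1)
    return Z @ eigvecs[:, :k], cumulative[:k]

# Synthetic stand-in for a table of water-quality parameters (not real data):
# 200 samples, 8 correlated columns driven by 3 latent factors.
rng = np.random.default_rng(0)
latent = rng.normal(size=(200, 3))
X = latent @ rng.normal(size=(3, 8)) + 0.1 * rng.normal(size=(200, 8))

scores, cumulative = pca_reduce(X)
print(scores.shape, cumulative)
```

The returned `scores` matrix would then serve as the network input in place of the original columns; fewer columns than the original table means fewer input nodes to train.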
Abstract:
The present paper demonstrates the application of hybrid GGA functionals with long-range corrections to the calculation of the electronic properties of artemisinin and two of its derivatives, artemether and artesunate. Due to the relatively large amount of data obtained, the statistical method of Principal Component Analysis was employed. The functionals of the WB97 family are observed to be the most appropriate for determining reactivity indexes, which are the principal descriptors that are probably associated with the antimalarial and anticancer properties of this group of molecules. In addition, it was observed that all the functionals satisfactorily describe the geometric properties of the studied molecules.
Abstract:
Early identification of beginning readers at risk of developing reading and writing difficulties plays an important role in the prevention and provision of appropriate intervention. In Tanzania, as in other countries, there are children in schools who are at risk of developing reading and writing difficulties. Many of these children complete school without being identified and without proper and relevant support. The main language in Tanzania is Kiswahili, a transparent language. Contextually relevant, reliable and valid instruments of identification are needed in Tanzanian schools. This study aimed at the construction and validation of a group-based screening instrument in the Kiswahili language for identifying beginning readers at risk of reading and writing difficulties. In studying the function of the test there was special interest in analyzing the explanatory power of certain contextual factors related to the home and school. Halfway through grade one, 337 children from four purposively selected primary schools in Morogoro municipality were screened with a group test consisting of 7 subscales measuring phonological awareness, word and letter knowledge and spelling. A questionnaire about background factors and the home and school environments related to literacy was also used. The schools were chosen based on performance status (i.e. high, good, average and low performing schools) in order to include variation. For validation, 64 children were chosen from the original sample to take an individual test measuring nonsense word reading, word reading, actual text reading, one-minute reading and writing. School marks from grade one and a follow-up test half way through grade two were also used for validation. The correlations between the results from the group test and the three measures used for validation were very high (.83-.95). Content validity of the group test was established by using items drawn from authorized text books for reading in grade one. 
Construct validity was analyzed through item analysis and principal component analysis. The difficulty level of most items in both the group test and the follow-up test was good. The items also discriminated well. Principal component analysis revealed one powerful latent dimension (initial literacy factor), accounting for 93% of the variance. This implies that it could be possible to use any set of the subtests of the group test for screening and prediction. The K-Means cluster analysis revealed four clusters: at-risk children, strugglers, readers and good readers. The main concern in this study was with the groups of at-risk children (24%) and strugglers (22%), who need the most assistance. The predictive validity of the group test was analyzed by correlating the measures from the two school years and by cross tabulating grade one and grade two clusters. All the correlations were positive and very high, and 94% of the at-risk children in grade two were already identified in the group test in grade one. The explanatory power of some of the home and school factors was very strong. The number of books at home accounted for 38% of the variance in reading and writing ability measured by the group test. Parents' reading ability and the support children received at home for schoolwork were also influential factors. Among the studied school factors, school attendance had the strongest explanatory power, accounting for 21% of the variance in reading and writing ability. Having been in nursery school was also of importance. Based on the findings in the study, a short version of the group test was created. It is suggested for use in the screening processes in grade one aiming at identifying children at risk of reading and writing difficulties in the Tanzanian context. Suggestions for further research as well as for actions for improving the literacy skills of Tanzanian children are presented.
Abstract:
Visual data mining (VDM) tools employ information visualization techniques in order to represent large amounts of high-dimensional data graphically and to involve the user in exploring data at different levels of detail. The users are looking for outliers, patterns and models – in the form of clusters, classes, trends, and relationships – in different categories of data, i.e., financial, business information, etc. The focus of this thesis is the evaluation of multidimensional visualization techniques, especially from the business user’s perspective. We address three research problems. The first problem is the evaluation of projection-based visualizations with respect to their effectiveness in preserving the original distances between data points and the clustering structure of the data. In this respect, we propose the use of existing clustering validity measures. We illustrate their usefulness in evaluating five visualization techniques: Principal Components Analysis (PCA), Sammon’s Mapping, Self-Organizing Map (SOM), Radial Coordinate Visualization and Star Coordinates. The second problem is concerned with evaluating different visualization techniques as to their effectiveness in visual data mining of business data. For this purpose, we propose an inquiry evaluation technique and conduct the evaluation of nine visualization techniques. The visualizations under evaluation are Multiple Line Graphs, Permutation Matrix, Survey Plot, Scatter Plot Matrix, Parallel Coordinates, Treemap, PCA, Sammon’s Mapping and the SOM. The third problem is the evaluation of quality of use of VDM tools. We provide a conceptual framework for evaluating the quality of use of VDM tools and apply it to the evaluation of the SOM. In the evaluation, we use an inquiry technique for which we developed a questionnaire based on the proposed framework. The contributions of the thesis consist of three new evaluation techniques and the results obtained by applying these evaluation techniques. 
The thesis provides a systematic approach to evaluation of various visualization techniques. In this respect, first, we performed and described the evaluations in a systematic way, highlighting the evaluation activities, and their inputs and outputs. Secondly, we integrated the evaluation studies in the broad framework of usability evaluation. The results of the evaluations are intended to help developers and researchers of visualization systems to select appropriate visualization techniques in specific situations. The results of the evaluations also contribute to the understanding of the strengths and limitations of the visualization techniques evaluated and further to the improvement of these techniques.
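As an illustration of the first research problem (how well a projection preserves the original distances between data points), one widely used distance-preservation score is Sammon's stress. The sketch below is an assumption chosen for illustration: it is not one of the clustering validity measures the thesis proposes, and the function name and toy points are hypothetical.

```python
from itertools import combinations
from math import dist

def sammon_stress(high, low):
    """Sammon's stress between original points `high` and their projection `low`.

    0 means all pairwise distances are preserved exactly; larger is worse.
    Assumes the original points are pairwise distinct (non-zero distances).
    """
    pairs = list(combinations(range(len(high)), 2))
    d_high = [dist(high[i], high[j]) for i, j in pairs]
    d_low = [dist(low[i], low[j]) for i, j in pairs]
    weighted = sum((dh - dl) ** 2 / dh for dh, dl in zip(d_high, d_low))
    return weighted / sum(d_high)

# Toy data: four points lying in the z = 0 plane of a 3-D space.
points = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (1.0, 1.0, 0.0)]
keep_xy = [(x, y) for x, y, z in points]  # drops the uninformative axis
keep_xz = [(x, z) for x, y, z in points]  # drops an informative axis

print(sammon_stress(points, keep_xy), sammon_stress(points, keep_xz))
```

The same score can be applied to the output of any projection method, which is what makes it usable for comparing techniques such as PCA and Sammon's Mapping on equal terms.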
Abstract:
In this thesis, a classification problem in predicting the creditworthiness of a customer is tackled. This is done by proposing a reliable classification procedure on a given data set. The aim of this thesis is to design a model that gives the best classification accuracy to effectively predict bankruptcy. FRPCA techniques proposed by Yang and Wang have been preferred since they are tolerant to certain types of noise in the data. These include FRPCA1, FRPCA2 and FRPCA3, from which the best method is chosen. Two different approaches are used at the classification stage: the similarity classifier and the FKNN classifier. Algorithms are tested with the Australian credit card screening data set. Results obtained indicate a mean classification accuracy of 83.22% using FRPCA1 with the similarity classifier. The FKNN approach yields a mean classification accuracy of 85.93% when used with FRPCA2, making it a better method for suitable choices of the number of nearest neighbors and fuzziness parameters. Details on the calibration of the fuzziness parameter and other parameters associated with the similarity classifier are discussed.
Abstract:
This study evaluated the photosynthetic responses of seven tropical trees of different successional groups under contrasting irradiance conditions, taking into account changes in gas exchange and chlorophyll a fluorescence. Although early successional species showed higher values of CO2 assimilation (A) and transpiration (E), there was no defined pattern in the daily gas exchange responses to high irradiance (FSL) among the evaluated species. Cariniana legalis (Mart.) Kuntze (late secondary) and Astronium graveolens Jacq. (early secondary) exhibited larger reductions in daily-integrated CO2 assimilation (DIA) when transferred from medium light (ML) to FSL. On the other hand, the pioneer species Guazuma ulmifolia Lam. showed a significant DIA increase when exposed to FSL. The pioneers Croton spp. tended to show a DIA decrease of around 19%, while Cytharexyllum myrianthum Cham. (pioneer) and Rhamnidium elaeocarpum Reiss. (early secondary) tended to increase DIA when transferred to FSL. Under this condition, all species showed dynamic photoinhibition, except for C. legalis, which presented chronic photoinhibition of photosynthesis. Considering daily photosynthetic processes, our results supported the hypothesis of more flexible responses of early successional species (pioneer and early secondary species). The principal component analysis indicated that the photochemical parameters effective quantum efficiency of photosystem II and apparent electron transport rate were more suitable for separating the successional groups under the ML condition, whereas A and E play a major role in this task under the FSL condition.
Abstract:
This study deals with the Entrepreneurship Education Measurement Tool (Yrittäjyyskasvatuksen Mittaristo) project, in which the object of research is the views and experiences of class teachers and subject teachers at the first level of comprehensive school regarding network cooperation in entrepreneurship education. The purpose of the study was to find out how well teachers know network cooperation, what their knowledge of entrepreneurship education is, and how this shows in their work and teaching. The sample of the study is 450 teachers. The results were analyzed with the SPSS statistical software. The statistical methods used were frequency analysis of distributions, principal component analysis within factor analysis, and two-way analysis of variance (ANOVA). The conclusion of the study is that teachers' knowledge of the services offered by cooperation networks is very fragmented. The problem is likely to persist until more entrepreneurship education and entrepreneurship studies are brought into teacher training programmes. This should also be taken into account in future curricula. The aim of this study was to present the current state of entrepreneurship education through the results of the Entrepreneurship Education Measurement Tool, to offer solutions for teaching in the form of proposals, and to stimulate discussion on improving entrepreneurship education.
Abstract:
Ferruginous "campos rupestres" are a particular type of vegetation growing on iron-rich primary soils. We investigated the influence of soil properties on plant species abundance at two sites of ferruginous "campos rupestres" and one site of quartzitic "campo rupestre", all of them in "Quadrilátero Ferrífero", in Minas Gerais State, southeastern Brazil. In each site, 30 quadrats were sampled to assess plant species composition and abundance, and soil samples were taken to perform chemical and physical analyses. The analyzed soils are strongly acidic and presented low fertility and high levels of metallic cations; a principal component analysis of soil data showed a clear segregation among sites due mainly to fertility and heavy metals content, especially Cu, Zn, and Pb. The canonical correspondence analysis indicated a strong correlation between plant species abundance and soil properties, also segregating the sites.
Abstract:
The objective of the present study was to investigate the psychometric properties and cross-cultural validity of the Beck Depression Inventory (BDI) among ethnic Chinese living in the city of São Paulo, Brazil. The study was conducted on 208 community individuals. Reliability and discriminant analysis were used to test the psychometric properties and validity of the BDI. Principal component analysis was performed to assess the BDI's factor structure for the total sample and by gender. The mean BDI score was lower (6.74, SD = 5.98) than observed in Western counterparts and showed no gender difference, good internal consistency (Cronbach's alpha 0.82), and high discrimination of depressive symptoms (75-100%). Factor analysis extracted two factors for the total sample and each gender: cognitive-affective dimension and somatic dimension. We conclude that depressive symptoms can be reliably assessed by the BDI in the Brazilian Chinese population, with a validity comparable to that for international studies. Indeed, cultural and measurement biases might have influenced the response of Chinese subjects.
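The internal-consistency statistic reported in this abstract, Cronbach's alpha, is computed from the item variances and the variance of the total score: alpha = k/(k-1) * (1 - sum of item variances / variance of totals), where k is the number of items. A minimal sketch follows; the function name and the small response matrix are invented for illustration and are not BDI data from the study.

```python
from statistics import variance

def cronbach_alpha(item_scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(item_scores[0])
    # Sample variance of each item (column) across respondents.
    item_vars = [variance([row[j] for row in item_scores]) for j in range(k)]
    # Sample variance of each respondent's total score.
    total_var = variance([sum(row) for row in item_scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)

# Hypothetical responses: 5 respondents, 3 items scored 0-3.
responses = [
    [2, 3, 3],
    [1, 1, 2],
    [0, 1, 0],
    [3, 3, 2],
    [1, 2, 1],
]
print(round(cronbach_alpha(responses), 3))  # prints 0.9
```

Values closer to 1 indicate that the items vary together, which is the sense in which the abstract describes its reported alpha as good internal consistency.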
Abstract:
Functional and technological properties of wheat depend on its chemical composition, which together with structural and microscopic characteristics, define flour quality. The aim of the present study was to characterize four Brazilian wheat cultivars (BRS Louro, BRS Timbauva, BRS Guamirim and BRS Pardela) and their respective flours in order to indicate specific technological applications. Kernels were analyzed for test weight, thousand kernel weight, hardness, moisture, and water activity. Flours were analyzed for water activity, color, centesimal composition, total dietary fiber, amylose content and identification of high molecular weight glutenins. The rheological properties of the flours were estimated by farinography, extensography, falling number, rapid visco amylography, and glutomatic and glutork equipment. Baking tests and scanning electron microscopy were also performed. The data were subjected to analysis of variance and principal component analysis. BRS Timbauva and BRS Guamirim presented results that did not allow for specific technological application. On the other hand, BRS Louro presented suitable characteristics for the elaboration of products with low dough strength such as cakes, pies and biscuits, while BRS Pardela seemed suitable for bread and pasta products.
Abstract:
The objective of this study was to determine the best harvesting method for lettuce cultivars (American 'Graciosa', 'Vanda', 'Marcela' and 'Lavínia'). To this end, quality and shelf-life were evaluated using sensory analyses. Lettuce heads were harvested with the root on by the producer, so that they could be cut in different ways in the laboratory. The samples were stored in a cold chamber at 10 °C and 80% ± 2% relative humidity for nine days, and the sensory analyses were performed according to the Qualitative Descriptive Analysis method on days 1, 3, 6, and 9 of storage by twelve trained testers. The results were evaluated by analysis of variance, principal component analysis, and the Tukey test at a 95% confidence level. The results indicate that there was a reduction in the quality of lettuce between the 1st and the 5th day of storage and that after the sixth day of storage the lettuce samples were considered unfit for consumption, except for the 'Lavínia' lettuce cultivar with root and cut treatment 2. On the ninth day of storage all samples were considered inappropriate for consumption.
Abstract:
Identification of functional properties of wheat flour by specific tests allows genotypes with appropriate characteristics to be selected for specific industrial uses. The objective of wheat breeding programs is to improve the quality of the germplasm bank in order to be able to develop wheat with suitable gluten strength and extensibility for bread making. The aim of this study was to evaluate 16 wheat genotypes by correlating both glutenin subunits of high and low molecular weight and gliadin subunits with the physicochemical characteristics of the grain. Protein content, sedimentation volume, sedimentation index, and falling number values were analyzed after the grains were milled. Hectoliter weight and mass of 1000 seeds were also determined. The glutenin and gliadin subunits were separated using polyacrylamide gel in the presence of sodium dodecyl sulfate. The data were evaluated using analysis of variance, Pearson's correlation, principal component analysis, and cluster analysis. The IPR 85, IPR Catuara TM, T 091015, and T 091069 genotypes stood out from the others, which indicates their possibly superior grain quality, with higher sedimentation volume, higher sedimentation index, and higher mass of 1000 seeds; these genotypes possessed the subunits 1 (Glu-A1), 5 + 10 (Glu-D1), c (Glu-A3), and b (Glu-B3), with the exception of the T 091069 genotype, which possessed the g allele instead of b in Glu-B3.
Abstract:
This study evaluated the effect of adding flaxseed flour to the diet of Nile tilapia on the fatty acid composition of fillets using chemometrics. A traditional and an experimental diet containing flaxseed flour were used to feed the fish for 60 days. An increase of 18:3 n-3 and 22:6 n-3 and a decrease of 18:2 n-6 were observed in the tilapia fillets fed the experimental diet. There was a reduction in the n-6:n-3 ratio. A period of 45 days of incorporation caused a significant change in tilapia chemical composition. Principal Component Analysis showed that the time periods of 45 and 60 days positively contributed to the total content of n-3, LNA, and DHA, highlighting the effect of omega-3 incorporation in the treatment containing flaxseed flour.
Abstract:
In the search for renewable energy sources, Brazilian residual biomasses stand out due to their favorable physical and chemical properties, their low cost, and the fact that they are less polluting. Therefore, they are likely to be used in biorefineries in the production of chemical inputs to substitute fossil fuels. This substitution is possible due to the high contents of carbohydrates (>40%) and the low contents of extractives (<20%), ashes (<8%) and moisture (<8%) found in these residual biomasses. The high calorific values of all residues also make them suitable for use in combustion. A principal components analysis (PCA) was performed for a better understanding of the samples and their physicochemical properties. Thus, this study aimed to characterize biomasses from the north (babassu residues, such as mesocarp and endocarp; pequi and Brazil nut) and northeast (agave and coconut) regions of Brazil, in order to contribute to the preservation of the environment and strengthen the economy of the region.
Abstract:
The study approaches student travel from the perspective of postmodern consumption. The background is the observation that the student travel market has a vast potential, but it is not necessarily capitalized upon to the extent it could be. This might partly have to do with the peculiarities of postmodernity: consumption is characterized by unpredictability and abstract motives. The research questions are built around what constitutes student travel consumption and how students can be categorized according to motivation, behaviour and values. Identity and expressiveness are also considered, and it is evaluated whether travel services facilitate these constructs. The topic is approached by exploring key concepts such as self-identity. This was done in order to create survey questions that reflect the underlying theories. The survey was sent to chosen student groups of Turku School of Economics. The data were analyzed using statistical methods, mainly principal component analysis, in order to categorize students’ motives and behaviour into distinct profiles. The findings indicate that students have a high level of awareness in their travel consumption choices. Travel services seem to facilitate identity and lifestyle expressiveness, one central dimension of postmodernity. Psychographics such as motivation seem to work well as a segmentation criterion when it comes to the student traveler market. 
Travel offers students an opportunity for relaxation, escape, enjoyment and gaining new experiences and social contacts. Furthermore, the enjoyment of the travel experience extends to the pre- and post-trip time.