929 resultados para Principal component analysis discriminant analysis
Resumo:
The present paper demonstrates the application of functional GGA hybrids, with long-range corrections, for the calculation of the electronic properties of artemisinin and two of its derivatives - artemether e artesunate. Due to the relatively large amount of data obtained, the statistical method of Principal Component Analysis was employed. The functionals of the WB97 family are observed to be the most appropriate for the determining of reactivity indexes, which are the principal descriptors that, probably, are associated with the antimalarial and anticancer properties of this group of molecules. In addition, it was also observed that all the functionals obtained satisfactorily describe the geometric properties of the studied.
Resumo:
Early identification of beginning readers at risk of developing reading and writing difficulties plays an important role in the prevention and provision of appropriate intervention. In Tanzania, as in other countries, there are children in schools who are at risk of developing reading and writing difficulties. Many of these children complete school without being identified and without proper and relevant support. The main language in Tanzania is Kiswahili, a transparent language. Contextually relevant, reliable and valid instruments of identification are needed in Tanzanian schools. This study aimed at the construction and validation of a group-based screening instrument in the Kiswahili language for identifying beginning readers at risk of reading and writing difficulties. In studying the function of the test there was special interest in analyzing the explanatory power of certain contextual factors related to the home and school. Halfway through grade one, 337 children from four purposively selected primary schools in Morogoro municipality were screened with a group test consisting of 7 subscales measuring phonological awareness, word and letter knowledge and spelling. A questionnaire about background factors and the home and school environments related to literacy was also used. The schools were chosen based on performance status (i.e. high, good, average and low performing schools) in order to include variation. For validation, 64 children were chosen from the original sample to take an individual test measuring nonsense word reading, word reading, actual text reading, one-minute reading and writing. School marks from grade one and a follow-up test half way through grade two were also used for validation. The correlations between the results from the group test and the three measures used for validation were very high (.83-.95). Content validity of the group test was established by using items drawn from authorized text books for reading in grade one. Construct validity was analyzed through item analysis and principal component analysis. The difficulty level of most items in both the group test and the follow-up test was good. The items also discriminated well. Principal component analysis revealed one powerful latent dimension (initial literacy factor), accounting for 93% of the variance. This implies that it could be possible to use any set of the subtests of the group test for screening and prediction. The K-Means cluster analysis revealed four clusters: at-risk children, strugglers, readers and good readers. The main concern in this study was with the groups of at-risk children (24%) and strugglers (22%), who need the most assistance. The predictive validity of the group test was analyzed by correlating the measures from the two school years and by cross tabulating grade one and grade two clusters. All the correlations were positive and very high, and 94% of the at-risk children in grade two were already identified in the group test in grade one. The explanatory power of some of the home and school factors was very strong. The number of books at home accounted for 38% of the variance in reading and writing ability measured by the group test. Parents´ reading ability and the support children received at home for schoolwork were also influential factors. Among the studied school factors school attendance had the strongest explanatory power, accounting for 21% of the variance in reading and writing ability. Having been in nursery school was also of importance. Based on the findings in the study a short version of the group test was created. It is suggested for use in the screening processes in grade one aiming at identifying children at risk of reading and writing difficulties in the Tanzanian context. Suggestions for further research as well as for actions for improving the literacy skills of Tanzanian children are presented.
Resumo:
The characterization of different ecological groups in a forest formation/succession is unclear. To better define the different successional classes, we have to consider ecophysiological aspects, such as the capacity to use or dissipate the light energy available. The main objective of this work was to assess the chlorophyll fluorescence emission of tropical tree species growing in a gap of a semi-deciduous forest. Three species of different ecological groups were selected: Croton floribundus Spreng. (pioneer, P), Astronium graveolens Jacq. (early secondary, Si), and Esenbeckia febrifuga A. Juss. (late secondary, St). The potential (Fv/Fm) and effective (deltaF/Fm') quantum efficiency of photosystem II, apparent electron transport rate (ETR), non-photochemical (qN) and photochemical (qP) quenching of fluorescence were evaluated, using a modulated fluorometer, between 7:30 and 11:00 h. Values of Fv/Fm remained constant in St, decreasing in P and Si after 9:30 h, indicating the occurrence of photoinhibition. Concerning the measurements taken under light conditions (deltaF/Fm', ETR, qP and qN), P and Si showed better photochemical performance, i.e., values of deltaF/Fm', ETR and qP were higher than St when light intensity was increased. Values of qN indicated that P and Si had an increasing tendency of dissipating the excess of energy absorbed by the leaf, whereas the opposite was found for St. The principal component analysis (PCA), considering all evaluated parameters, showed a clear distinction between St, P and Si, with P and Si being closer. The PCA results suggest that chlorophyll fluorescence may be a potential tool to differentiate tree species from distinct successional groups.
Resumo:
This work is devoted to the analysis of signal variation of the Cross-Direction and Machine-Direction measurements from paper web. The data that we possess comes from the real paper machine. Goal of the work is to reconstruct the basis weight structure of the paper and to predict its behaviour to the future. The resulting synthetic data is needed for simulation of paper web. The main idea that we used for describing the basis weight variation in the Cross-Direction is Empirical Orthogonal Functions (EOF) algorithm, which is closely related to Principal Component Analysis (PCA) method. Signal forecasting in time is based on Time-Series analysis. Two principal mathematical procedures that we used in the work are Autoregressive-Moving Average (ARMA) modelling and Ornstein–Uhlenbeck (OU) process.
Resumo:
The study of spatial variability of soil and plants attributes, or precision agriculture, a technique that aims the rational use of natural resources, is expanding commercially in Brazil. Nevertheless, there is a lack of mathematical analysis that supports the correlation of these independent variables and their interactions with the productivity, identifying scientific standards technologically applicable. The aim of this study was to identify patterns of soil variability according to the eleven physical and seven chemical indicators in an agricultural area. It was used two multivariate techniques: the hierarchical cluster analysis (HCA) and the principal component analysis (PCA). According to the HCA, the area was divided into five management zones: zone 1 with 2.87ha, zone 2 with 0.8ha, zone 3 with 1.84ha, zone 4 with 1.33ha and zone 5 with 2.76ha. By the PCA, it was identified the most important variables within each zone: V% for the zone 1, CTC in the zone 2, levels of H+Al in the zone 4 and sand content and altitude in the zone 5. The zone 3 was classified as an intermediate zone with characteristics of all others. According to the results it is concluded that it is possible to separate into groups (management zones) samples with the same patterns of variability by the multivariate statistical techniques.
Resumo:
In this thesis, a classi cation problem in predicting credit worthiness of a customer is tackled. This is done by proposing a reliable classi cation procedure on a given data set. The aim of this thesis is to design a model that gives the best classi cation accuracy to e ectively predict bankruptcy. FRPCA techniques proposed by Yang and Wang have been preferred since they are tolerant to certain type of noise in the data. These include FRPCA1, FRPCA2 and FRPCA3 from which the best method is chosen. Two di erent approaches are used at the classi cation stage: Similarity classi er and FKNN classi er. Algorithms are tested with Australian credit card screening data set. Results obtained indicate a mean classi cation accuracy of 83.22% using FRPCA1 with similarity classi- er. The FKNN approach yields a mean classi cation accuracy of 85.93% when used with FRPCA2, making it a better method for the suitable choices of the number of nearest neighbors and fuzziness parameters. Details on the calibration of the fuzziness parameter and other parameters associated with the similarity classi er are discussed.
Resumo:
Singular Value Decomposition (SVD), Principal Component Analysis (PCA) and Multiple Linear Regression (MLR) are some of the mathematical pre- liminaries that are discussed prior to explaining PLS and PCR models. Both PLS and PCR are applied to real spectral data and their di erences and similarities are discussed in this thesis. The challenge lies in establishing the optimum number of components to be included in either of the models but this has been overcome by using various diagnostic tools suggested in this thesis. Correspondence analysis (CA) and PLS were applied to ecological data. The idea of CA was to correlate the macrophytes species and lakes. The di erences between PLS model for ecological data and PLS for spectral data are noted and explained in this thesis. i
Resumo:
In this study, cantilever-enhanced photoacoustic spectroscopy (CEPAS) was applied in different drug detection schemes. The study was divided into two different applications: trace detection of vaporized drugs and drug precursors in the gas-phase, and detection of cocaine abuse in hair. The main focus, however, was the study of hair samples. In the gas-phase, methyl benzoate, a hydrolysis product of cocaine hydrochloride, and benzyl methyl ketone (BMK), a precursor of amphetamine and methamphetamine were investigated. In the solid-phase, hair samples from cocaine overdose patients were measured and compared to a drug-free reference group. As hair consists mostly of long fibrous proteins generally called keratin, proteins from fingernails and saliva were also studied for comparison. Different measurement setups were applied in this study. Gas measurements were carried out using quantum cascade lasers (QLC) as a source in the photoacoustic detection. Also, an external cavity (EC) design was used for a broader tuning range. Detection limits of 3.4 particles per billion (ppb) for methyl benzoate and 26 ppb for BMK in 0.9 s were achieved with the EC-QCL PAS setup. The achieved detection limits are sufficient for realistic drug detection applications. The measurements from drug overdose patients were carried out using Fourier transform infrared (FTIR) PAS. The drug-containing hair samples and drug-free samples were both measured with the FTIR-PAS setup, and the measured spectra were analyzed statistically with principal component analysis (PCA). The two groups were separated by their spectra with PCA and proper spectral pre-processing. To improve the method, ECQCL measurements of the hair samples, and studies using photoacoustic microsampling techniques, were performed. High quality, high-resolution spectra with a broad tuning range were recorded from a single hair fiber. This broad tuning range of an EC-QCL has not previously been used in the photoacoustic spectroscopy of solids. However, no drug detection studies were performed with the EC-QCL solid-phase setup.
Resumo:
This study evaluated the photosynthetic responses of seven tropical trees of different successional groups under contrasting irradiance conditions, taking into account changes in gas exchange and chlorophyll a fluorescence. Although early successional species have shown higher values of CO2 assimilation (A) and transpiration (E), there was not a defined pattern of the daily gas exchange responses to high irradiance (FSL) among evaluated species. Cariniana legalis (Mart.) Kuntze (late secondary) and Astronium graveolens Jacq. (early secondary) exhibited larger reductions in daily-integrated CO2 assimilation (DIA) when transferred from medium light (ML) to FSL. On the other hand, the pioneer species Guazuma ulmifolia Lam. had significant DIA increase when exposed to FSL. The pioneers Croton spp. trended to show a DIA decrease around 19%, while Cytharexyllum myrianthum Cham. (pioneer) and Rhamnidium elaeocarpum Reiss. (early secondary) trended to increase DIA when transferred to FSL. Under this condition, all species showed dynamic photoinhibition, except for C. legalis that presented chronic photoinhibition of photosynthesis. Considering daily photosynthetic processes, our results supported the hypothesis of more flexible responses of early successional species (pioneer and early secondary species). The principal component analysis indicated that the photochemical parameters effective quantum efficiency of photosystem II and apparent electron transport rate were more suitable to separate the successional groups under ML condition, whereas A and E play a major role to this task under FSL condition.
Resumo:
Tutkimus käsittelee Yrittäjyyskasvatuksen Mittariston -projektia, jossa tutkimuskohteena on peruskoulun ensimmäisen asteen luokan- ja aineenopettajien näkemys ja kokemus yrittäjyyskasvatuksen verkostoyhteistyöstä. Tutkimuksen tarkoituksena oli selvittää miten hyvin opettajat tuntevat verkostoyhteistyötä, mikä on heidän tietämyksensä yrittäjyyskasvatuksesta ja kuinka tämä näkyy heidän työssään ja opetuksessaan. Tutkimuksen otos on 450 opettajaa. Tulokset analysoitiin SPSS-tilastomenetelmäohjelmalla. Tilastollisina tutkimusmenetelminä käytettiin jakaumien frekvenssianalyysiä, Faktorianalyysin Pääkomponenttianalyysiä ja Kaksisuuntaista varianssianalyysia (Anova). Tutkimuksen johtopäätöksenä voidaan todeta, että opettajien tiedot yhteistyö-verkostojen tarjoamista palveluista ovat hyvin hajanaiset. Ongelma jatkuu helposti niin kauan kunnes opettajien koulutusohjelmaan tuodaan lisää yrittäjyyskasvatus- ja yrittäjyysopintoja. Tämä pitäisi huomioida myös tulevissa opetussuunnitelmissa. Tämän tutkimuksen tavoitteena oli tuoda esille Yrittäjyyskasvatuksen mittariston tulosten kautta yrittäjyyskasvatuksen nykytila, tuoda ratkaisuja ehdotusten kautta opetukseen ja herättää keskustelua yrittäjyyskasvatuksen parantamiseksi.
Resumo:
Ferruginous "campos rupestres" are a particular type of vegetation growing on iron-rich primary soils. We investigated the influence of soil properties on plant species abundance at two sites of ferruginous "campos rupestres" and one site of quartzitic "campo rupestre", all of them in "Quadrilátero Ferrífero", in Minas Gerais State, southeastern Brazil. In each site, 30 quadrats were sampled to assess plant species composition and abundance, and soil samples were taken to perform chemical and physical analyses. The analyzed soils are strongly acidic and presented low fertility and high levels of metallic cations; a principal component analysis of soil data showed a clear segregation among sites due mainly to fertility and heavy metals content, especially Cu, Zn, and Pb. The canonical correspondence analysis indicated a strong correlation between plant species abundance and soil properties, also segregating the sites.
Resumo:
This thesis describes the occurrence and sources of selected persistent organic pollutants (POPs) such as polychlorinated dibenzo-p-dioxins (PCDDs), polychlorinated dibenzofurans (PCDFs), polychlorinated biphenyls (PCBs), polybrominated diphenyl ethers (PBDEs) and hexachlorocyclohexanes (HCHs) in the northern watershed of Lake Victoria. Sediments and fish were collected from three highly polluted embayments (i.e. Murchison Bay, Napoleon Gulf and Thurston Bay) of the lake. The analysis for PCDD/Fs, PCBs and PBDEs was done using a high resolution mass spectrometer coupled to a gas chromatograph (GC), and a GC equipped with an electron capture detector was used for HCHs. Total (Σ) PCDD/Fs, PCBs and PBDEs in sediments ranged from 3.19 to 478, 313 to 4325 and 60.8 to 179 pg g-1 dry weight (dw), respectively. The highest concentrations of pollutants were found at sites close to industrial areas and wastewater discharge points. The maximum concentrations of PCDD/Fs, PCBs, PBDEs and HCHs in fish muscle homogenates were 49, 779, 495 and 45,900 pg g-1 wet weight (ww), respectively. The concentrations of the pollutants in Nile perch (Lates niloticus) were significantly greater than those in Nile tilapia (Oreochromis niloticus), possibly due to differences in trophic level and dietary feeding habits among fish species. World Health Organization-toxic equivalency quotient (WHO2005-TEQ) values in the sediments were up to 4.24 pg g-1 dw for PCDD/Fs and 0.55 pg TEQ g-1 dw for the 12 dioxin-like PCBs (dl-PCBs). 23.1% of the samples from the Napoleon Gulf were above the interim sediment quality guideline value of 0.85 pg WHO-TEQ g-1 dw set by the Canadian Council for Ministers of the Environment. The WHO2005-TEQs in fish were 0.001-0.16 pg g-1 for PCDD/Fs and 0.001-0.31 pg g-1 ww for dl- PCBs. The TEQ values were within a permissible level of 3.5 pg g−1 ww recommended by the European Commission. Based on the Commission set TEQs and minimum risk level criteria formulated by the Agency for Toxic Substances and Disease Registry, the consumption of fish from Lake Victoria gives no indication of health risks associated to PCDD/Fs and PCBs. Principal component analysis (PCA) indicated that anthropogenic activities such as agricultural straw open burning, medical waste incinerators and municipal solid waste combustors were the major sources of PCDD/Fs in the watershed of Lake Victoria. The ratios of α-/γ-HCH varied from 0.89 to 1.68 suggesting that the highest HCH residues mainly came from earlier usage and fresh γ-HCH (lindane). In the present study, the concentration of POPs in fish were not significantly related to those in sediments, and the biota sediment accumulation factor (BSAF) concept was found to be a poor predictor of the bioavailability and bioaccumulation of environmental pollutants.
Resumo:
This work evaluated the physicochemical composition of 171 red Brazilian wines from the 2006 vintage, which were represented by 21 varietals. These wines were produced by 58 Brazilian wineries in different regions of the country, with latitudes varying from 9º to 31º South. Physicochemical wine analysis was performed in the same year and discrimination in the viticultural regions, varietal wines, and wineries was performed by means of the principal component analysis (PCA). The main results show that wines from São Joaquim had higher values of A420, A520, A620, color intensity, total phenolic compounds, anthocyanins, and dry extracts, while those from Toledo had lower values of these variables; those from Vale do São Francisco had higher values of potassium, pH, density, and volatile acidity; from Serra do Nordeste A, they had higher titratable acidity; and from Planalto Superior B, higher hue. Regarding the varietal wines, PCA mainly discriminated the wines produced from the varieties Ancellotta, Teroldego, Egiodola, Refosco, Marselan, Cabernet Sauvignon, Pinotage, Pinot Noir, Malbec, Arinarnoa, Barbera, and Alfrocheiro. In relation to wineries, twenty two of them were discriminated by their higher values of some variables, i.e., three were characterized by color intensity; three by hue; eight by alcohol content; six by potassium, dry extract, density, and pH; and two by titratablel acidity.
Resumo:
The aim of this study was to determine the influence of process parameters and Passion Fruit Fiber (PFF) addition on the Glycemic Index (GI) of an extruded breakfast cereal. A 2³ Central Composite Rotational Design (CCRD) was used, with the following independent variables: raw material moisture content (18-28%), 2nd and 3rd barrel zone temperatures (120-160 ºC), and PFF (0-30%). Raw materials (organic corn flour and organic PFF) were characterized as to their proximate composition, particle size, and in vitro GI. The extrudates were characterized as to their in vitro GI. The Response Surface Methodology (RSM) and Principal Component Analysis (PCA) were used to analyze the results. Corn flour and PFF presented 8.55 and 7.63% protein, 2.61 and 0.60% fat, 0.52 and 6.17% ash, 78.77 and 78.86% carbohydrates (3 and 64% total dietary fiber), respectively. The corn flour particle size distribution was homogeneous, while PFF presented a heterogeneous particle size distribution. Corn flour and PFF presented values of GI of 48 and 45, respectively. When using RSM, no effect of the variables was observed in the GI of the extrudates (average value of 48.41), but PCA showed that the GI tended to be lower when processing at lower temperatures (<128 ºC) and at higher temperatures (>158 ºC). When compared to white bread, the extrudates showed a reduction of the GI of up to 50%, and could be considered an interesting alternative in weight and glycemia control diets.
Resumo:
The descriptive terminology and sensory prolife of four samples of Italian salami were determined using a methodology based on the Quantitative Descriptive Analysis (QDA). A sensory panel consensually defined sensory descriptors, their respective reference materials, and the descriptive evaluation ballot. Twelve individuals were selected as judges and properly trained. They used the following criteria: discriminating power, reproducibility, and individual consensus. Twelve descriptors were determined showing similarities and differences among the Italian salami samples. Each descriptor was evaluated using a 10 cm non-structured scale. The data were analyzed by ANOVA, Tukey test, and the Principal Component Analysis (PCA). The salami with coriander essential oil (T3) had lower rancid taste and rancid odor, whereas the control (T1) showed high values of these sensory attributes. Regarding brightness, T4 showed the best result. For the other attributes, T1, T2, T3, and T4 were similar.