937 resultados para principal components analysis (PCA) algorithm
Resumo:
Tässä diplomityössä tutkitaan tekniikoita, joillavesileima lisätään spektrikuvaan, ja menetelmiä, joilla vesileimat tunnistetaanja havaitaan spektrikuvista. PCA (Principal Component Analysis) -algoritmia käyttäen alkuperäisten kuvien spektriulottuvuutta vähennettiin. Vesileiman lisääminen spektrikuvaan suoritettiin muunnosavaruudessa. Ehdotetun mallin mukaisesti muunnosavaruuden komponentti korvattiin vesileiman ja toisen muunnosavaruuden komponentin lineaarikombinaatiolla. Lisäyksessä käytettävää parametrijoukkoa tutkittiin. Vesileimattujen kuvien laatu mitattiin ja analysoitiin. Suositukset vesileiman lisäykseen esitettiin. Useita menetelmiä käytettiin vesileimojen tunnistamiseen ja tunnistamisen tulokset analysoitiin. Vesileimojen kyky sietää erilaisia hyökkäyksiä tarkistettiin. Diplomityössä suoritettiin joukko havaitsemis-kokeita ottamalla huomioon vesileiman lisäyksessä käytetyt parametrit. ICA (Independent Component Analysis) -menetelmää pidetään yhtenä mahdollisena vaihtoehtona vesileiman havaitsemisessa.
Resumo:
Near-infrared spectroscopy (NIRS) was used to analyse the crude protein content of dried and milled samples of wheat and to discriminate samples according to their stage of growth. A calibration set of 72 samples from three growth stages of wheat (tillering, heading and harvest) and a validation set of 28 samples was collected for this purpose. Principal components analysis (PCA) of the calibration set discriminated groups of samples according to the growth stage of the wheat. Based on these differences, a classification procedure (SIMCA) showed a very accurate classification of the validation set samples : all of them were successfully classified in each group using this procedure when both the residual and the leverage were used in the classification criteria. Looking only at the residuals all the samples were also correctly classified except one of tillering stage that was assigned to both tillering and heading stages. Finally, the determination of the crude protein content of these samples was considered in two ways: building up a global model for all the growth stages, and building up local models for each stage, separately. The best prediction results for crude protein were obtained using a global model for samples in the two first growth stages (tillering and heading), and using a local model for the harvest stage samples.
Resumo:
BACKGROUND: quality of life (QoL) is a subjective perception whose components may vary in importance between individuals. Little is known about which domains of QoL older people deem most important. OBJECTIVE: this study investigated in community-dwelling older people the relationships between the importance given to domains defining their QoL and socioeconomic, demographic and health status. METHODS: data were compiled from older people enrolled in the Lc65+ cohort study and two additional, population-based, stratified random samples (n = 5,300). Principal components analysis (PCA) was used to determine the underlying domains among 28 items that participants defined as important to their QoL. The components extracted were used as dependent variables in multiple linear regression models to explore their associations with socioeconomic, demographic and health status. RESULTS: PCA identified seven domains that older persons considered important to their QoL. In order of importance (highest to lowest): feeling of safety, health and mobility, autonomy, close entourage, material resources, esteem and recognition, and social and cultural life. A total of six and five domains of importance were significantly associated with education and depressive symptoms, respectively. The importance of material resources was significantly associated with a good financial situation (β = 0.16, P = 0.011), as was close entourage with living with others (β = 0.20, P = 0.007) and as was health and mobility with age (β = -0.16, P = 0.014). CONCLUSION: the importance older people give to domains of their QoL appears strongly related to their actual resources and experienced losses. These findings may help clinicians, researchers and policy makers better adapt strategies to individuals' needs.
Resumo:
The combination of high performance exclusion chromatography (HPEC) and gas chromatography (GC) was applied to the analysis of six coffee samples that were previously characterized by sensory tests as of good or poor quality. The data obtained by the two techniques were statistically evaluated by "Principal Components Analysis" (PCA) using selected peak areas. The results showed the potential of the described techniques for coffee analysis. The HPEC technique monitored with the U.V. detector at 272 nm and followed by PCA may be correlated with sensorial data, particularly if a wider group of samples is used.
Resumo:
A dataset of chemical properties of the elements is used herein to introduce principal components analysis (PCA). The focus in this article is to verify the classification of the elements within the periodic table. The reclassification of the semimetals as metals or nonmetals emerges naturally from PCA and agrees with the current SBQ/IUPAC periodic table. Dataset construction, basic preprocessing, loading and score plots, and interpretation have been emphasized. This activity can be carried out even when students with distinct levels of formation are together in the same learning environment.
Resumo:
The concentration of 14 organic acids of 50 sugarcane spirits samples was determined by gas chromatography using flame ionization detection. The organic acids analytical quantitative profile in stills and column distilled spirits from wines obtained from the same must were compared. The comparison was also carried in "head", "heart" and "tail fractions of stills distilled spirits. The experimental data were analyzed by Principal Components Analysis (PCA) and pointed out that the distillation process (stills and column) strongly influences the lead spirits' organic acid composition and that producers' operational "cuts off" to produce "tail", "heart" and "head", fractions should be optimized.
Resumo:
Total spectrofluorimetry associated to Principal Components Analysis (PCA) were used to classify into different groups the samples of diesel oil, biodiesel, vegetal oil and residual oil, as well as, to identify addition of non-transesterified residual vegetable oil, instead of biodiesel, to the diesel oil. Using this method, the samples of diesel oil, mixtures of biodiesel in diesel and mixtures of residual oil in diesel were separated into well-defined groups.
Resumo:
We propose an analytical method based on fourier transform infrared-attenuated total reflectance (FTIR-ATR) spectroscopy to detect the adulteration of petrodiesel and petrodiesel/palm biodiesel blends with African crude palm oil. The infrared spectral fingerprints from the sample analysis were used to perform principal components analysis (PCA) and to construct a prediction model using partial least squares (PLS) regression. The PCA results separated the samples into three groups, allowing identification of those subjected to adulteration with palm oil. The obtained model shows a good predictive capacity for determining the concentration of palm oil in petrodiesel/biodiesel blends. Advantages of the proposed method include cost-effectiveness and speed; it is also environmentally friendly.
Resumo:
Visual data mining (VDM) tools employ information visualization techniques in order to represent large amounts of high-dimensional data graphically and to involve the user in exploring data at different levels of detail. The users are looking for outliers, patterns and models – in the form of clusters, classes, trends, and relationships – in different categories of data, i.e., financial, business information, etc. The focus of this thesis is the evaluation of multidimensional visualization techniques, especially from the business user’s perspective. We address three research problems. The first problem is the evaluation of projection-based visualizations with respect to their effectiveness in preserving the original distances between data points and the clustering structure of the data. In this respect, we propose the use of existing clustering validity measures. We illustrate their usefulness in evaluating five visualization techniques: Principal Components Analysis (PCA), Sammon’s Mapping, Self-Organizing Map (SOM), Radial Coordinate Visualization and Star Coordinates. The second problem is concerned with evaluating different visualization techniques as to their effectiveness in visual data mining of business data. For this purpose, we propose an inquiry evaluation technique and conduct the evaluation of nine visualization techniques. The visualizations under evaluation are Multiple Line Graphs, Permutation Matrix, Survey Plot, Scatter Plot Matrix, Parallel Coordinates, Treemap, PCA, Sammon’s Mapping and the SOM. The third problem is the evaluation of quality of use of VDM tools. We provide a conceptual framework for evaluating the quality of use of VDM tools and apply it to the evaluation of the SOM. In the evaluation, we use an inquiry technique for which we developed a questionnaire based on the proposed framework. The contributions of the thesis consist of three new evaluation techniques and the results obtained by applying these evaluation techniques. The thesis provides a systematic approach to evaluation of various visualization techniques. In this respect, first, we performed and described the evaluations in a systematic way, highlighting the evaluation activities, and their inputs and outputs. Secondly, we integrated the evaluation studies in the broad framework of usability evaluation. The results of the evaluations are intended to help developers and researchers of visualization systems to select appropriate visualization techniques in specific situations. The results of the evaluations also contribute to the understanding of the strengths and limitations of the visualization techniques evaluated and further to the improvement of these techniques.
Resumo:
Abstract In search for renewable energy sources, the Brazilian residual biomasses stand out due to their favorable physical and chemical properties, low cost, and their being less pollutant. Therefore, they are likely to be used in biorefineries in the production of chemical inputs to substitute fossil fuels. This substitution is possible due to the high contents of carbohydrates (>40%), low contents of extractives (<20%), ashes (<8%) and moisture (<8%) found in these residual biomasses. High calorific values of all residues also offer them the chance to be used in combustion. A principal components analysis (PCA) was performed for better understanding of the samples and their hysic-chemical properties. Thus, this study aimed to characterize biomasses from the north (babassu residues, such as mesocarp and endocarp; pequi and Brazil nut) and northeast (agave and coconut) regions of Brazil, in order to contribute to the preservation of the environment and strengthen the economy of the region.
Resumo:
Larval habitat for three highland Anopheles species: Anopheles albimanus Wiedemann, Anopheles pseudopunctipennis Theobald, and Anopheles punctimacula Dyar and Knab was related to human land uses, rivers, roads, and remotely sensed land cover classifications in the western Ecuadorian Andes. Of the five commonly observed human land uses, cattle pasture (n = 30) provided potentially suitable habitat for A. punctimacula and A. albimanus in less than 14% of sites, and was related in a principal components analysis (PCA) to the presence of macrophyte vegetation, greater surface area, clarity, and algae cover. Empty lots (n = 30) were related in the PCA to incident sunlight and provided potential habitat for A. pseudopunctipennis and A. albimanus in less than 14% of sites. The other land uses surveyed (banana, sugarcane, and mixed tree plantations; n = 28, 21, 25, respectively) provided very little standing water that could potentially be used for larval habitat. River edges and eddies (n = 41) were associated with greater clarity, depth, temperature, and algae cover, which provide potentially suitable habitat for A. albimanus in 58% of sites and A. pseudopunctipennis in 29% of sites. Road-associated water bodies (n = 38) provided potential habitat for A. punctimacula in 44% of sites and A. albimanus in 26% of sites surveyed. Species collection localities were compared to land cover classifications using Geographic Information Systems software. All three mosquito species were associated more often with the category “closed/open broadleaved evergreen and/or semi-deciduous forests” than expected (P ≤ 0.01 in all cases), given such a habitat’s abundance. This study provides evidence that specific human land uses create habitat for potential malaria vectors in highland regions of the Andes.
Resumo:
This thesis entitled “Studies on Nitrifying Microorganisms in Cochin Estuary and Adjacent Coastal Waters” reports for the first time the spatial andtemporal variations in the abundance and activity of nitrifiers (Ammonia oxidizingbacteria-AOB; Nitrite oxidizing bacteria- NOB and Ammonia oxidizing archaea-AOA) from the Cochin Estuary (CE), a monsoon driven, nutrient rich tropicalestuary along the southwest coast of India. To fulfil the above objectives, field observations were carried out for aperiod of one year (2011) in the CE. Surface (1 m below surface) and near-bottomwater samples were collected from four locations (stations 1 to 3 in estuary and 4 in coastal region), covering pre-monsoon, monsoon and post-monsoon seasons. Station 1 is a low saline station (salinity range 0-10) with high freshwater influx While stations 2 and 3 are intermediately saline stations (salinity ranges 10-25). Station 4 is located ~20 km away from station 3 with least influence of fresh water and is considered as high saline (salinity range 25- 35) station. Ambient physicochemical parameters like temperature, pH, salinity, dissolved oxygen (DO), Ammonium, nitrite, nitrate, phosphate and silicate of surface and bottom waters were measured using standard techniques. Abundance of Eubacteria, total Archaea and ammonia and nitrite oxidizing bacteria (AOB and NOB) were quantified using Fluorescent in situ Hybridization (FISH) with oligonucleotide probes labeled withCy3. Community structure of AOB and AOA was studied using PCR Denaturing Gradient Gel Electrophoresis (DGGE) technique. PCR products were cloned and sequenced to determine approximate phylogenetic affiliations. Nitrification rate in the water samples were analyzed using chemical NaClO3 (inhibitor of nitrite oxidation), and ATU (inhibitor of ammonium oxidation). Contribution of AOA and AOB in ammonia oxidation process was measured based on the recovered ammonia oxidation rate. The contribution of AOB and AOA were analyzed after inhibiting the activities of AOB and AOA separately using specific protein inhibitors. To understand the factors influencing or controlling nitrification, various statistical tools were used viz. Karl Pearson’s correlation (to find out the relationship between environmental parameters, bacterial abundance and activity), three-way ANOVA (to find out the significant variation between observations), Canonical Discriminant Analysis (CDA) (for the discrimination of stations based on observations), Multivariate statistics, Principal components analysis (PCA) and Step up multiple regression model (SMRM) (First order interaction effects were applied to determine the significantly contributing biological and environmental parameters to the numerical abundance of nitrifiers). In the CE, nitrification is modulated by the complex interplay between different nitrifiers and environmental variables which in turn is dictated by various hydrodynamic characteristics like fresh water discharge and seawater influx brought in by river water discharge and flushing. AOB in the CE are more adapted to varying environmental conditions compared to AOA though the diversity of AOA is higher than AOB. The abundance and seasonality of AOB and NOB is influenced by the concentration of ammonia in the water column. AOB are the major players in modulating ammonia oxidation process in the water column of CE. The distribution pattern and seasonality of AOB and NOB in the CE suggest that these organisms coexist, and are responsible for modulating the entire nitrification process in the estuary. This process is fuelled by the cross feeding among different nitrifiers, which in turn is dictated by nutrient levels especially ammonia. Though nitrification modulates the increasing anthropogenic ammonia concentration the anthropogenic inputs have to be controlled to prevent eutrophication and associated environmental changes.
Resumo:
Functional Data Analysis (FDA) deals with samples where a whole function is observed for each individual. A particular case of FDA is when the observed functions are density functions, that are also an example of infinite dimensional compositional data. In this work we compare several methods for dimensionality reduction for this particular type of data: functional principal components analysis (PCA) with or without a previous data transformation and multidimensional scaling (MDS) for diferent inter-densities distances, one of them taking into account the compositional nature of density functions. The difeerent methods are applied to both artificial and real data (households income distributions)
Resumo:
Aim Species distribution models (SDMs) based on current species ranges underestimate the potential distribution when projected in time and/or space. A multi-temporal model calibration approach has been suggested as an alternative, and we evaluate this using 13,000 years of data. Location Europe. Methods We used fossil-based records of presence for Picea abies, Abies alba and Fagus sylvatica and six climatic variables for the period 13,000 to 1000 yr bp. To measure the contribution of each 1000-year time step to the total niche of each species (the niche measured by pooling all the data), we employed a principal components analysis (PCA) calibrated with data over the entire range of possible climates. Then we projected both the total niche and the partial niches from single time frames into the PCA space, and tested if the partial niches were more similar to the total niche than random. Using an ensemble forecasting approach, we calibrated SDMs for each time frame and for the pooled database. We projected each model to current climate and evaluated the results against current pollen data. We also projected all models into the future. Results Niche similarity between the partial and the total-SDMs was almost always statistically significant and increased through time. SDMs calibrated from single time frames gave different results when projected to current climate, providing evidence of a change in the species realized niches through time. Moreover, they predicted limited climate suitability when compared with the total-SDMs. The same results were obtained when projected to future climates. Main conclusions The realized climatic niche of species differed for current and future climates when SDMs were calibrated considering different past climates. Building the niche as an ensemble through time represents a way forward to a better understanding of a species' range and its ecology in a changing climate.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)