53 resultados para Principal component analysis (PCA)
em Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP)
Resumo:
This paper describes a chemotaxonomic analysis of a database of triterpenoid compounds from the Celastraceae family using principal component analysis (PCA). The numbers of occurrences of thirty types of triterpene skeleton in different tribes of the family were used as variables. The study shows that PCA applied to chemical data can contribute to an intrafamilial classification of Celastraceae, once some questionable taxa affinity was observed, from chemotaxonomic inferences about genera and they are in agreement with the phylogeny previously proposed. The inclusion of Hippocrateaceae within Celastraceae is supported by the triterpene chemistry.
Resumo:
Three-dimensional spectroscopy techniques are becoming more and more popular, producing an increasing number of large data cubes. The challenge of extracting information from these cubes requires the development of new techniques for data processing and analysis. We apply the recently developed technique of principal component analysis (PCA) tomography to a data cube from the center of the elliptical galaxy NGC 7097 and show that this technique is effective in decomposing the data into physically interpretable information. We find that the first five principal components of our data are associated with distinct physical characteristics. In particular, we detect a low-ionization nuclear-emitting region (LINER) with a weak broad component in the Balmer lines. Two images of the LINER are present in our data, one seen through a disk of gas and dust, and the other after scattering by free electrons and/or dust particles in the ionization cone. Furthermore, we extract the spectrum of the LINER, decontaminated from stellar and extended nebular emission, using only the technique of PCA tomography. We anticipate that the scattered image has polarized light due to its scattered nature.
Resumo:
Krameria plants are found in arid regions of the Americas and present a floral system that attracts oil-collecting bees. Niche modeling and multivariate tools were applied to examine ecological and geographical aspects of the 18 species of this genus, using occurrence data obtained from herbaria and literature. Niche modeling showed the potential areas of occurrence for each species and the analysis of climatic variables suggested that North American species occur mostly in deserted or xeric ecoregions with monthly precipitation below 140 mm and large temperature ranges. South American species are mainly found in deserted ecoregions and subtropical savannas where monthly precipitation often exceeds 150 mm and temperature ranges are smaller. Principal Component Analysis (PCA) performed with values of temperature and precipitation showed that the distribution limits of Krameria species are primarily associated with maximum and minimum temperatures. Modeling of Krameria species proved to be a useful tool for analyzing the influence of the ecological niche variables in the geographical distribution of species, providing new information to guide future investigations. (C) 2011 Elsevier Ltd. All rights reserved.
Resumo:
Aims. A model-independent reconstruction of the cosmic expansion rate is essential to a robust analysis of cosmological observations. Our goal is to demonstrate that current data are able to provide reasonable constraints on the behavior of the Hubble parameter with redshift, independently of any cosmological model or underlying gravity theory. Methods. Using type Ia supernova data, we show that it is possible to analytically calculate the Fisher matrix components in a Hubble parameter analysis without assumptions about the energy content of the Universe. We used a principal component analysis to reconstruct the Hubble parameter as a linear combination of the Fisher matrix eigenvectors (principal components). To suppress the bias introduced by the high redshift behavior of the components, we considered the value of the Hubble parameter at high redshift as a free parameter. We first tested our procedure using a mock sample of type Ia supernova observations, we then applied it to the real data compiled by the Sloan Digital Sky Survey (SDSS) group. Results. In the mock sample analysis, we demonstrate that it is possible to drastically suppress the bias introduced by the high redshift behavior of the principal components. Applying our procedure to the real data, we show that it allows us to determine the behavior of the Hubble parameter with reasonable uncertainty, without introducing any ad-hoc parameterizations. Beyond that, our reconstruction agrees with completely independent measurements of the Hubble parameter obtained from red-envelope galaxies.
Resumo:
Fatty acid synthase (FASN) is the metabolic enzyme responsible for the endogenous synthesis of the saturated long-chain fatty acid palmitate. In contrast to most normal cells, FASN is overexpressed in a variety of human cancers including cutaneous melanoma, in which its levels of expression are associated with a poor prognosis and depth of invasion. Recently, we have demonstrated the mitochondrial involvement in FASN inhibition-induced apoptosis in melanoma cells. Herein we compare, via electrospray ionization mass spectrometry (ESI-MS), free fatty acids (FFA) composition of mitochondria isolated from control (EtOH-treated cells) and Orlistat-treated B16-F10 mouse melanoma cells. Principal component analysis (PCA) was applied to the ESI-MS data and found to separate the two groups of samples. Mitochondria from control cells showed predominance of six ions, that is, those of m/z 157 (Pelargonic, 9:0), 255 (Palmitic, 16:0), 281 (Oleic, 18:1), 311 (Arachidic, 20:0), 327 (Docosahexaenoic, 22:6) and 339 (Behenic, 22:0). In contrast, FASN inhibition with Orlistat changes significantly mitochondrial FFA composition by reducing synthesis of palmitic acid, and its elongation and unsaturation products, such as arachidic and behenic acids, and oleic acid, respectively. ESI-MS of mitochondria isolated from Orlistat-treated cells presented therefore three major ions of m/z 157 (Pelargonic, 9:0), 193 (unknown) and 199 (Lauric, 12:0). These findings demonstrate therefore that FASN inhibition by Orlistat induces significant changes in the FFA composition of mitochondria. Copyright (C) 2011 John Wiley & Sons, Ltd.
Resumo:
The identification, modeling, and analysis of interactions between nodes of neural systems in the human brain have become the aim of interest of many studies in neuroscience. The complex neural network structure and its correlations with brain functions have played a role in all areas of neuroscience, including the comprehension of cognitive and emotional processing. Indeed, understanding how information is stored, retrieved, processed, and transmitted is one of the ultimate challenges in brain research. In this context, in functional neuroimaging, connectivity analysis is a major tool for the exploration and characterization of the information flow between specialized brain regions. In most functional magnetic resonance imaging (fMRI) studies, connectivity analysis is carried out by first selecting regions of interest (ROI) and then calculating an average BOLD time series (across the voxels in each cluster). Some studies have shown that the average may not be a good choice and have suggested, as an alternative, the use of principal component analysis (PCA) to extract the principal eigen-time series from the ROI(s). In this paper, we introduce a novel approach called cluster Granger analysis (CGA) to study connectivity between ROIs. The main aim of this method was to employ multiple eigen-time series in each ROI to avoid temporal information loss during identification of Granger causality. Such information loss is inherent in averaging (e.g., to yield a single ""representative"" time series per ROI). This, in turn, may lead to a lack of power in detecting connections. The proposed approach is based on multivariate statistical analysis and integrates PCA and partial canonical correlation in a framework of Granger causality for clusters (sets) of time series. We also describe an algorithm for statistical significance testing based on bootstrapping. By using Monte Carlo simulations, we show that the proposed approach outperforms conventional Granger causality analysis (i.e., using representative time series extracted by signal averaging or first principal components estimation from ROIs). The usefulness of the CGA approach in real fMRI data is illustrated in an experiment using human faces expressing emotions. With this data set, the proposed approach suggested the presence of significantly more connections between the ROIs than were detected using a single representative time series in each ROI. (c) 2010 Elsevier Inc. All rights reserved.
Resumo:
Astronomy has evolved almost exclusively by the use of spectroscopic and imaging techniques, operated separately. With the development of modern technologies, it is possible to obtain data cubes in which one combines both techniques simultaneously, producing images with spectral resolution. To extract information from them can be quite complex, and hence the development of new methods of data analysis is desirable. We present a method of analysis of data cube (data from single field observations, containing two spatial and one spectral dimension) that uses Principal Component Analysis (PCA) to express the data in the form of reduced dimensionality, facilitating efficient information extraction from very large data sets. PCA transforms the system of correlated coordinates into a system of uncorrelated coordinates ordered by principal components of decreasing variance. The new coordinates are referred to as eigenvectors, and the projections of the data on to these coordinates produce images we will call tomograms. The association of the tomograms (images) to eigenvectors (spectra) is important for the interpretation of both. The eigenvectors are mutually orthogonal, and this information is fundamental for their handling and interpretation. When the data cube shows objects that present uncorrelated physical phenomena, the eigenvector`s orthogonality may be instrumental in separating and identifying them. By handling eigenvectors and tomograms, one can enhance features, extract noise, compress data, extract spectra, etc. We applied the method, for illustration purpose only, to the central region of the low ionization nuclear emission region (LINER) galaxy NGC 4736, and demonstrate that it has a type 1 active nucleus, not known before. Furthermore, we show that it is displaced from the centre of its stellar bulge.
Resumo:
Medium density fiberboard (MDF) is an engineered wood product formed by breaking down selected lignin-cellulosic material residuals into fibers, combining it with wax and a resin binder, and then forming panels by applying high temperature and pressure. Because the raw material in the industrial process is ever-changing, the panel industry requires methods for monitoring the composition of their products. The aim of this study was to estimate the ratio of sugarcane (SC) bagasse to Eucalyptus wood in MDF panels using near infrared (NIR) spectroscopy. Principal component analysis (PCA) and partial least square (PLS) regressions were performed. MDF panels having different bagasse contents were easily distinguished from each other by the PCA of their NIR spectra with clearly different patterns of response. The PLS-R models for SC content of these MDF samples presented a strong coefficient of determination (0.96) between the NIR-predicted and Lab-determined values and a low standard error of prediction (similar to 1.5%) in the cross-validations. A key role of resins (adhesives), cellulose, and lignin for such PLS-R calibrations was shown. PLS-DA model correctly classified ninety-four percent of MDF samples by cross-validations and ninety-eight percent of the panels by independent test set. These NIR-based models can be useful to quickly estimate sugarcane bagasse vs. Eucalyptus wood content ratio in unknown MDF samples and to verify the quality of these engineered wood products in an online process.
Resumo:
Natural products have widespread biological activities, including inhibition of mitochondrial enzyme systems. Some of these activities, for example cytotoxicity, may be the result of alteration of cellular bioenergetics. Based on previous computer-aided drug design (CADD) studies and considering reported data on structure-activity relationships (SAR), an assumption regarding the mechanism of action of natural products against parasitic infections involves the NADH-oxidase inhibition. In this study, chemometric tools, such as: Principal Component Analysis (PCA), Consensus PCA (CPCA), and partial least squares regression (PLS), were applied to a set of forty natural compounds, acting as NADH-oxidase inhibitors. The calculations were performed using the VolSurf+ program. The formalisms employed generated good exploratory and predictive results. The independent variables or descriptors having a hydrophobic profile were strongly correlated to the biological data.
Resumo:
This paper proposes a novel computer vision approach that processes video sequences of people walking and then recognises those people by their gait. Human motion carries different information that can be analysed in various ways. The skeleton carries motion information about human joints, and the silhouette carries information about boundary motion of the human body. Moreover, binary and gray-level images contain different information about human movements. This work proposes to recover these different kinds of information to interpret the global motion of the human body based on four different segmented image models, using a fusion model to improve classification. Our proposed method considers the set of the segmented frames of each individual as a distinct class and each frame as an object of this class. The methodology applies background extraction using the Gaussian Mixture Model (GMM), a scale reduction based on the Wavelet Transform (WT) and feature extraction by Principal Component Analysis (PCA). We propose four new schemas for motion information capture: the Silhouette-Gray-Wavelet model (SGW) captures motion based on grey level variations; the Silhouette-Binary-Wavelet model (SBW) captures motion based on binary information; the Silhouette-Edge-Binary model (SEW) captures motion based on edge information and the Silhouette Skeleton Wavelet model (SSW) captures motion based on skeleton movement. The classification rates obtained separately from these four different models are then merged using a new proposed fusion technique. The results suggest excellent performance in terms of recognising people by their gait.
Resumo:
Sigma phase is a deleterious one which can be formed in duplex stainless steels during heat treatment or welding. Aiming to accompany this transformation, ferrite and sigma percentage and hardness were measured on samples of a UNS S31803 duplex stainless steel submitted to heat treatment. These results were compared to measurements obtained from ultrasound and eddy current techniques, i.e., velocity and impedance, respectively. Additionally, backscattered signals produced by wave propagation were acquired during ultrasonic inspection as well as magnetic Barkhausen noise during magnetic inspection. Both signal types were processed via a combination of detrended-fluctuation analysis (DFA) and principal component analysis (PCA). The techniques used were proven to be sensitive to changes in samples related to sigma phase formation due to heat treatment. Furthermore, there is an advantage using these methods since they are nondestructive. (C) 2010 Elsevier B.V. All rights reserved.
Resumo:
In this work, chemometric methods are reported as potential tools for monitoring the authenticity of Brazilian ultra-high temperature (UHT) milk processed in industrial plants located in different regions of the country. A total of 100 samples were submitted to the qualitative analysis of adulterants such as starch, chlorine, formal. hydrogen peroxide and urine. Except for starch, all the samples reported, at least, the presence of one adulterant. The use of chemometric methodologies such as the Principal Component Analysis (PCA) and Hierarchical Cluster Analysis (HCA) enabled the verification of the occurrence of certain adulterations in specific regions. The proposed multivariate approaches may allow the sanitary agency authorities to optimise materials, human and financial resources, as they associate the occurrence of adulterations to the geographical location of the industrial plants. (c) 2010 Elsevier Ltd. All rights reserved.
Resumo:
Particulate matter, especially PM2.5, is associated with increased morbidity and mortality from respiratory diseases. Studies that focus on the chemical composition of the material are frequent in the literature, but those that characterize the biological fraction are rare. The objectives of this study were to characterize samples collected in Sao Paulo, Brazil on the quantity of fungi and endotoxins associated with PM2.5, correlating with the mass of particulate matter, chemical composition and meteorological parameters. We did that by Principal Component Analysis (PCA) and multiple linear regressions. The results have shown that fungi and endotoxins represent significant portion of PM2.5, reaching average concentrations of 772.23 spores mu g(-1) of PM2.5 (SD: 400.37) and 5.52 EU mg(-1) of PM2.5 (SD: 4.51 EU mg(-1)), respectively. Hyaline basidiospores, Cladosporium and total spore counts were correlated to factor Ba/Ca/Fe/Zn/K/Si of PM2.5 (p < 0.05). Genera Pen/Asp were correlated to the total mass of PM2.5 (p < 0.05) and colorless ascospores were correlated to humidity (p < 0.05). Endotoxin was positively correlated with the atmospheric temperature (p < 0.05). This study has shown that bioaerosol is present in considerable amounts in PM2.5 in the atmosphere of Sao Paulo, Brazil. Some fungi were correlated with soil particle resuspension and mass of particulate matter. Therefore, the relative contribution of bioaerosol in PM2.5 should be considered in future studies aimed at evaluating the clinical impact of exposure to air pollution. (C) 2010 Elsevier Ltd. All rights reserved.
Resumo:
Objectives The study`s aims were to evaluate the antimycobacterial activity of 13 synthetic neolignan analogues and to perform structure activity relationship analysis (SAR). The cytotoxicity of the compound 2-phenoxy-1-phenylethanone (LS-2, 1) in mammalian cells, such as the acute toxicity in mice, was also evaluated. Methods The extra and intracellular antimycobacterial activity was evaluated on Mycobacterium tuberculosis H37Rv. Cytotoxicity studies were performed using V79 cells, J774 macrophages and rat hepatocytes. Additionally, the in-vivo acute toxicity was tested in mice. The SAR analysis was performed by Principal Component Analysis (PCA). Key findings Among the 13 analogues tested, LS-2 (1) was the most effective, showing promising antimycobacterial activity and very low cytotoxicity in V79 cells and in J774 macrophages, while no toxicity was observed in rat hepatocytes. The selectivity index (SI) of LS-2 (1) was 91 and the calculated LD50 was 1870 mg/kg, highlighting the very low toxicity in mice. SAR analysis showed that the highest electrophilicity and the lowest molar volume are physical-chemical characteristics important for the antimycobacterial activity of the LS-2 (1). Conclusions LS-2 (1) showed promising antimycobacterial activity and very weak cytotoxicity in cell culture, as well as an absence of toxicity in primary culture of hepatocytes. In the acute toxicity study there was an indication of absence of toxicity on murine models, in vivo.
Resumo:
We present two-dimensional stellar and gaseous kinematics of the inner 120 x 250 pc2 of the LINER/Seyfert 1 galaxy M81, from optical spectra obtained with the Gemini Multi-Object Spectrograph (GMOS) integral field spectrograph on the Gemini-North telescope at a spatial resolution of approximate to 10 pc. The stellar velocity field shows circular rotation and, overall, is very similar to the published large-scale velocity field, but deviations are observed close to the minor axis which can be attributed to stellar motions possibly associated with a nuclear bar. The stellar velocity dispersion of the bulge is 162 +/- 15 km s-1, in good agreement with previous measurements and leading to a black hole mass of M(BH) = 5.5+3.6(-2.0) x 107 M(circle dot) based on the M(BH)-Sigma relationship. The gas kinematics is dominated by non-circular motions and the subtraction of the stellar velocity field reveals blueshifts of approximate to-100 km s-1 on the far side of the galaxy and a few redshifts on the near side. These characteristics can be interpreted in terms of streaming towards the centre if the gas is in the plane. On the basis of the observed velocities and geometry of the flow, we estimate a mass inflow rate in ionized gas of approximate to 4.0 x 10-3 M(circle dot) yr-1, which is of the order of the accretion rate necessary to power the LINER nucleus of M81. We have also applied the technique of principal component analysis (PCA) to our data, which reveals the presence of a rotating nuclear gas disc within approximate to 50 pc from the nucleus and a compact outflow, approximately perpendicular to the disc. The PCA combined with the observed gas velocity field shows that the nuclear disc is being fed by gas circulating in the galaxy plane. The presence of the outflow is supported by a compact jet seen in radio observations at a similar orientation, as well as by an enhancement of the [O i]/H alpha line ratio, probably resulting from shock excitation of the circumnuclear gas by the radio jet. With these observations we are thus resolving both the feeding - via the nuclear disc and observed gas inflow, and the feedback - via the outflow, around the low-luminosity active nucleus of M81.