74 resultados para Principal Components
em Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP)
Resumo:
Objectives: The aim of this work was to verify the differentiation between normal and pathological human carotid artery tissues by using fluorescence and reflectance spectroscopy in the 400- to 700-nm range and the spectral characterization by means of principal components analysis. Background Data: Atherosclerosis is the most common and serious pathology of the cardiovascular system. Principal components represent the main spectral characteristics that occur within the spectral data and could be used for tissue classification. Materials and Methods: Sixty postmortem carotid artery fragments (26 non-atherosclerotic and 34 atherosclerotic with non-calcified plaques) were studied. The excitation radiation consisted of a 488-nm argon laser. Two 600-mu m core optical fibers were used, one for excitation and one to collect the fluorescence radiation from the samples. The reflectance system was composed of a halogen lamp coupled to an excitation fiber positioned in one of the ports of an integrating sphere that delivered 5 mW to the sample. The photo-reflectance signal was coupled to a 1/4-m spectrograph via an optical fiber. Euclidean distance was then used to classify each principal component score into one of two classes, normal and atherosclerotic tissue, for both fluorescence and reflectance. Results: The principal components analysis allowed classification of the samples with 81% sensitivity and 88% specificity for fluorescence, and 81% sensitivity and 91% specificity for reflectance. Conclusions: Our results showed that principal components analysis could be applied to differentiate between normal and atherosclerotic tissue with high sensitivity and specificity.
Resumo:
Three-dimensional spectroscopy techniques are becoming more and more popular, producing an increasing number of large data cubes. The challenge of extracting information from these cubes requires the development of new techniques for data processing and analysis. We apply the recently developed technique of principal component analysis (PCA) tomography to a data cube from the center of the elliptical galaxy NGC 7097 and show that this technique is effective in decomposing the data into physically interpretable information. We find that the first five principal components of our data are associated with distinct physical characteristics. In particular, we detect a low-ionization nuclear-emitting region (LINER) with a weak broad component in the Balmer lines. Two images of the LINER are present in our data, one seen through a disk of gas and dust, and the other after scattering by free electrons and/or dust particles in the ionization cone. Furthermore, we extract the spectrum of the LINER, decontaminated from stellar and extended nebular emission, using only the technique of PCA tomography. We anticipate that the scattered image has polarized light due to its scattered nature.
Resumo:
Aims. A model-independent reconstruction of the cosmic expansion rate is essential to a robust analysis of cosmological observations. Our goal is to demonstrate that current data are able to provide reasonable constraints on the behavior of the Hubble parameter with redshift, independently of any cosmological model or underlying gravity theory. Methods. Using type Ia supernova data, we show that it is possible to analytically calculate the Fisher matrix components in a Hubble parameter analysis without assumptions about the energy content of the Universe. We used a principal component analysis to reconstruct the Hubble parameter as a linear combination of the Fisher matrix eigenvectors (principal components). To suppress the bias introduced by the high redshift behavior of the components, we considered the value of the Hubble parameter at high redshift as a free parameter. We first tested our procedure using a mock sample of type Ia supernova observations, we then applied it to the real data compiled by the Sloan Digital Sky Survey (SDSS) group. Results. In the mock sample analysis, we demonstrate that it is possible to drastically suppress the bias introduced by the high redshift behavior of the principal components. Applying our procedure to the real data, we show that it allows us to determine the behavior of the Hubble parameter with reasonable uncertainty, without introducing any ad-hoc parameterizations. Beyond that, our reconstruction agrees with completely independent measurements of the Hubble parameter obtained from red-envelope galaxies.
Resumo:
A origem e a dispersão dos povos Tupiguarani têm sido intensamente debatidas entre arqueólogos e linguistas nas últimas cinco décadas. Em resumo, pode-se dizer que a ideia de que esses povos, que ocuparam grande parte do território brasileiro e parte da Bolívia, do Paraguai, do Uruguai e da Argentina, tiveram sua etnogênese na Amazônia e dali partiram para o leste e para o sul, por volta de 2.500 anos antes do presente, é bastante aceita entre os especialistas, embora uma dispersão no sentido oposto, isto é, do sul para o norte, com origem na bacia do Tietê-Paraná, não seja completamente descartada. Entre os arqueólogos que consideram a Amazônia como berço desses povos, alguns acreditam que esse surgimento se deu na Amazônia central. Outros acreditam que a etnogênese Tupiguarani ocorreu no sudoeste da Amazônia, onde hoje se concentra a maior diversidade linguística do tronco Tupi. Neste trabalho, a morfologia de 19 crânios associados à cerâmica Tupiguarani ou etnograficamente classificados como tais foram comparados a várias séries cranianas pré-históricas e etnográficas brasileiras por meio de estatísticas multivariadas. Duas técnicas multivariadas foram empregadas: Análise de Componentes Principais, aplicada sobre os centróides de cada série, e Distâncias de Mahalanobis, aplicadas aos dados individuais. Os resultados obtidos sugerem uma origem amazônica para os povos Tupiguarani, sobretudo pela forte associação encontrada entre crânios Tupi e Guarani do sudeste e do sul brasileiro e dos Tupi do norte do Brasil, com os espécimes provenientes da ilha de Marajó incluídos no estudo.
Resumo:
The hedonic level of commercial cachaças, was evaluated by consumers and by a tasters. The results of sensorial methods analyzed trough Principal Components Analysis, Hierarchical Cluster Analysis and the Pearson linear correlation indicated that the best classified cachaças were produced in copper stills and aged in oak casks. By contrast the worst classified exhibited as the main features be not aged and high alcohol percentage. The index of preference is positively correlated with the intensity of yellow color, wood flavor, sweetness and fruit aroma. There is a negative preference correlation with the acidity, the taste of alcohol and bitterness.
Resumo:
The concentration of 14 organic acids of 50 sugarcane spirits samples was determined by gas chromatography using flame ionization detection. The organic acids analytical quantitative profile in stills and column distilled spirits from wines obtained from the same must were compared. The comparison was also carried in "head", "heart" and "tail fractions of stills distilled spirits. The experimental data were analyzed by Principal Components Analysis (PCA) and pointed out that the distillation process (stills and column) strongly influences the lead spirits' organic acid composition and that producers' operational "cuts off" to produce "tail", "heart" and "head", fractions should be optimized.
Resumo:
OBJETIVOS: identificar os padrões alimentares de crianças e sua associação com o nível socioeconômico das famílias. MÉTODOS: estudo transversal com 1260 crianças de 4 a 11 anos, residentes em Salvador-Bahia que incluiu aplicação de um Questionário de Frequência Alimentar semi-quantitativo. Os padrões alimentares foram identificados, empregando-se análise fatorial por componentes principais. O nível socioeconômico foi avaliado por meio de um indicador socioeconômico composto. Regressão logística multivariada foi empregada. RESULTADOS: identificaram-se quatro padrões que explicaram 45,9% da variabilidade dos dados de frequência alimentar. Crianças que pertencem ao nível socioeconômico mais alto têm 1,60 vezes mais chance (p<0,001) de apresentarem maior frequência de consumo de alimentos do padrão 1 (frutas, verduras, leguminosas, cereais e pescados) e 3,09 vezes mais chance (p<0,001) de apresentarem maior frequência de consumo dos alimentos do padrão 2 (leite/ derivados, catchup/ maionese/ mostarda e frango), quando se compara com aquele de crianças de nível socioeconômico mais baixo. Resultado inverso foi observado no padrão 4 (embutidos, ovos e carnes vermelhas); isto é, quanto maior o nível socioeconômico menor a chance da adoção desse padrão. Tendência similar foi notada para o padrão 3 (frituras, doces, salgadinhos, refrigerante/ suco artificial). CONCLUSÕES: padrões alimentares de crianças são dependentes das condições socioeconômicas das famílias e a adoção de itens alimentares mais saudáveis associa-se aos grupos de mais altos níveis socioeconômicos.
Resumo:
Objective: The biochemical alterations between inflammatory fibrous hyperplasia (IFH) and normal tissues of buccal mucosa were probed by using the FT-Raman spectroscopy technique. The aim was to find the minimal set of Raman bands that would furnish the best discrimination. Background: Raman-based optical biopsy is a widely recognized potential technique for noninvasive real-time diagnosis. However, few studies had been devoted to the discrimination of very common subtle or early pathologic states as inflammatory processes that are always present on, for example, cancer lesion borders. Methods: Seventy spectra of IFH from 14 patients were compared with 30 spectra of normal tissues from six patients. The statistical analysis was performed with principal components analysis and soft independent modeling class analogy cross-validated, leave-one-out methods. Results: Bands close to 574, 1,100, 1,250 to 1,350, and 1,500 cm(-1) (mainly amino acids and collagen bands) showed the main intragroup variations that are due to the acanthosis process in the IFH epithelium. The 1,200 (C-C aromatic/DNA), 1,350 (CH(2) bending/collagen 1), and 1,730 cm(-1) (collagen III) regions presented the main intergroup variations. This finding was interpreted as originating in an extracellular matrix-degeneration process occurring in the inflammatory tissues. The statistical analysis results indicated that the best discrimination capability (sensitivity of 95% and specificity of 100%) was found by using the 530-580 cm(-1) spectral region. Conclusions: The existence of this narrow spectral window enabling normal and inflammatory diagnosis also had useful implications for an in vivo dispersive Raman setup for clinical applications.
Resumo:
Gene clustering is a useful exploratory technique to group together genes with similar expression levels under distinct cell cycle phases or distinct conditions. It helps the biologist to identify potentially meaningful relationships between genes. In this study, we propose a clustering method based on multivariate normal mixture models, where the number of clusters is predicted via sequential hypothesis tests: at each step, the method considers a mixture model of m components (m = 2 in the first step) and tests if in fact it should be m - 1. If the hypothesis is rejected, m is increased and a new test is carried out. The method continues (increasing m) until the hypothesis is accepted. The theoretical core of the method is the full Bayesian significance test, an intuitive Bayesian approach, which needs no model complexity penalization nor positive probabilities for sharp hypotheses. Numerical experiments were based on a cDNA microarray dataset consisting of expression levels of 205 genes belonging to four functional categories, for 10 distinct strains of Saccharomyces cerevisiae. To analyze the method's sensitivity to data dimension, we performed principal components analysis on the original dataset and predicted the number of classes using 2 to 10 principal components. Compared to Mclust (model-based clustering), our method shows more consistent results.
Resumo:
Background: Prostate cancer cells in primary tumors have been typed CD10(-)/CD13(-)/CD24(hi)/CD26(+)/CD38(lo)/CD44(-)/CD104(-). This CD phenotype suggests a lineage relationship between cancer cells and luminal cells. The Gleason grade of tumors is a descriptive of tumor glandular differentiation. Higher Gleason scores are associated with treatment failure. Methods: CD26(+) cancer cells were isolated from Gleason 3+3 (G3) and Gleason 4+4 (G4) tumors by cell sorting, and their gene expression or transcriptome was determined by Affymetrix DNA array analysis. Dataset analysis was used to determine gene expression similarities and differences between G3 and G4 as well as to prostate cancer cell lines and histologically normal prostate luminal cells. Results: The G3 and G4 transcriptomes were compared to those of prostatic cell types of non-cancer, which included luminal, basal, stromal fibromuscular, and endothelial. A principal components analysis of the various transcriptome datasets indicated a closer relationship between luminal and G3 than luminal and G4. Dataset comparison also showed that the cancer transcriptomes differed substantially from those of prostate cancer cell lines. Conclusions: Genes differentially expressed in cancer are potential biomarkers for cancer detection, and those differentially expressed between G3 and G4 are potential biomarkers for disease stratification given that G4 cancer is associated with poor outcomes. Differentially expressed genes likely contribute to the prostate cancer phenotype and constitute the signatures of these particular cancer cell types.
Resumo:
Online music databases have increased significantly as a consequence of the rapid growth of the Internet and digital audio, requiring the development of faster and more efficient tools for music content analysis. Musical genres are widely used to organize music collections. In this paper, the problem of automatic single and multi-label music genre classification is addressed by exploring rhythm-based features obtained from a respective complex network representation. A Markov model is built in order to analyse the temporal sequence of rhythmic notation events. Feature analysis is performed by using two multi-variate statistical approaches: principal components analysis (unsupervised) and linear discriminant analysis (supervised). Similarly, two classifiers are applied in order to identify the category of rhythms: parametric Bayesian classifier under the Gaussian hypothesis (supervised) and agglomerative hierarchical clustering (unsupervised). Qualitative results obtained by using the kappa coefficient and the obtained clusters corroborated the effectiveness of the proposed method.
Resumo:
The flowpaths by which water moves from watersheds to streams has important consequences for the runoff dynamics and biogeochemistry of surface waters in the Amazon Basin. The clearing of Amazon forest to cattle pasture has the potential to change runoff sources to streams by shifting runoff to more surficial flow pathways. We applied end-member mixing analysis (EMMA) to 10 small watersheds throughout the Amazon in which solute composition of streamwater and groundwater, overland flow, soil solution, throughfall and rainwater were measured, largely as part of the Large-Scale Biosphere-Atmosphere Experiment in Amazonia. We found a range in the extent to which streamwater samples fell within the mixing space determined by potential flowpath end-members, suggesting that some water sources to streams were not sampled. The contribution of overland flow as a source of stream flow was greater in pasture watersheds than in forest watersheds of comparable size. Increases in overland flow contribution to pasture streams ranged in some cases from 0% in forest to 27-28% in pasture and were broadly consistent with results from hydrometric sampling of Amazon forest and pasture watersheds that indicate 17- to 18-fold increase in the overland flow contribution to stream flow in pastures. In forest, overland flow was an important contribution to stream flow (45-57%) in ephemeral streams where flows were dominated by stormflow. Overland flow contribution to stream flow decreased in importance with increasing watershed area, from 21 to 57% in forest and 60-89% in pasture watersheds of less than 10 ha to 0% in forest and 27-28% in pastures in watersheds greater than 100 ha. Soil solution contributions to stream flow were similar across watershed area and groundwater inputs generally increased in proportion to decreases in overland flow. Application of EMMA across multiple watersheds indicated patterns across gradients of stream size and land cover that were consistent with patterns determined by detailed hydrometric sampling.
Resumo:
Rare species are one of the principal components of the species richness and diversity encountered in Dense Ombrophilous Tropical Forests. This study sought to analyze the rare canopy species within the Atlantic Coastal Forest in Rio de Janeiro State, Brazil. Six different communities were examined: Dense Ombrophilous alluvial Forest; Dense sub-montane Ombrophilous Forest; Dense Montane Ombrophilous in Serra do Mar and Serra da Mantiqueira. In each area the vegetation was sampled within forty 10 x 25 m plots alternately distributed along a linear transect. All trees with DBH (1.3 m above ground level) a parts per thousand yen5 cm were sampled. The canopy was characterized using the allometric relationship between diameter and height, and included all trees with BDH a parts per thousand yen10 cm and height a parts per thousand yen10 m. A total of 64 families, 206 genera, and 542 species were sampled, of which 297 (54.8%) represented rare species (less than one individual per hectare). The percentage of rare species varied from 34 to 50% in each of the different communities sampled. A majority of these rare trees belonged to the Rosidae, and a smaller proportion to the Dilleniidae. It was concluded that there was no apparent pattern to rarity among families, that rarity was probably derived from a number of processes (such as gap formation), and that a great majority of the rare species sampled were consistently rare. This indicates that the restricted geographic distribution and high degree of endemism of many arboreal taxa justifies the conservation of even small fragments of Atlantic Forest.
Resumo:
Background: Depression is a common contributor to suffering and disability in people with chronic pain. However, the assessment of depression in this population has been hampered by the presence of a number of somatic symptoms that are shared between chronic pain, treatment side-effects and traditional concepts of depression. As a result, the use of depression measures that do not contain somatic items has been encouraged. Objective: This study examined the psychometric properties of the Depression sub-scale of the Depression Anxiety and Stress Scales (DASS) in a Brazilian chronic pain patient population. Method: Data on a number of measures were collected from 348 participants attending pain facilities. Results: Principal components and exploratory factor analyses indicated the presence of only one factor. Item analyses indicated adequate item-scale correlations. The Cronbach alpha was .96, which suggests an excellent internal consistency. Conclusion: The DASS-Depression scale has adequate psychometric properties and its further use with Brazilian chronic pain populations can now be supported. (c) 2008 Elsevier Inc. All rights reserved.
Resumo:
In this work, pyrolysis-molecular beam mass spectrometry analysis coupled with principal components analysis and (13)C-labeled tetramethylammonium hydroxide thermochemolysis were used to study lignin oxidation, depolymerization, and demethylation of spruce wood treated by biomimetic oxidative systems. Neat Fenton and chelator-mediated Fenton reaction (CMFR) systems as well as cellulosic enzyme treatments were used to mimic the nonenzymatic process involved in wood brown-rot biodegradation. The results suggest that compared with enzymatic processes, Fenton-based treatment more readily opens the structure of the lignocellulosic matrix, freeing cellulose fibrils from the matrix. The results demonstrate that, under the current treatment conditions, Fenton and CMFR treatment cause limited demethoxylation of lignin in the insoluble wood residue. However, analysis of a water-extractable fraction revealed considerable soluble lignin residue structures that had undergone side chain oxidation as well as demethoxylation upon CMFR treatment. This research has implications for our understanding of nonenzymatic degradation of wood and the diffusion of CMFR agents in the wood cell wall during fungal degradation processes.