795 resultados para hierarchical clustering
Resumo:
O objetivo deste trabalho foi comparar diferentes técnicas multivariadas na caracterização de 35 genótipos de gergelim mediante 769 marcadores RAPD. As distâncias genéticas foram obtidas pelo complemento aritmético do coeficiente de Jaccard e agrupadas pelos métodos hierárquicos do vizinho mais próximo, do vizinho mais distante, das médias aritméticas não ponderadas (UPGMA), do método de otimização de Tocher e análises de coordenadas principais. O agrupamento dos genótipos foi alterado em função dos diferentes métodos usados. Adotando-se a mesma distância genética (0,36) como valor de corte, diferenciaram-se quatro grupos no método do vizinho mais próximo, 13 para o vizinho mais distante, 11 no UPGMA e quatro no Tocher. Entre os métodos hierárquicos, o UPGMA apresentou o melhor ajuste das distâncias originais e estimadas (CCC = 0,89). As análises das coordenadas principais confirmaram a baixa diversidade existente entre os genótipos. A maior divergência ocorreu entre as cultivares Seridó 1 e Arawaca 4, e a menor, entre os genótipos VCR-101 e GP-3314. As três primeiras coordenadas principais contabilizaram 35,13% do total da variabilidade, e 18 autovalores foram necessários para explicar 81% da variação genética. Os métodos UPGMA, de otimização de Tocher, e as análises de coordenadas principais são complementares na formação dos grupos.
Resumo:
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)
Resumo:
Biomass burning is an important primary and secondary source of aerosol particles. The presence of carbonaceous particles in the respirable size range makes the study of this fraction important in view of possible health and climatic effects. The annual burning of sugar cane plantations causes emission of huge amounts of pyrogenic particles. Aerosol samples were collected in Araraquara city, São Paulo state, Brazil, during the harvest season for fine and coarse particles and bulk; they were analysed by electron-probe microanalysis, including facilities for low-Z element determination (low-Z EPMA) and by energy-dispersive X-ray fluorescence (EDXRF), in order to investigate the elemental composition of individual particles and bulk samples, respectively. Numerical analysis of the EPMA results by hierarchical clustering shows high contributions of carbonaceous particles that can be distinguished mainly in two different types: biogenic and carbon-rich. Additionally, two significant contributions of aluminosilicate particles were identified: as rather pure aluminosilicates or mixed with carbonaceous species. The EDXRF results are compatible with those of aerosol particles in Amazon, which is nowadays one of the main sources of biogenic particles in the world.
Resumo:
(10) Hygiea is the fourth largest asteroid of the main belt, by volume and mass, and it is the largest member of its family, that is made mostly by low-albedo, C-type asteroids, typical of the outer main belt. Like many other large families, it is associated with a 'halo' of objects, that extends far beyond the boundary of the core family, as detected by traditional hierarchical clustering methods (HCM) in proper element domains. Numerical simulations of the orbital evolution of family members may help in estimating the family and halo family age, and the original ejection velocity field. But, in order to minimize the errors associated with including too many interlopers, it is important to have good estimates of family membership that include available data on local asteroid taxonomy, geometrical albedo and local dynamics. For this purpose, we obtained synthetic proper elements and frequencies of asteroids in the Hygiea orbital region, with their errors. We revised the current knowledge on asteroid taxonomy, including Sloan Digital Sky Survey-Moving Object Catalog 4th release (SDSS-MOC 4) data, and geometric albedo data from Wide-field Infrared Survey Explorer (WISE) and Near-Earth Object WISE (NEOWISE). We identified asteroid family members using HCM in the domain of proper elements (a, e, sin (i)) and in the domains of proper frequencies most appropriate to study diffusion in the local web of secular resonances, and eliminated possible interlopers based on taxonomic and geometrical albedo considerations. To identify the family halo, we devised a new hierarchical clustering method in an extended domain that includes proper elements, principal components PC1, PC2 obtained based on SDSS photometric data and, for the first time, WISE and NEOWISE geometric albedo. Data on asteroid size distribution, light curves and rotations were also revised for the Hygiea family. The Hygiea family is the largest group in its region, with two smaller families in proper element domain and 18 families in various frequencies domains identified in this work for the first time. Frequency groups tend to extend vertically in the (a, sin (i)) plane and cross not only the Hygiea family but also the near C-type families of Themis and Veritas, causing a mixture of objects all of relatively low albedo in the Hygiea family area. A few high-albedo asteroids, most likely associated with the Eos family, are also present in the region. Finally, the new multidomains hierarchical clustering method allowed us to obtain a good and robust estimate of the membership of the Hygiea family halo, quite separated from other asteroids families halo in the region, and with a very limited (about 3 per cent) presence of likely interlopers. © 2013 The Author Published by Oxford University Press on behalf of the Royal Astronomical Society.
Genomic Signatures Predict Poor Outcome in Undifferentiated Pleomorphic Sarcomas and Leiomyosarcomas
Resumo:
Undifferentiated high-grade pleomorphic sarcomas (UPSs) display aggressive clinical behavior and frequently develop local recurrence and distant metastasis. Because these sarcomas often share similar morphological patterns with other tumors, particularly leiomyosarcomas (LMSs), classification by exclusion is frequently used. In this study, array-based comparative genomic hybridization (array CGH) was used to analyze 20 UPS and 17 LMS samples from untreated patients. The LMS samples presented a lower frequency of genomic alterations compared with the UPS samples. The most frequently altered UPS regions involved gains at 20q13.33 and 7q22.1 and losses at 3p26.3. Gains at 8q24.3 and 19q13.12 and losses at 9p21.3 were frequently detected in the LMS samples. Of these regions, gains at 1q21.3, 11q12.2-q12.3, 16p11.2, and 19q13.12 were significantly associated with reduced overall survival times in LMS patients. A multivariate analysis revealed that gains at 1q21.3 were an independent prognostic marker of shorter survival times in LMS patients (HR = 13.76; P = 0.019). Although the copy number profiles of the UPS and LMS samples could not be distinguished using unsupervised hierarchical clustering analysis, one of the three clusters presented cases associated with poor prognostic outcome (P = 0.022). A relative copy number analysis for the ARNT, SLC27A3, and PBXIP1 genes was performed using quantitative real-time PCR in 11 LMS and 16 UPS samples. Gains at 1q21-q22 were observed in both tumor types, particularly in the UPS samples. These findings provide strong evidence for the existence of a genomic signature to predict poor outcome in a subset of UPS and LMS patients. © 2013 Silveira et al.
Resumo:
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
Resumo:
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
The soybean crop is considered a high expression around the world. In plant breeding programs, knowledge of genetic diversity is extremely important and in this context, are frequently used multivariate analyzes. Thus, the aim of the present study was to evaluate the genetic divergence between soybean crosses through multivariate techniques. In total, 16 crosses were evaluated, which were in the F2 generation of inbreeding. The evaluated characteristics were plant height at maturity, height of the first pod, number of branches per plant, number of pods per plant, number of nodes per plant, hundred seed weight, grain yield and oil content. For the analyzes was used Euclidean distance, methods of hierarchical clustering UPGMA and Ward and principal component analysis. Genetic distances estimated using Euclidean distance ranged from 1.24 to 8.13, with the smallest distance observed between crosses C1 and C4, and the greatest distance between the C2 crosses and C6. The methods UPGMA clustering and Ward met crossings in five different groups. The principal component analysis explained 86.2% of the variance contained in the original eight variables with three main components. The APM characters, NV, NR, NN, PG% and oil were the main contributors to genetic divergence among traits. Multivariate techniques were crucial to the analysis of genetic diversity, and the methods of Ward and UPGMA clustering and principal components have consistent results in this way, the simultaneous use of these tools in genetic analysis of crosses is indicated
Resumo:
Pós-graduação em Agronomia (Produção Vegetal) - FCAV
Long-term clinical evaluation of the color stability and stainability of acrylic resin denture teeth
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Background: Large gene expression studies, such as those conducted using DNA arrays, often provide millions of different pieces of data. To address the problem of analyzing such data, we describe a statistical method, which we have called ‘gene shaving’. The method identifies subsets of genes with coherent expression patterns and large variation across conditions. Gene shaving differs from hierarchical clustering and other widely used methods for analyzing gene expression studies in that genes may belong to more than one cluster, and the clustering may be supervised by an outcome measure. The technique can be ‘unsupervised’, that is, the genes and samples are treated as unlabeled, or partially or fully supervised by using known properties of the genes or samples to assist in finding meaningful groupings. Results: We illustrate the use of the gene shaving method to analyze gene expression measurements made on samples from patients with diffuse large B-cell lymphoma. The method identifies a small cluster of genes whose expression is highly predictive of survival. Conclusions: The gene shaving method is a potentially useful tool for exploration of gene expression data and identification of interesting clusters of genes worth further investigation.
Resumo:
Biogeography has been difficult to apply as a methodological approach because organismic biology is incomplete at levels where the process of formulating comparisons and analogies is complex. The study of insect biogeography became necessary because insects possess numerous evolutionary traits and play an important role as pollinators. Among insects, the euglossine bees, or orchid bees, attract interest because the study of their biology allows us to explain important steps in the evolution of social behavior and many other adaptive tradeoffs. We analyzed the distribution of morphological characteristics in Colombian orchid bees from an ecological perspective. The aim of this study was to observe the distribution of these attributes on a regional basis. Data corresponding to Colombian euglossine species were ordered with a correspondence analysis and with subsequent hierarchical clustering. Later, and based on community proprieties, we compared the resulting hierarchical model with the collection localities to seek to identify a biogeographic classification pattern. From this analysis, we derived a model that classifies the territory of Colombia into 11 biogeographic units or natural clusters. Ecological assumptions in concordance with the derived classification levels suggest that species characteristics associated with flight performance, nectar uptake, and social behavior are the factors that served to produce the current geographical structure.
Resumo:
Alzheimer's disease (AD) is the most common cause of dementia in the human population, characterized by a spectrum of neuropathological abnormalities that results in memory impairment and loss of other cognitive processes as well as the presence of non-cognitive symptoms. Transcriptomic analyses provide an important approach to elucidating the pathogenesis of complex diseases like AD, helping to figure out both pre-clinical markers to identify susceptible patients and the early pathogenic mechanisms to serve as therapeutic targets. This study provides the gene expression profile of postmortem brain tissue from subjects with clinic-pathological AD (Braak IV, V, or V and CERAD B or C; and CDR >= 1), preclinical AD (Braak IV, V, or VI and CERAD B or C; and CDR = 0), and healthy older individuals (Braak <= II and CERAD 0 or A; and CDR = 0) in order to establish genes related to both AD neuropathology and clinical emergence of dementia. Based on differential gene expression, hierarchical clustering and network analysis, genes involved in energy metabolism, oxidative stress, DNA damage/repair, senescence, and transcriptional regulation were implicated with the neuropathology of AD; a transcriptional profile related to clinical manifestation of AD could not be detected with reliability using differential gene expression analysis, although genes involved in synaptic plasticity, and cell cycle seems to have a role revealed by gene classifier. In conclusion, the present data suggest gene expression profile changes secondary to the development of AD-related pathology and some genes that appear to be related to the clinical manifestation of dementia in subjects with significant AD pathology, making necessary further investigations to better understand these transcriptional findings on the pathogenesis and clinical emergence of AD.
Resumo:
Abstract Background Oral squamous cell carcinoma (OSCC) is a frequent neoplasm, which is usually aggressive and has unpredictable biological behavior and unfavorable prognosis. The comprehension of the molecular basis of this variability should lead to the development of targeted therapies as well as to improvements in specificity and sensitivity of diagnosis. Results Samples of primary OSCCs and their corresponding surgical margins were obtained from male patients during surgery and their gene expression profiles were screened using whole-genome microarray technology. Hierarchical clustering and Principal Components Analysis were used for data visualization and One-way Analysis of Variance was used to identify differentially expressed genes. Samples clustered mostly according to disease subsite, suggesting molecular heterogeneity within tumor stages. In order to corroborate our results, two publicly available datasets of microarray experiments were assessed. We found significant molecular differences between OSCC anatomic subsites concerning groups of genes presently or potentially important for drug development, including mRNA processing, cytoskeleton organization and biogenesis, metabolic process, cell cycle and apoptosis. Conclusion Our results corroborate literature data on molecular heterogeneity of OSCCs. Differences between disease subsites and among samples belonging to the same TNM class highlight the importance of gene expression-based classification and challenge the development of targeted therapies.