42 resultados para discriminant analysis and cluster analysis
Resumo:
The application of Discriminant function analysis (DFA) is not a new idea in the studyof tephrochrology. In this paper, DFA is applied to compositional datasets of twodifferent types of tephras from Mountain Ruapehu in New Zealand and MountainRainier in USA. The canonical variables from the analysis are further investigated witha statistical methodology of change-point problems in order to gain a betterunderstanding of the change in compositional pattern over time. Finally, a special caseof segmented regression has been proposed to model both the time of change and thechange in pattern. This model can be used to estimate the age for the unknown tephrasusing Bayesian statistical calibration
Resumo:
The purpose of this paper is to study the possible differences among countries as CO2 emitters and to examine the underlying causes of these differences. The starting point of the analysis is the Kaya identity, which allows us to break down per capita emissions in four components: an index of carbon intensity, transformation efficiency, energy intensity and social wealth. Through a cluster analysis we have identified five groups of countries with different behavior according to these four factors. One significant finding is that these groups are stable for the period analyzed. This suggests that a study based on these components can characterize quite accurately the polluting behavior of individual countries, that is to say, the classification found in the analysis could be used in other studies which look to study the behavior of countries in terms of CO2 emissions in homogeneous groups. In this sense, it supposes an advance over the traditional regional or rich-poor countries classifications .
Resumo:
Creative industries tend to concentrate mainly around large- and medium-sized cities, forming creative local production systems. The text analyses the forces behind clustering of creative industries to provide the first empirical explanation of the determinants of creative employment clustering following a multidisciplinary approach based on cultural and creative economics, evolutionary geography and urban economics. A comparative analysis has been performed for Italy and Spain. The results show different patterns of creative employment clustering in both countries. The small role of historical and cultural endowments, the size of the place, the average size of creative industries, the productive diversity and the concentration of human capital and creative class have been found as common factors of clustering in both countries.
Resumo:
HEMOLIA (a project under European community’s 7th framework programme) is a new generation Anti-Money Laundering (AML) intelligent multi-agent alert and investigation system which in addition to the traditional financial data makes extensive use of modern society’s huge telecom data source, thereby opening up a new dimension of capabilities to all Money Laundering fighters (FIUs, LEAs) and Financial Institutes (Banks, Insurance Companies, etc.). This Master-Thesis project is done at AIA, one of the partners for the HEMOLIA project in Barcelona. The objective of this thesis is to find the clusters in a network drawn by using the financial data. An extensive literature survey has been carried out and several standard algorithms related to networks have been studied and implemented. The clustering problem is a NP-hard problem and several algorithms like K-Means and Hierarchical clustering are being implemented for studying several problems relating to sociology, evolution, anthropology etc. However, these algorithms have certain drawbacks which make them very difficult to implement. The thesis suggests (a) a possible improvement to the K-Means algorithm, (b) a novel approach to the clustering problem using the Genetic Algorithms and (c) a new algorithm for finding the cluster of a node using the Genetic Algorithm.
Resumo:
Background: Peach fruit undergoes a rapid softening process that involves a number of metabolic changes. Storing fruit at low temperatures has been widely used to extend its postharvest life. However, this leads to undesired changes, such as mealiness and browning, which affect the quality of the fruit. In this study, a 2-D DIGE approach was designed to screen for differentially accumulated proteins in peach fruit during normal softening as well as under conditions that led to fruit chilling injury. Results:The analysis allowed us to identify 43 spots -representing about 18% of the total number analyzed- that show statistically significant changes. Thirty-nine of the proteins could be identified by mass spectrometry. Some of the proteins that changed during postharvest had been related to peach fruit ripening and cold stress in the past. However, we identified other proteins that had not been linked to these processes. A graphical display of the relationship between the differentially accumulated proteins was obtained using pairwise average-linkage cluster analysis and principal component analysis. Proteins such as endopolygalacturonase, catalase, NADP-dependent isocitrate dehydrogenase, pectin methylesterase and dehydrins were found to be very important for distinguishing between healthy and chill injured fruit. A categorization of the differentially accumulated proteins was performed using Gene Ontology annotation. The results showed that the 'response to stress', 'cellular homeostasis', 'metabolism of carbohydrates' and 'amino acid metabolism' biological processes were affected the most during the postharvest. Conclusions: Using a comparative proteomic approach with 2-D DIGE allowed us to identify proteins that showed stage-specific changes in their accumulation pattern. Several proteins that are related to response to stress, cellular homeostasis, cellular component organization and carbohydrate metabolism were detected as being differentially accumulated. Finally, a significant proportion of the proteins identified had not been associated with softening, cold storage or chilling injury-altered fruit before; thus, comparative proteomics has proven to be a valuable tool for understanding fruit softening and postharvest.
Resumo:
We present an analysis of the M-O chemical bonding in the binary oxides MgO, CaO, SrO, BaO, and Al2O3 based on ab initio wave functions. The model used to represent the local environment of a metal cation in the bulk oxide is an MO6 cluster which also includes the effect of the lattice Madelung potential. The analysis of the wave functions for these clusters leads to the conclusion that all the alkaline-earth oxides must be regarded as highly ionic oxides; however, the ionic character of the oxides decreases as one goes from MgO, almost perfectly ionic, to BaO. In Al2O3 the ionic character is further reduced; however, even in this case, the departure from the ideal, fully ionic, model of Al3+ is not exceptionally large. These conclusions are based on three measures, a decomposition of the Mq+-Oq- interaction energy, the number of electrons associated to the oxygen ions as obtained from a projection operator technique, and the analysis of the cation core-level binding energies. The increasing covalent character along the series MgO, CaO, SrO, and BaO is discussed in view of the existing theoretical models and experimental data.
Resumo:
We presented an integrated hierarchical model of psychopathology that more accurately captures empirical patterns of comorbidity between clinical syndromes and personality disorders.In order to verify the structural validity of the model proposed, this study aimed to analyze the convergence between the Restructured Clinical (RC) scales and Personality scales (PSY-5) of the MMPI-2-RF and the Clinical Syndrome and Personality Disorder scales of the MCMI-III.The MMPI-2-RF and MCMI-III were administered to a clinical sample of 377 outpatients (167 men and 210 women).The structural hypothesiswas assessed by using a Confirmatory Factor Analytic design with four common superordinate factors. An independent-cluster-basis solution was proposed based on maximum likelihood estimation and the application of several fit indices.The fit of the proposed model can be considered as good and more so if we take into account its complexity.
Resumo:
Background: The 22q11.2 deletion syndrome is the most frequent genomic disorder with an estimated frequency of 1/4000 live births. The majority of patients (90%) have the same deletion of 3 Mb (Typically Deleted Region, TDR) that results from aberrant recombination at meiosis between region specific low-copy repeats (LCRs). Methods: As a first step towards the characterization of recombination rates and breakpoints within the 22q11.2 region we have constructed a high resolution recombination breakpoint map based on pedigree analysis and a population-based historical recombination map based on LD analysis. Results: Our pedigree map allows the location of recombination breakpoints with a high resolution (potential recombination hotspots), and this approach has led to the identification of 5 breakpoint segments of 50 kb or less (8.6 kb the smallest), that coincide with historical hotspots. It has been suggested that aberrant recombination leading to deletion (and duplication) is caused by low rates of Allelic Homologous Recombination (AHR) within the affected region. However, recombination rate estimates for 22q11.2 region show that neither average recombination rates in the 22q11.2 region or within LCR22-2 (the LCR implicated in most deletions and duplications), are significantly below chromosome 22 averages. Furthermore, LCR22-2, the repeat most frequently implicated in rearrangements, is also the LCR22 with the highest levels of AHR. In addition, we find recombination events in the 22q11.2 region to cluster within families. Within this context, the same chromosome recombines twice in one family; first by AHR and in the next generation by NAHR resulting in an individual affected with the del22q11.2 syndrome. Conclusion: We show in the context of a first high resolution pedigree map of the 22q11.2 region that NAHR within LCR22 leading to duplications and deletions cannot be explained exclusively under a hypothesis of low AHR rates. In addition, we find that AHR recombination events cluster within families. If normal and aberrant recombination are mechanistically related, the fact that LCR22s undergo frequent AHR and that we find familial differences in recombination rates within the 22q11.2 region would have obvious health-related implications.
Resumo:
The objective of research was to analyse the potential of Normalized Difference Vegetation Index (NDVI) maps from satellite images, yield maps and grapevine fertility and load variables to delineate zones with different wine grape properties for selective harvesting. Two vineyard blocks located in NE Spain (Cabernet Sauvignon and Syrah) were analysed. The NDVI was computed from a Quickbird-2 multi-spectral image at veraison (July 2005). Yield data was acquired by means of a yield monitor during September 2005. Other variables, such as the number of buds, number of shoots, number of wine grape clusters and weight of 100 berries were sampled in a 10 rows × 5 vines pattern and used as input variables, in combination with the NDVI, to define the clusters as alternative to yield maps. Two days prior to the harvesting, grape samples were taken. The analysed variables were probable alcoholic degree, pH of the juice, total acidity, total phenolics, colour, anthocyanins and tannins. The input variables, alone or in combination, were clustered (2 and 3 Clusters) by using the ISODATA algorithm, and an analysis of variance and a multiple rang test were performed. The results show that the zones derived from the NDVI maps are more effective to differentiate grape maturity and quality variables than the zones derived from the yield maps. The inclusion of other grapevine fertility and load variables did not improve the results.
Resumo:
Background: We use an approach based on Factor Analysis to analyze datasets generated for transcriptional profiling. The method groups samples into biologically relevant categories, and enables the identification of genes and pathways most significantly associated to each phenotypic group, while allowing for the participation of a given gene in more than one cluster. Genes assigned to each cluster are used for the detection of pathways predominantly activated in that cluster by finding statistically significant associated GO terms. We tested the approach with a published dataset of microarray experiments in yeast. Upon validation with the yeast dataset, we applied the technique to a prostate cancer dataset. Results: Two major pathways are shown to be activated in organ-confined, non-metastatic prostate cancer: those regulated by the androgen receptor and by receptor tyrosine kinases. A number of gene markers (HER3, IQGAP2 and POR1) highlighted by the software and related to the later pathway have been validated experimentally a posteriori on independent samples. Conclusion: Using a new microarray analysis tool followed by a posteriori experimental validation of the results, we have confirmed several putative markers of malignancy associated with peptide growth factor signalling in prostate cancer and revealed others, most notably ERRB3 (HER3). Our study suggest that, in primary prostate cancer, HER3, together or not with HER4, rather than in receptor complexes involving HER2, could play an important role in the biology of these tumors. These results provide new evidence for the role of receptor tyrosine kinases in the establishment and progression of prostate cancer.
Resumo:
The main goal of the InterAmbAr reseach project is to analyze the relationships between landscape systems and human land-use strategies on mountains and littoral plains from a long-term perspective. The study adopts a high resolution analysis of small-scale study areas located in the Mediterranean region of north-eastern Catalonia. The study areas are distributed along an altitudinal transect from the high mountain (above 2000m a.s.l.) to the littoral plain of Empordà (Fig. 1). High resolution interdisciplinary research has been carried out from 2010, based on the integration of palaeoenvironmental and archaeological data. The micro-scale approach is used to understand human-environmental relationships. It allows better understanding of the local-regional nature of environmental changes and the synergies between catchment-based systems, hydro-sedimentary regimes, human mobility, land-uses, human environments, demography, etc.
Resumo:
Mechanisms underlying speciation in plants include detrimental (incompatible) genetic interactions between parental alleles that incur a fitness cost in hybrids. We reported on recessive hybrid incompatibility between an Arabidopsis thaliana strain from Poland, Landsberg erecta (Ler), and many Central Asian A. thaliana strains. The incompatible interaction is determined by a polymorphic cluster of Toll/interleukin-1 receptor-nucleotide binding-leucine rich repeat (TNL) RPP1 (Recognition of Peronospora parasitica1)-like genes in Ler and alleles of the receptor-like kinase Strubbelig Receptor Family 3 (SRF3) in Central Asian strains Kas-2 or Kond, causing temperature-dependent autoimmunity and loss of growth and reproductive fitness. Here, we genetically dissected the RPP1-like Ler locus to determine contributions of individual RPP1-like Ler (R1R8) genes to the incompatibility. In a neutral background, expression of most RPP1-like Ler genes, except R3, has no effect on growth or pathogen resistance. Incompatibility involves increased R3 expression and engineered R3 overexpression in a neutral background induces dwarfism and sterility. However, no individual RPP1-like Ler gene is sufficient for incompatibility between Ler and Kas-2 or Kond, suggesting that co-action of at least two RPP1-like members underlies this epistatic interaction. We find that the RPP1-like Ler haplotype is frequent and occurs with other Ler RPP1-like alleles in a local population in Gorzów Wielkopolski (Poland). Only Gorzów individuals carrying the RPP1-like Ler haplotype are incompatible with Kas-2 and Kond, whereas other RPP1-like alleles in the population are compatible. Therefore, the RPP1-like Ler haplotype has been maintained in genetically different individuals at a single site, allowing exploration of forces shaping the evolution of RPP1-like genes at local and regional population scales.