895 resultados para discriminant analysis and cluster analysis
Resumo:
Creative industries tend to concentrate mainly around large- and medium-sized cities, forming creative local production systems. The text analyses the forces behind clustering of creative industries to provide the first empirical explanation of the determinants of creative employment clustering following a multidisciplinary approach based on cultural and creative economics, evolutionary geography and urban economics. A comparative analysis has been performed for Italy and Spain. The results show different patterns of creative employment clustering in both countries. The small role of historical and cultural endowments, the size of the place, the average size of creative industries, the productive diversity and the concentration of human capital and creative class have been found as common factors of clustering in both countries.
Resumo:
The study of the Schistosoma mansoni genome, one of the etiologic agents of human schistosomiasis, is essential for a better understanding of the biology and development of this parasite. In order to get an overview of all S. mansoni catalogued gene sequences, we performed a clustering analysis of the parasite mRNA sequences available in public databases. This was made using softwares PHRAP and CAP3. The consensus sequences, generated after the alignment of cluster constituent sequences, allowed the identification by database homology searches of the most expressed genes in the worm. We analyzed these genes and looked for a correlation between their high expression and parasite metabolism and biology. We observed that the majority of these genes is related to the maintenance of basic cell functions, encoding genes whose products are related to the cytoskeleton, intracellular transport and energy metabolism. Evidences are presented here that genes for aerobic energy metabolism are expressed in all the developmental stages analyzed. Some of the most expressed genes could not be identified by homology searches and may have some specific functions in the parasite.
Resumo:
Triatoma dimidiata is one of the major vectors of Chagas disease in Latin America. Its range includes Mexico, all countries of Central America, Colombia, and Ecuador. In light of recent genetic analysis suggesting that the possible origin of this species is the Yucatan peninsula, we have analyzed populations from the state of Yucatan, San Luis Potosi, and Veracruz in Mexico, and a population from the southern region of the Yucatan peninsula located in Northern Guatemala, the region of El Peten. Classical morphometry including principal component, discriminant, sexual dimorphism, and wing asymmetry was analyzed. San Luis Potosi and Veracruz populations were indistinguishable while clearly separate from Yucatan and Peten populations. Despite important genetic differences, Yucatan and Peten populations were highly similar. Yucatan specimens were the smallest in size, while females were larger than males in all populations. Only head characters were necessary to distinguish population level differences, although wing fluctuating asymmetry was present in all populations. These results are discussed in light of recent findings suggesting genetic polymorphism in most populations of Triatoma dimidiata south of Chiapas to Ecuador.
Resumo:
The number of sequences generated by genome projects has increased exponentially, but gene characterization has not followed at the same rate. Sequencing and analysis of full-length cDNAs is an important step in gene characterization that has been used nowadays by several research groups. In this work, we have selected Schistosoma mansoni clones for full-length sequencing, using an algorithm that investigates the presence of the initial methionine in the parasite sequence based on the positions of alignment start between two sequences. BLAST searches to produce such alignments have been performed using parasite expressed sequence tags produced by Minas Gerais Genome Network against sequences from the database Eukaryotic Cluster of Orthologous Groups (KOG). This procedure has allowed the selection of clones representing 398 proteins which have not been deposited as S. mansoni complete CDS in any public database. Dedicated sequencing of 96 of such clones with reads from both 5' and 3' ends has been performed. These reads have been assembled using PHRAP, resulting in the production of 33 full-length sequences that represent novel S. mansoni proteins. These results shall contribute to construct a more complete view of the biology of this important parasite.
Resumo:
Sphingomonas wittichii RW1 is a bacterium isolated for its ability to degrade the xenobiotic compounds dibenzodioxin and dibenzofuran (DBF). A number of genes involved in DBF degradation have been previously characterized, such as the dxn cluster, dbfB, and the electron transfer components fdx1, fdx3, and redA2. Here we use a combination of whole genome transcriptome analysis and transposon library screening to characterize RW1 catabolic and other genes implicated in the reaction to or degradation of DBF. To detect differentially expressed genes upon exposure to DBF, we applied three different growth exposure experiments, using either short DBF exposures to actively growing cells or growing them with DBF as sole carbon and energy source. Genome-wide gene expression was examined using a custom-made microarray. In addition, proportional abundance determination of transposon insertions in RW1 libraries grown on salicylate or DBF by ultra-high throughput sequencing was used to infer genes whose interruption caused a fitness loss for growth on DBF. Expression patterns showed that batch and chemostat growth conditions, and short or long exposure of cells to DBF produced very different responses. Numerous other uncharacterized catabolic gene clusters putatively involved in aromatic compound metabolism increased expression in response to DBF. In addition, only very few transposon insertions completely abolished growth on DBF. Some of those (e.g., in dxnA1) were expected, whereas others (in a gene cluster for phenylacetate degradation) were not. Both transcriptomic data and transposon screening suggest operation of multiple redundant and parallel aromatic pathways, depending on DBF exposure. In addition, increased expression of other non-catabolic genes suggests that during initial exposure, S. wittichii RW1 perceives DBF as a stressor, whereas after longer exposure, the compound is recognized as a carbon source and metabolized using several pathways in parallel.
Resumo:
HEMOLIA (a project under European community’s 7th framework programme) is a new generation Anti-Money Laundering (AML) intelligent multi-agent alert and investigation system which in addition to the traditional financial data makes extensive use of modern society’s huge telecom data source, thereby opening up a new dimension of capabilities to all Money Laundering fighters (FIUs, LEAs) and Financial Institutes (Banks, Insurance Companies, etc.). This Master-Thesis project is done at AIA, one of the partners for the HEMOLIA project in Barcelona. The objective of this thesis is to find the clusters in a network drawn by using the financial data. An extensive literature survey has been carried out and several standard algorithms related to networks have been studied and implemented. The clustering problem is a NP-hard problem and several algorithms like K-Means and Hierarchical clustering are being implemented for studying several problems relating to sociology, evolution, anthropology etc. However, these algorithms have certain drawbacks which make them very difficult to implement. The thesis suggests (a) a possible improvement to the K-Means algorithm, (b) a novel approach to the clustering problem using the Genetic Algorithms and (c) a new algorithm for finding the cluster of a node using the Genetic Algorithm.
Resumo:
Previously published scientific papers have reported a negative correlation between drinking water hardness and cardiovascular mortality. Some ecologic and case-control studies suggest the protective effect of calcium and magnesium concentration in drinking water. In this article we present an analysis of this protective relationship in 538 municipalities of Comunidad Valenciana (Spain) from 1991-1998. We used the Spanish version of the Rapid Inquiry Facility (RIF) developed under the European Environment and Health Information System (EUROHEIS) research project. The strategy of analysis used in our study conforms to the exploratory nature of the RIF that is used as a tool to obtain quick and flexible insight into epidemiologic surveillance problems. This article describes the use of the RIF to explore possible associations between disease indicators and environmental factors. We used exposure analysis to assess the effect of both protective factors--calcium and magnesium--on mortality from cerebrovascular (ICD-9 430-438) and ischemic heart (ICD-9 410-414) diseases. This study provides statistical evidence of the relationship between mortality from cardiovascular diseases and hardness of drinking water. This relationship is stronger in cerebrovascular disease than in ischemic heart disease, is more pronounced for women than for men, and is more apparent with magnesium than with calcium concentration levels. Nevertheless, the protective nature of these two factors is not clearly established. Our results suggest the possibility of protectiveness but cannot be claimed as conclusive. The weak effects of these covariates make it difficult to separate them from the influence of socioeconomic and environmental factors. We have also performed disease mapping of standardized mortality ratios to detect clusters of municipalities with high risk. Further standardization by levels of calcium and magnesium in drinking water shows changes in the maps when we remove the effect of these covariates.
Resumo:
Background: Peach fruit undergoes a rapid softening process that involves a number of metabolic changes. Storing fruit at low temperatures has been widely used to extend its postharvest life. However, this leads to undesired changes, such as mealiness and browning, which affect the quality of the fruit. In this study, a 2-D DIGE approach was designed to screen for differentially accumulated proteins in peach fruit during normal softening as well as under conditions that led to fruit chilling injury. Results:The analysis allowed us to identify 43 spots -representing about 18% of the total number analyzed- that show statistically significant changes. Thirty-nine of the proteins could be identified by mass spectrometry. Some of the proteins that changed during postharvest had been related to peach fruit ripening and cold stress in the past. However, we identified other proteins that had not been linked to these processes. A graphical display of the relationship between the differentially accumulated proteins was obtained using pairwise average-linkage cluster analysis and principal component analysis. Proteins such as endopolygalacturonase, catalase, NADP-dependent isocitrate dehydrogenase, pectin methylesterase and dehydrins were found to be very important for distinguishing between healthy and chill injured fruit. A categorization of the differentially accumulated proteins was performed using Gene Ontology annotation. The results showed that the 'response to stress', 'cellular homeostasis', 'metabolism of carbohydrates' and 'amino acid metabolism' biological processes were affected the most during the postharvest. Conclusions: Using a comparative proteomic approach with 2-D DIGE allowed us to identify proteins that showed stage-specific changes in their accumulation pattern. Several proteins that are related to response to stress, cellular homeostasis, cellular component organization and carbohydrate metabolism were detected as being differentially accumulated. Finally, a significant proportion of the proteins identified had not been associated with softening, cold storage or chilling injury-altered fruit before; thus, comparative proteomics has proven to be a valuable tool for understanding fruit softening and postharvest.
Resumo:
Splenic marginal zone lymphoma (SMZL) is an indolent B-cell lymphoproliferative disorder characterised by 7q32 deletion, but the target genes of this deletion remain unknown. In order to elucidate the genetic target of this deletion, we performed an integrative analysis of the genetic, epigenetic, transcriptomic and miRNomic data. High resolution array comparative genomic hybridization of 56 cases of SMZL delineated a minimally deleted region (2.8 Mb) at 7q32, but showed no evidence of any cryptic homozygous deletion or recurrent breakpoint in this region. Integrated transcriptomic analysis confirmed significant under-expression of a number of genes in this region in cases of SMZL with deletion, several of which showed hypermethylation. In addition, a cluster of 8 miRNA in this region showed under-expression in cases with the deletion, and three (miR-182/96/183) were also significantly under-expressed (P<0.05) in SMZL relative to other lymphomas. Genomic sequencing of these miRNA and IRF5, a strong candidate gene, did not show any evidence of somatic mutation in SMZL. These observations provide valuable guidance for further characterisation of 7q deletion.
Resumo:
PURPOSE: In contrast to other human tumors, a repression of the cell-surface glycoprotein CD44 on neuroblastoma is a marker of aggressiveness that usually correlates to N-myc amplification. We thus compared the prognostic value of both markers in the initial staging of 121 children treated for neuroblastoma in collaborative institutions. METHODS: Frozen samples were analyzed by a rapid and well-standardized technique of immunostaining with monoclonal antibodies (MoAbs) against epitopes in the CD44 constant region. RESULTS: In this retrospective series, CD44 was expressed on 102 specimens and strongly correlated with favorable tumor stages and histology, younger age, and normal N-myc copy numbers. In univariate analysis, CD44 expression and normal N-myc were the most powerful markers of favorable clinical outcome (P < 10(-6) and chi 2 = 65.40 and P < 10(-6) and chi 2 = 42.56, respectively), but analysis of CD44 affords significant prognostic discrimination in subgroups of patients with or without N-myc-amplified tumors. In the subgroup of stage IV neuroblastomas, CD44 was the only significant prognostic marker (P < .02, chi 2 = 5.76), whereas N-myc status was not discriminant. In multivariate analysis of five factors, ie, N-myc amplification, CD44 expression, age, tumor stage, and histology, the only independent prognostic factors of event-free survival were CD44 expression and tumor stage. CONCLUSION: The analysis of CD44 cell-surface expression must be recommended as an additional biologic marker in the initial staging of the disease.
Resumo:
We present an analysis of the M-O chemical bonding in the binary oxides MgO, CaO, SrO, BaO, and Al2O3 based on ab initio wave functions. The model used to represent the local environment of a metal cation in the bulk oxide is an MO6 cluster which also includes the effect of the lattice Madelung potential. The analysis of the wave functions for these clusters leads to the conclusion that all the alkaline-earth oxides must be regarded as highly ionic oxides; however, the ionic character of the oxides decreases as one goes from MgO, almost perfectly ionic, to BaO. In Al2O3 the ionic character is further reduced; however, even in this case, the departure from the ideal, fully ionic, model of Al3+ is not exceptionally large. These conclusions are based on three measures, a decomposition of the Mq+-Oq- interaction energy, the number of electrons associated to the oxygen ions as obtained from a projection operator technique, and the analysis of the cation core-level binding energies. The increasing covalent character along the series MgO, CaO, SrO, and BaO is discussed in view of the existing theoretical models and experimental data.
Resumo:
INTRODUCTION: Breast cancer subtyping and prognosis have been studied extensively by gene expression profiling, resulting in disparate signatures with little overlap in their constituent genes. Although a previous study demonstrated a prognostic concordance among gene expression signatures, it was limited to only one dataset and did not fully elucidate how the different genes were related to one another nor did it examine the contribution of well-known biological processes of breast cancer tumorigenesis to their prognostic performance. METHOD: To address the above issues and to further validate these initial findings, we performed the largest meta-analysis of publicly available breast cancer gene expression and clinical data, which are comprised of 2,833 breast tumors. Gene coexpression modules of three key biological processes in breast cancer (namely, proliferation, estrogen receptor [ER], and HER2 signaling) were used to dissect the role of constituent genes of nine prognostic signatures. RESULTS: Using a meta-analytical approach, we consolidated the signatures associated with ER signaling, ERBB2 amplification, and proliferation. Previously published expression-based nomenclature of breast cancer 'intrinsic' subtypes can be mapped to the three modules, namely, the ER-/HER2- (basal-like), the HER2+ (HER2-like), and the low- and high-proliferation ER+/HER2- subtypes (luminal A and B). We showed that all nine prognostic signatures exhibited a similar prognostic performance in the entire dataset. Their prognostic abilities are due mostly to the detection of proliferation activity. Although ER- status (basal-like) and ERBB2+ expression status correspond to bad outcome, they seem to act through elevated expression of proliferation genes and thus contain only indirect information about prognosis. Clinical variables measuring the extent of tumor progression, such as tumor size and nodal status, still add independent prognostic information to proliferation genes. CONCLUSION: This meta-analysis unifies various results of previous gene expression studies in breast cancer. It reveals connections between traditional prognostic factors, expression-based subtyping, and prognostic signatures, highlighting the important role of proliferation in breast cancer prognosis.
Resumo:
We investigate the evolutionary history of the greater white-toothed shrew across its distribution in northern Africa and mainland Europe using sex-specific (mtDNA and Y chromosome) and biparental (X chromosome) markers. All three loci confirm a large divergence between eastern (Tunisia and Sardinia) and western (Morocco and mainland Europe) lineages, and application of a molecular clock to mtDNA divergence estimates indicates a more ancient separation (2.25 M yr ago) than described by some previous studies, supporting claims for taxonomic revision. Moroccan ancestry for the mainland European population is inconclusive from phylogenetic trees, but is supported by greater nucleotide diversity and a more ancient population expansion in Morocco than in Europe. Signatures of rapid population expansion in mtDNA, combined with low X and Y chromosome diversity, suggest a single colonization of mainland Europe by a small number of Moroccan shrews >38 K yr ago. This study illustrates that multilocus genetic analyses can facilitate the interpretation of species' evolutionary history but that phylogeographic inference using X and Y chromosomes is restricted by low levels of observed polymorphism.
Resumo:
Monalysin was recently described as a novel pore-forming toxin (PFT) secreted by the Drosophila pathogen Pseudomonas entomophila. Recombinant monalysin is multimeric in solution, whereas PFTs are supposed to be monomeric until target membrane association. Monalysin crystals were obtained by the hanging-drop vapour-diffusion method using PEG 8000 as precipitant. Preliminary X-ray diffraction analysis revealed that monalysin crystals belonged to the monoclinic space group C2, with unit-cell parameters a = 162.4, b = 146.2, c = 144.4 Å, β = 122.8°, and diffracted to 2.85 Å resolution using synchrotron radiation. Patterson self-rotation analysis and Matthews coefficient calculation indicate that the asymmetric unit contains nine copies of monalysin. Heavy-atom derivative data were collected and a Ta6Br14 cluster derivative data set confirmed the presence of ninefold noncrystallographic symmetry.
Resumo:
We presented an integrated hierarchical model of psychopathology that more accurately captures empirical patterns of comorbidity between clinical syndromes and personality disorders.In order to verify the structural validity of the model proposed, this study aimed to analyze the convergence between the Restructured Clinical (RC) scales and Personality scales (PSY-5) of the MMPI-2-RF and the Clinical Syndrome and Personality Disorder scales of the MCMI-III.The MMPI-2-RF and MCMI-III were administered to a clinical sample of 377 outpatients (167 men and 210 women).The structural hypothesiswas assessed by using a Confirmatory Factor Analytic design with four common superordinate factors. An independent-cluster-basis solution was proposed based on maximum likelihood estimation and the application of several fit indices.The fit of the proposed model can be considered as good and more so if we take into account its complexity.