813 resultados para microarray data classification


Relevância:

80.00% 80.00%

Publicador:

Resumo:

Background: There are several studies in the literature depicting measurement error in gene expression data and also, several others about regulatory network models. However, only a little fraction describes a combination of measurement error in mathematical regulatory networks and shows how to identify these networks under different rates of noise. Results: This article investigates the effects of measurement error on the estimation of the parameters in regulatory networks. Simulation studies indicate that, in both time series (dependent) and non-time series (independent) data, the measurement error strongly affects the estimated parameters of the regulatory network models, biasing them as predicted by the theory. Moreover, when testing the parameters of the regulatory network models, p-values computed by ignoring the measurement error are not reliable, since the rate of false positives are not controlled under the null hypothesis. In order to overcome these problems, we present an improved version of the Ordinary Least Square estimator in independent (regression models) and dependent (autoregressive models) data when the variables are subject to noises. Moreover, measurement error estimation procedures for microarrays are also described. Simulation results also show that both corrected methods perform better than the standard ones (i.e., ignoring measurement error). The proposed methodologies are illustrated using microarray data from lung cancer patients and mouse liver time series data. Conclusions: Measurement error dangerously affects the identification of regulatory network models, thus, they must be reduced or taken into account in order to avoid erroneous conclusions. This could be one of the reasons for high biological false positive rates identified in actual regulatory network models.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Today several different unsupervised classification algorithms are commonly used to cluster similar patterns in a data set based only on its statistical properties. Specially in image data applications, self-organizing methods for unsupervised classification have been successfully applied for clustering pixels or group of pixels in order to perform segmentation tasks. The first important contribution of this paper refers to the development of a self-organizing method for data classification, named Enhanced Independent Component Analysis Mixture Model (EICAMM), which was built by proposing some modifications in the Independent Component Analysis Mixture Model (ICAMM). Such improvements were proposed by considering some of the model limitations as well as by analyzing how it should be improved in order to become more efficient. Moreover, a pre-processing methodology was also proposed, which is based on combining the Sparse Code Shrinkage (SCS) for image denoising and the Sobel edge detector. In the experiments of this work, the EICAMM and other self-organizing models were applied for segmenting images in their original and pre-processed versions. A comparative analysis showed satisfactory and competitive image segmentation results obtained by the proposals presented herein. (C) 2008 Published by Elsevier B.V.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Moniliophthora perniciosa is a hemibiotrophic fungus that causes witches` broom disease (WBD) in cacao. Marked dimorphism characterizes this fungus, showing a monokaryotic or biotrophic phase that causes disease symptoms and a later dikaryotic or saprotrophic phase. A combined strategy of DNA microarray, expressed sequence tag, and real-time reverse-transcriptase polymerase chain reaction analyses was employed to analyze differences between these two fungal stages in vitro. In all, 1,131 putative genes were hybridized with cDNA from different phases, resulting in 189 differentially expressed genes, and 4,595 reads were clusterized, producing 1,534 unigenes. The analysis of these genes, which represent approximately 21% of the total genes, indicates that the biotrophic-like phase undergoes carbon and nitrogen catabollite repression that correlates to the expression of phytopathogenicity genes. Moreover, downregulation of mitochondrial oxidative phosphorylation and the presence of a putative ngr1 of Saccharomyces cerevisiae could help explain its lower growth rate. In contrast, the saprotrophic mycelium expresses genes related to the metabolism of hexoses, ammonia, and oxidative phosphorylation, which could explain its faster growth. Antifungal toxins were upregulated and could prevent the colonization by competing fungi. This work significantly contributes to our understanding of the molecular mechanisms of WBD and, to our knowledge, is the first to analyze differential gene expression of the different phases of a hemibiotrophic fungus.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

1. Cluster analysis of reference sites with similar biota is the initial step in creating River Invertebrate Prediction and Classification System (RIVPACS) and similar river bioassessment models such as Australian River Assessment System (AUSRIVAS). This paper describes and tests an alternative prediction method, Assessment by Nearest Neighbour Analysis (ANNA), based on the same philosophy as RIVPACS and AUSRIVAS but without the grouping step that some people view as artificial. 2. The steps in creating ANNA models are: (i) weighting the predictor variables using a multivariate approach analogous to principal axis correlations, (ii) calculating the weighted Euclidian distance from a test site to the reference sites based on the environmental predictors, (iii) predicting the faunal composition based on the nearest reference sites and (iv) calculating an observed/expected (O/E) analogous to RIVPACS/AUSRIVAS. 3. The paper compares AUSRIVAS and ANNA models on 17 datasets representing a variety of habitats and seasons. First, it examines each model's regressions for Observed versus Expected number of taxa, including the r(2), intercept and slope. Second, the two models' assessments of 79 test sites in New Zealand are compared. Third, the models are compared on test and presumed reference sites along a known trace metal gradient. Fourth, ANNA models are evaluated for western Australia, a geographically distinct region of Australia. The comparisons demonstrate that ANNA and AUSRIVAS are generally equivalent in performance, although ANNA turns out to be potentially more robust for the O versus E regressions and is potentially more accurate on the trace metal gradient sites. 4. The ANNA method is recommended for use in bioassessment of rivers, at least for corroborating the results of the well established AUSRIVAS- and RIVPACS-type models, if not to replace them.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The Down syndrome (DS) immune phenotype is characterized by thymus hypotrophy, higher propensity to organ-specific autoimmune disorders, and higher susceptibility to infections, among other features. Considering that AIRE (autoimmune regulator) is located on 21q22.3, we analyzed protein and gene expression in surgically removed thymuses from 14 DS patients with congenital heart defects, who were compared with 42 age-matched controls with heart anomaly as an isolated malformation. Immunohistochemistry revealed 70.48 +/- 49.59 AIRE-positive cells/mm(2) in DS versus 154.70 +/- 61.16 AIRE-positive cells/mm(2) in controls (p < 0.0001), and quantitative PCR as well as DNA microarray data confirmed those results. The number of FOXP3-positive cells/mm(2) was equivalent in both groups. Thymus transcriptome analysis showed 407 genes significantly hypoexpressed in DS, most of which were related, according to network transcriptional analysis (FunNet), to cell division and to immunity. Immune response-related genes included those involved in 1) Ag processing and presentation (HLA-DQB1, HLA-DRB3, CD1A, CD1B, CD1C, ERAP) and 2) thymic T cell differentiation (IL2RG, RAG2, CD3D, CD3E, PRDX2, CDK6) and selection (SH2D1A, CD74). It is noteworthy that relevant AIRE-partner genes, such as TOP2A, LAMNB1, and NUP93, were found hypoexpressed in DNA microarrays and quantitative real-time PCR analyses. These findings on global thymic hypofunction in DS revealed molecular mechanisms underlying DS immune phenotype and strongly suggest that DS immune abnormalities are present since early development, rather than being a consequence of precocious aging, as widely hypothesized. Thus, DS should be considered as a non-monogenic primary immunodeficiency. The Journal of Immunology, 2011, 187: 3422-3430.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Chagas disease, characterized by acute myocarditis and chronic cardiomyopathy, is caused by infection with the protozoan parasite Trypanosoma cruzi. We sought to identify genes altered during the development of parasite-induced cardiomyopathy. Microarrays containing 27,400 sequence-verified mouse cDNAs were used to analyze global gene expression changes in the myocardium of a murine model of chagasic cardiomyopathy. Changes in gene expression were determined as the acute stage of infection developed into the chronic stage. This analysis was performed on the hearts of male CD-1 mice infected with trypomastigotes of T. cruzi (Brazil strain). At each interval we compared infected and uninfected mice and confirmed the microarray data with dye reversal. We identified eight distinct categories of mRNAs that were differentially regulated during infection and identified dysregulation of several key genes. These data may provide insight into the pathogenesis of chagasic cardiomyopathy and provide new targets for intervention. (c) 2008 Elsevier Inc. All rights reserved.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Objectives To evaluate the gene expression profile of fibroblasts from affected and non-affected skin of systemic sclerosis (SSc) patients and from controls. Materials and methods Labeled cDNA from fibroblast cultures from forearm (affected) and axillary (non-affected) skin from six diffuse SSc patients, from three normal controls, and from MOLT-4/HEp-2/normal fibroblasts (reference pool) was probed in microarrays generated with 4193 human cDNAs from the IMAGE Consortium. Microarray images were converted into numerical data and gene expression was calculated as the ratio between fibroblast cDNA (Cy5) and reference pool cDNA (Cy3) data and analyzed by R environment/Aroma, Cluster, Tree View, and SAM softwares. Differential expression was confirmed by real time PCR for a set of selected genes. Results Eighty-eight genes were up- and 241 genes down-regulated in SSc fibroblasts. Gene expression correlation was strong between affected and non-affected fibroblast samples from the same patient (r>0.8), moderate among fibroblasts from all patients (r=0.72) and among fibroblasts from all controls (r=0.70), and modest among fibroblasts from patients and controls (r=0.55). The differential expression was confirmed by real time PCR for all selected genes. Conclusions Fibroblasts from affected and non-affected skin of SSc patients shared a similar abnormal gene expression profile, suggesting that the widespread molecular disturbance in SSc fibroblasts is more sensitive than histological and clinical alterations. Novel molecular elements potentially involved in SSc pathogenesis were identified.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Urinary bladder cancer is the fourth most common malignancy in the Western world. Transitional cell carcinoma (TCC) is the most common subtype, accounting for about 90% of all bladder cancers. The TP53 gene plays an essential role in the regulation of the cell cycle and apoptosis and therefore contributes to cellular transformation and malignancy; however, little is known about the differential gene expression patterns in human tumors that present with the wild-type or mutated TP53 gene. Therefore, because gene profiling can provide new insights into the molecular biology of bladder cancer, the present study aimed to compare the molecular profiles of bladder cancer cell lines with different TP53 alleles, including the wild type (RT4) and two mutants (5637, with mutations in codons 280 and 72; and T24, a TP53 allele encoding an in-frame deletion of tyrosine 126). Unsupervised hierarchical clustering and gene networks were constructed based on data generated by cDNA microarrays using mRNA from the three cell lines. Differentially expressed genes related to the cell cycle, cell division, cell death, and cell proliferation were observed in the three cell lines. However, the cDNA microarray data did not cluster cell lines based on their TP53 allele. The gene profiles of the RT4 cells were more similar to those of T24 than to those of the 5637 cells. While the deregulation of both the cell cycle and the apoptotic pathways was particularly related to TCC, these alterations were not associated with the TP53 status.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Gene expression profiling by cDNA microarrays during murine thymus ontogeny has contributed to dissecting the large-scale molecular genetics of T cell maturation. Gene profiling, although useful for characterizing the thymus developmental phases and identifying the differentially expressed genes, does not permit the determination of possible interactions between genes. In order to reconstruct genetic interactions, on RNA level, within thymocyte differentiation, a pair of microarrays containing a total of 1,576 cDNA sequences derived from the IMAGE MTB library was applied on samples of developing thymuses (14-17 days of gestation). The data were analyzed using the GeneNetwork program. Genes that were previously identified as differentially expressed during thymus ontogeny showed their relationships with several other genes. The present method provided the detection of gene nodes coding for proteins implicated in the calcium signaling pathway, such as Prrg2 and Stxbp3, and in protein transport toward the cell membrane, such as Gosr2. The results demonstrate the feasibility of reconstructing networks based on cDNA microarray gene expression determinations, contributing to a clearer understanding of the complex interactions between genes involved in thymus/thymocyte development.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Dissertação para a obtenção do grau de Mestre em Engenharia Electrotécnica Ramo de Energia

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Este documento foi redigido no âmbito da dissertação do Mestrado em Engenharia Informática na área de Arquiteturas, Sistemas e Redes, do Departamento de Engenharia Informática, do ISEP, cujo tema é diagnóstico cardíaco a partir de dados acústicos e clínicos. O objetivo deste trabalho é produzir um método que permita diagnosticar automaticamente patologias cardíacas utilizando técnicas de classificação de data mining. Foram utilizados dois tipos de dados: sons cardíacos gravados em ambiente hospitalar e dados clínicos. Numa primeira fase, exploraram-se os sons cardíacos usando uma abordagem baseada em motifs. Numa segunda fase, utilizamos os dados clínicos anotados dos pacientes. Numa terceira fase, avaliamos a combinação das duas abordagens. Na avaliação experimental os modelos baseados em motifs obtiveram melhores resultados do que os construídos a partir dos dados clínicos. A combinação das abordagens mostrou poder ser vantajosa em situações pontuais.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

BACKGROUND: Zebrafish is a clinically-relevant model of heart regeneration. Unlike mammals, it has a remarkable heart repair capacity after injury, and promises novel translational applications. Amputation and cryoinjury models are key research tools for understanding injury response and regeneration in vivo. An understanding of the transcriptional responses following injury is needed to identify key players of heart tissue repair, as well as potential targets for boosting this property in humans. RESULTS: We investigated amputation and cryoinjury in vivo models of heart damage in the zebrafish through unbiased, integrative analyses of independent molecular datasets. To detect genes with potential biological roles, we derived computational prediction models with microarray data from heart amputation experiments. We focused on a top-ranked set of genes highly activated in the early post-injury stage, whose activity was further verified in independent microarray datasets. Next, we performed independent validations of expression responses with qPCR in a cryoinjury model. Across in vivo models, the top candidates showed highly concordant responses at 1 and 3 days post-injury, which highlights the predictive power of our analysis strategies and the possible biological relevance of these genes. Top candidates are significantly involved in cell fate specification and differentiation, and include heart failure markers such as periostin, as well as potential new targets for heart regeneration. For example, ptgis and ca2 were overexpressed, while usp2a, a regulator of the p53 pathway, was down-regulated in our in vivo models. Interestingly, a high activity of ptgis and ca2 has been previously observed in failing hearts from rats and humans. CONCLUSIONS: We identified genes with potential critical roles in the response to cardiac damage in the zebrafish. Their transcriptional activities are reproducible in different in vivo models of cardiac injury.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

BACKGROUND: The Nuclear Factor I (NFI) family of DNA binding proteins (also called CCAAT box transcription factors or CTF) is involved in both DNA replication and gene expression regulation. Using chromatin immuno-precipitation and high throughput sequencing (ChIP-Seq), we performed a genome-wide mapping of NFI DNA binding sites in primary mouse embryonic fibroblasts. RESULTS: We found that in vivo and in vitro NFI DNA binding specificities are indistinguishable, as in vivo ChIP-Seq NFI binding sites matched predictions based on previously established position weight matrix models of its in vitro binding specificity. Combining ChIP-Seq with mRNA profiling data, we found that NFI preferentially associates with highly expressed genes that it up-regulates, while binding sites were under-represented at expressed but unregulated genes. Genomic binding also correlated with markers of transcribed genes such as histone modifications H3K4me3 and H3K36me3, even outside of annotated transcribed loci, implying NFI in the control of the deposition of these modifications. Positional correlation between + and - strand ChIP-Seq tags revealed that, in contrast to other transcription factors, NFI associates with a nucleosomal length of cleavage-resistant DNA, suggesting an interaction with positioned nucleosomes. In addition, NFI binding prominently occurred at boundaries displaying discontinuities in histone modifications specific of expressed and silent chromatin, such as loci submitted to parental allele-specific imprinted expression. CONCLUSIONS: Our data thus suggest that NFI nucleosomal interaction may contribute to the partitioning of distinct chromatin domains and to epigenetic gene expression regulation.NFI ChIP-Seq and input control DNA data were deposited at Gene Expression Omnibus (GEO) repository under accession number GSE15844. Gene expression microarray data for mouse embryonic fibroblasts are on GEO accession number GSE15871.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Dysregulation of intestinal epithelial cell performance is associated with an array of pathologies whose onset mechanisms are incompletely understood. While whole-genomics approaches have been valuable for studying the molecular basis of several intestinal diseases, a thorough analysis of gene expression along the healthy gastrointestinal tract is still lacking. The aim of this study was to map gene expression in gastrointestinal regions of healthy human adults and to implement a procedure for microarray data analysis that would allow its use as a reference when screening for pathological deviations. We analyzed the gene expression signature of antrum, duodenum, jejunum, ileum, and transverse colon biopsies using a biostatistical method based on a multivariate and univariate approach to identify region-selective genes. One hundred sixty-six genes were found responsible for distinguishing the five regions considered. Nineteen had never been described in the GI tract, including a semaphorin probably implicated in pathogen invasion and six novel genes. Moreover, by crossing these genes with those retrieved from an existing data set of gene expression in the intestine of ulcerative colitis and Crohn's disease patients, we identified genes that might be biomarkers of Crohn's and/or ulcerative colitis in ileum and/or colon. These include CLCA4 and SLC26A2, both implicated in ion transport. This study furnishes the first map of gene expression along the healthy human gastrointestinal tract. Furthermore, the approach implemented here, and validated by retrieving known gene profiles, allowed the identification of promising new leads in both healthy and disease states.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

PURPOSE: To evaluate and validate mRNA expression markers capable of identifying patients with ErbB2-positive breast cancer associated with distant metastasis and reduced survival. PATIENTS AND METHODS: Expression of 60 genes involved in breast cancer biology was assessed by quantitative real-time PCR (qrt-PCR) in 317 primary breast cancer patients and correlated with clinical outcome data. Results were validated subsequently using two previously published and publicly available microarray data sets with different patient populations comprising 295 and 286 breast cancer samples, respectively. RESULTS: Of the 60 genes measured by qrt-PCR, urokinase-type plasminogen activator (uPA or PLAU) mRNA expression was the most significant marker associated with distant metastasis-free survival (MFS) by univariate Cox analysis in patients with ErbB2-positive tumors and an independent factor in multivariate analysis. Subsequent validation in two microarray data sets confirmed the prognostic value of uPA in ErbB2-positive tumors by both univariate and multivariate analysis. uPA mRNA expression was not significantly associated with MFS in ErbB2-negative tumors. Kaplan-Meier analysis showed in all three study populations that patients with ErbB2-positive/uPA-positive tumors exhibited significantly reduced MFS (hazard ratios [HR], 4.3; 95% CI, 1.6 to 11.8; HR, 2.7; 95% CI, 1.2 to 6.2; and, HR, 2.8; 95% CI, 1.1 to 7.1; all P < .02) as compared with the group with ErbB2-positive/uPA-negative tumors who exhibited similar outcome to those with ErbB2-negative tumors, irrespective of uPA status. CONCLUSION: After evaluation of 898 breast cancer patients, uPA mRNA expression emerged as a powerful prognostic indicator in ErbB2-positive tumors. These results were consistent among three independent study populations assayed by different techniques, including qrt-PCR and two microarray platforms.