889 resultados para GENE-EXPRESSION DIFFERENCES
Resumo:
Changes in gene expression are thought to underlie many of the phenotypic differences between species. However, large-scale analyses of gene expression evolution were until recently prevented by technological limitations. Here we report the sequencing of polyadenylated RNA from six organs across ten species that represent all major mammalian lineages (placentals, marsupials and monotremes) and birds (the evolutionary outgroup), with the goal of understanding the dynamics of mammalian transcriptome evolution. We show that the rate of gene expression evolution varies among organs, lineages and chromosomes, owing to differences in selective pressures: transcriptome change was slow in nervous tissues and rapid in testes, slower in rodents than in apes and monotremes, and rapid for the X chromosome right after its formation. Although gene expression evolution in mammals was strongly shaped by purifying selection, we identify numerous potentially selectively driven expression switches, which occurred at different rates across lineages and tissues and which probably contributed to the specific organ biology of various mammals.
Resumo:
To test the hypotheses that mutant huntingtin protein length and wild-type huntingtin dosage have important effects on disease-related transcriptional dysfunction, we compared the changes in mRNA in seven genetic mouse models of Huntington's disease (HD) and postmortem human HD caudate. Transgenic models expressing short N-terminal fragments of mutant huntingtin (R6/1 and R6/2 mice) exhibited the most rapid effects on gene expression, consistent with previous studies. Although changes in the brains of knock-in and full-length transgenic models of HD took longer to appear, 15- and 22-month CHL2(Q150/Q150), 18-month Hdh(Q92/Q92) and 2-year-old YAC128 animals also exhibited significant HD-like mRNA signatures. Whereas it was expected that the expression of full-length huntingtin transprotein might result in unique gene expression changes compared with those caused by the expression of an N-terminal huntingtin fragment, no discernable differences between full-length and fragment models were detected. In addition, very high correlations between the signatures of mice expressing normal levels of wild-type huntingtin and mice in which the wild-type protein is absent suggest a limited effect of the wild-type protein to change basal gene expression or to influence the qualitative disease-related effect of mutant huntingtin. The combined analysis of mouse and human HD transcriptomes provides important temporal and mechanistic insights into the process by which mutant huntingtin kills striatal neurons. In addition, the discovery that several available lines of HD mice faithfully recapitulate the gene expression signature of the human disorder provides a novel aspect of validation with respect to their use in preclinical therapeutic trials.
Resumo:
Arabidopsis thaliana contains two genes encoding farnesyl diphosphate (FPP) synthase (FPS), the prenyl diphoshate synthase that catalyzes the synthesis of FPP from isopentenyl diphosphate (IPP) and dimethylallyl diphosphate (DMAPP). In this study, we provide evidence that the two Arabidopsis short FPS isozymes FPS1S and FPS2 localize to the cytosol. Both enzymes were expressed in E. coli, purified and biochemically characterized. Despite FPS1S and FPS2 share more than 90% amino acid sequence identity, FPS2 was found to be more efficient as a catalyst, more sensitive to the inhibitory effect of NaCl, and more resistant to thermal inactivation than FPS1S. Homology modelling for FPS1S and FPS2 and analysis of the amino acid differences between the two enzymes revealed an increase in surface polarity and a greater capacity to form surface salt bridges of FPS2 compared to FPS1S. These factors most likely account for the enhanced thermostability of FPS2. Expression analysis of FPS::GUS genes in seeds showed that FPS1 and FPS2 display complementary patterns of expression particularly at late stages of seed development, which suggests that Arabidopsis seeds have two spatially segregated sources of FPP. Functional complementation studies of the Arabidopsis fps2 knockout mutant seed phenotypes demonstrated that under normal conditions FPS1S and FPS2 are functionally interchangeable. A putative role for FPS2 in maintaining seed germination capacity under adverse environmental conditions is discussed.
Resumo:
Expression data contribute significantly to the biological value of the sequenced human genome, providing extensive information about gene structure and the pattern of gene expression. ESTs, together with SAGE libraries and microarray experiment information, provide a broad and rich view of the transcriptome. However, it is difficult to perform large-scale expression mining of the data generated by these diverse experimental approaches. Not only is the data stored in disparate locations, but there is frequent ambiguity in the meaning of terms used to describe the source of the material used in the experiment. Untangling semantic differences between the data provided by different resources is therefore largely reliant on the domain knowledge of a human expert. We present here eVOC, a system which associates labelled target cDNAs for microarray experiments, or cDNA libraries and their associated transcripts with controlled terms in a set of hierarchical vocabularies. eVOC consists of four orthogonal controlled vocabularies suitable for describing the domains of human gene expression data including Anatomical System, Cell Type, Pathology and Developmental Stage. We have curated and annotated 7016 cDNA libraries represented in dbEST, as well as 104 SAGE libraries,with expression information,and provide this as an integrated, public resource that allows the linking of transcripts and libraries with expression terms. Both the vocabularies and the vocabulary-annotated libraries can be retrieved from http://www.sanbi.ac.za/evoc/. Several groups are involved in developing this resource with the aim of unifying transcript expression information.
Resumo:
During my PhD, my aim was to provide new tools to increase our capacity to analyse gene expression patterns, and to study on a large-scale basis the evolution of gene expression in animals. Gene expression patterns (when and where a gene is expressed) are a key feature in understanding gene function, notably in development. It appears clear now that the evolution of developmental processes and of phenotypes is shaped both by evolution at the coding sequence level, and at the gene expression level.Studying gene expression evolution in animals, with complex expression patterns over tissues and developmental time, is still challenging. No tools are available to routinely compare expression patterns between different species, with precision, and on a large-scale basis. Studies on gene expression evolution are therefore performed only on small genes datasets, or using imprecise descriptions of expression patterns.The aim of my PhD was thus to develop and use novel bioinformatics resources, to study the evolution of gene expression. To this end, I developed the database Bgee (Base for Gene Expression Evolution). The approach of Bgee is to transform heterogeneous expression data (ESTs, microarrays, and in-situ hybridizations) into present/absent calls, and to annotate them to standard representations of anatomy and development of different species (anatomical ontologies). An extensive mapping between anatomies of species is then developed based on hypothesis of homology. These precise annotations to anatomies, and this extensive mapping between species, are the major assets of Bgee, and have required the involvement of many co-workers over the years. My main personal contribution is the development and the management of both the Bgee database and the web-application.Bgee is now on its ninth release, and includes an important gene expression dataset for 5 species (human, mouse, drosophila, zebrafish, Xenopus), with the most data from mouse, human and zebrafish. Using these three species, I have conducted an analysis of gene expression evolution after duplication in vertebrates.Gene duplication is thought to be a major source of novelty in evolution, and to participate to speciation. It has been suggested that the evolution of gene expression patterns might participate in the retention of duplicate genes. I performed a large-scale comparison of expression patterns of hundreds of duplicated genes to their singleton ortholog in an outgroup, including both small and large-scale duplicates, in three vertebrate species (human, mouse and zebrafish), and using highly accurate descriptions of expression patterns. My results showed unexpectedly high rates of de novo acquisition of expression domains after duplication (neofunctionalization), at least as high or higher than rates of partitioning of expression domains (subfunctionalization). I found differences in the evolution of expression of small- and large-scale duplicates, with small-scale duplicates more prone to neofunctionalization. Duplicates with neofunctionalization seemed to evolve under more relaxed selective pressure on the coding sequence. Finally, even with abundant and precise expression data, the majority fate I recovered was neither neo- nor subfunctionalization of expression domains, suggesting a major role for other mechanisms in duplicate gene retention.
Resumo:
TLR4 (Toll-like receptor 4) is essential for sensing the endotoxin of Gram-negative bacteria. Mutations or deletion of the TLR4 gene in humans or mice have been associated with altered predisposition to or outcome of Gram-negative sepsis. In the present work, we studied the expression and regulation of the Tlr4 gene of mouse. In vivo, TLR4 levels were higher in macrophages compared with B, T or natural killer cells. High basal TLR4 promoter activity was observed in RAW 264.7, J774 and P388D1 macrophages transfected with a TLR4 promoter reporter vector. Analysis of truncated and mutated promoter constructs identified several positive [two Ets (E twenty-six) and one AP-1 (activator protein-1) sites] and negative (a GATA-like site and an octamer site) regulatory elements within 350 bp upstream of the transcriptional start site. The myeloid and B-cell-specific transcription factor PU.1 bound to the proximal Ets site. In contrast, none among PU.1, Ets-1, Ets-2 and Elk-1, but possibly one member of the ESE (epithelium-specific Ets) subfamily of Ets transcription factors, bound to the distal Ets site, which was indispensable for Tlr4 gene transcription. Endotoxin did not affect macrophage TLR4 promoter activity, but it decreased TLR4 steady-state mRNA levels by increasing the turnover of TLR4 transcripts. TLR4 expression was modestly altered by other pro- and anti-inflammatory stimuli, except for PMA plus ionomycin which strongly increased promoter activity and TLR4 mRNA levels. The mouse and human TLR4 genes were highly conserved. Yet, notable differences exist with respect to the elements implicated in gene regulation, which may account for species differences in terms of tissue expression and modulation by microbial and inflammatory stimuli.
Resumo:
The biocontrol activity of the root-colonizing Pseudomonas fluorescens strain CHA0 is largely determined by the production of antifungal metabolites, especially 2,4-diacetylphloroglucinol. The expression of these metabolites depends on abiotic and biotic environmental factors, in particular, elements present in the rhizosphere. In this study, we have developed a new method for the in situ analysis of antifungal gene expression using flow cytometry combined with green fluorescent protein (GFP)-based reporter fusions to the phlA and prnA genes essential for the production of the antifungal compounds 2,4-diacetylphloroglucinol and pyrrolnitrin, respectively, in strain CHA0. Expression of phlA-gfp and prnA-gfp in CHA0 cells harvested from the rhizosphere of a set of plant species as well as from the roots of healthy, leaf pathogen-attacked, and physically stressed plants were analyzed using a FACSCalibur. After subtraction of background fluorescence emitted by plant-derived particles and CHA0 cells not carrying the gfp reporters, the average gene expression per bacterial cell could be calculated. Levels of phlA and prnA expression varied significantly in the rhizospheres of different plant species. Physical stress and leaf pathogen infection lowered phlA expression levels in the rhizosphere of cucumber. Our results demonstrate that the newly developed approach is suitable to monitor differences in levels of antifungal gene expression in response to various plant-derived factors. An advantage of the method is that it allows quantification of bacterial gene expression in rhizosphere populations at a single-cell level. To our best knowledge, this is the first study using flow cytometry for the in situ analysis of biocontrol gene expression in a plant-beneficial bacterium in the rhizosphere.
Resumo:
The recognition that colorectal cancer (CRC) is a heterogeneous disease in terms of clinical behaviour and response to therapy translates into an urgent need for robust molecular disease subclassifiers that can explain this heterogeneity beyond current parameters (MSI, KRAS, BRAF). Attempts to fill this gap are emerging. The Cancer Genome Atlas (TGCA) reported two main CRC groups, based on the incidence and spectrum of mutated genes, and another paper reported an EMT expression signature defined subgroup. We performed a prior free analysis of CRC heterogeneity on 1113 CRC gene expression profiles and confronted our findings to established molecular determinants and clinical, histopathological and survival data. Unsupervised clustering based on gene modules allowed us to distinguish at least five different gene expression CRC subtypes, which we call surface crypt-like, lower crypt-like, CIMP-H-like, mesenchymal and mixed. A gene set enrichment analysis combined with literature search of gene module members identified distinct biological motifs in different subtypes. The subtypes, which were not derived based on outcome, nonetheless showed differences in prognosis. Known gene copy number variations and mutations in key cancer-associated genes differed between subtypes, but the subtypes provided molecular information beyond that contained in these variables. Morphological features significantly differed between subtypes. The objective existence of the subtypes and their clinical and molecular characteristics were validated in an independent set of 720 CRC expression profiles. Our subtypes provide a novel perspective on the heterogeneity of CRC. The proposed subtypes should be further explored retrospectively on existing clinical trial datasets and, when sufficiently robust, be prospectively assessed for clinical relevance in terms of prognosis and treatment response predictive capacity. Original microarray data were uploaded to the ArrayExpress database (http://www.ebi.ac.uk/arrayexpress/) under Accession Nos E-MTAB-990 and E-MTAB-1026. © 2013 Swiss Institute of Bioinformatics. Journal of Pathology published by John Wiley & Sons Ltd on behalf of Pathological Society of Great Britain and Ireland.
Resumo:
Estradiol and progesterone are crucial for the acquisition of receptivity and the change in transcriptional activity of target genes in the implantation window. The aim of this study was to differentiate the regulation of genes in the endometrium of patients with recurrent implantation failure (IF) versus those who became pregnant after in vitro fertilization (IVF) treatment. Moreover, the effect of embryo-derived factors on endometrial transcriptional activity was studied. Nine women with known IVF outcome (IF, M, miscarriage, OP, ongoing pregnancy) and undergoing hysteroscopy with endometrial biopsy were enrolled. Biopsies were taken during the midluteal phase. After culture in the presence of embryo-conditioned IVF media, total RNA was extracted and submitted to reverse transcription, target cDNA synthesis, biotin labelling, fragmentation and hybridization using the Affymetrix Human Genome U133A 2.0 Chip. Differential expression of selected genes was re-analysed by quantitative PCR, in which the results were calculated as threshold cycle differences between the groups and normalized to Glyceraldehyde phosphate dehydrogenase and beta-actin. Differences were seen for several genes from endometrial tissue between the IF and the pregnancy groups, and when comparing OP with M, 1875 up- and 1807 down-regulated genes were returned. Real-time PCR analysis confirmed up-regulation for somatostatin, PLAP-2, mucin 4 and CD163, and down-regulation of glycodelin, IL-24, CD69, leukaemia inhibitory factor and prolactin receptor between Op and M. When the different embryo-conditioned media were compared, no significant differential regulation could be demonstrated. Although microarray profiling may currently not be sensitive enough for studying the effects of embryo-derived factors on the endometrium, the observed differences in gene expression between M and OP suggest that it will become an interesting tool for the identification of fertility-relevant markers produced by the endometrium.
Resumo:
BACKGROUND: Cellular processes underlying memory formation are evolutionary conserved, but natural variation in memory dynamics between animal species or populations is common. The genetic basis of this fascinating phenomenon is poorly understood. Closely related species of Nasonia parasitic wasps differ in long-term memory (LTM) formation: N. vitripennis will form transcription-dependent LTM after a single conditioning trial, whereas the closely-related species N. giraulti will not. Genes that were differentially expressed (DE) after conditioning in N. vitripennis, but not in N. giraulti, were identified as candidate genes that may regulate LTM formation. RESULTS: RNA was collected from heads of both species before and immediately, 4 or 24 hours after conditioning, with 3 replicates per time point. It was sequenced strand-specifically, which allows distinguishing sense from antisense transcripts and improves the quality of expression analyses. We determined conditioning-induced DE compared to naïve controls for both species. These expression patterns were then analysed with GO enrichment analyses for each species and time point, which demonstrated an enrichment of signalling-related genes immediately after conditioning in N. vitripennis only. Analyses of known LTM genes and genes with an opposing expression pattern between the two species revealed additional candidate genes for the difference in LTM formation. These include genes from various signalling cascades, including several members of the Ras and PI3 kinase signalling pathways, and glutamate receptors. Interestingly, several other known LTM genes were exclusively differentially expressed in N. giraulti, which may indicate an LTM-inhibitory mechanism. Among the DE transcripts were also antisense transcripts. Furthermore, antisense transcripts aligning to a number of known memory genes were detected, which may have a role in regulating these genes. CONCLUSION: This study is the first to describe and compare expression patterns of both protein-coding and antisense transcripts, at different time points after conditioning, of two closely related animal species that differ in LTM formation. Several candidate genes that may regulate differences in LTM have been identified. This transcriptome analysis is a valuable resource for future in-depth studies to elucidate the role of candidate genes and antisense transcription in natural variation in LTM formation.
Resumo:
The amount of biological data has grown exponentially in recent decades. Modern biotechnologies, such as microarrays and next-generation sequencing, are capable to produce massive amounts of biomedical data in a single experiment. As the amount of the data is rapidly growing there is an urgent need for reliable computational methods for analyzing and visualizing it. This thesis addresses this need by studying how to efficiently and reliably analyze and visualize high-dimensional data, especially that obtained from gene expression microarray experiments. First, we will study the ways to improve the quality of microarray data by replacing (imputing) the missing data entries with the estimated values for these entries. Missing value imputation is a method which is commonly used to make the original incomplete data complete, thus making it easier to be analyzed with statistical and computational methods. Our novel approach was to use curated external biological information as a guide for the missing value imputation. Secondly, we studied the effect of missing value imputation on the downstream data analysis methods like clustering. We compared multiple recent imputation algorithms against 8 publicly available microarray data sets. It was observed that the missing value imputation indeed is a rational way to improve the quality of biological data. The research revealed differences between the clustering results obtained with different imputation methods. On most data sets, the simple and fast k-NN imputation was good enough, but there were also needs for more advanced imputation methods, such as Bayesian Principal Component Algorithm (BPCA). Finally, we studied the visualization of biological network data. Biological interaction networks are examples of the outcome of multiple biological experiments such as using the gene microarray techniques. Such networks are typically very large and highly connected, thus there is a need for fast algorithms for producing visually pleasant layouts. A computationally efficient way to produce layouts of large biological interaction networks was developed. The algorithm uses multilevel optimization within the regular force directed graph layout algorithm.
Resumo:
Clinical stage (CS) is an established indicator of breast cancer outcome. In the present study, a cDNA microarray platform containing 692 genes was used to identify molecular differences between CSII and CSIII disease. Tumor samples were collected from patients with CSII or CSIII breast cancer, and normal breast tissue was collected from women without invasive cancer. Seventy-eight genes were deregulated in CSIII tumors and 22 in CSII tumors when compared to normal tissue, and 20 of them were differentially expressed in both CSII and CSIII tumors. In addition, 58 genes were specifically altered in CSIII and expression of 6 of them was tested by real time RT-PCR in another cohort of patients with CSII or CSIII breast cancer and in women without cancer. Among these genes, MAX, KRT15 and S100A14, but not APOBEC3G or KRT19, were differentially expressed on both CSIII and CSII tumors as compared to normal tissue. Increased HMOX1 levels were detected only in CSIII tumors and may represent a molecular marker of this stage. A clear difference in gene expression pattern occurs at the normal-to-cancer transition; however, most of the differentially expressed genes are deregulated in tumors of both CS (II and III) compared to normal breast tissue.
Resumo:
In the canine species, the precise mechanisms of pregnancy maintenance and the initiation of parturition are not completely understood. The expression of genes encoding the receptors for estrogen (ERα mRNA) and oxytocin (OTR mRNA) was studied in the endometrium and myometrium during pregnancy and parturition in dogs. Real-time PCR was performed to quantify the levels of ERα mRNA and OTR mRNA in the uterus of bitches during early (up to 20 days of gestation), mid (20 to 40 days) and late pregnancy (41 to 60 days), and parturition (first stage of labor). All tissues expressed ERα and OTR mRNA, and are thus possibly able to respond to eventual estrogen and oxytocin hormonal stimuli. No statistically significant differences in the expression of ERα mRNA were verified in the endometrium and myometrium throughout pregnancy and parturition, but expression of OTR mRNA increased at both parturition and late pregnancy. We concluded that the increase of endometrial and myometrial OTR mRNA expression in dogs is not an event dependent on estrogenic stimulation. Moreover, the contractility response of the canine uterus to oxytocin begins during pregnancy and maintains myometrial activity. The expression of OTR mRNA in canine uterine tissues varied over time, which supports an interpretation that the sensitivity and response to hormone therapy varies during the course of pregnancy and labor. Further studies are needed to elucidate the factors underlying the synthesis of uterine oxytocin receptors and the possible role of ERβ rather than ERα in the uterine tissues during pregnancy and parturition in dogs.
Resumo:
Lichens are symbiotic organisms, which consist of the fungal partner and the photosynthetic partner, which can be either an alga or a cyanobacterium. In some lichen species the symbiosis is tripartite, where the relationship includes both an alga and a cyanobacterium alongside the primary symbiont, fungus. The lichen symbiosis is an evolutionarily old adaptation to life on land and many extant fungal species have evolved from lichenised ancestors. Lichens inhabit a wide range of habitats and are capable of living in harsh environments and on nutrient poor substrates, such as bare rocks, often enduring frequent cycles of drying and wetting. Most lichen species are desiccation tolerant, and they can survive long periods of dehydration, but can rapidly resume photosynthesis upon rehydration. The molecular mechanisms behind lichen desiccation tolerance are still largely uncharacterised and little information is available for any lichen species at the genomic or transcriptomic level. The emergence of the high-throughput next generation sequencing (NGS) technologies and the subsequent decrease in the cost of sequencing new genomes and transcriptomes has enabled non-model organism research on the whole genome level. In this doctoral work the transcriptome and genome of the grey reindeer lichen, Cladonia rangiferina, were sequenced, de novo assembled and characterised using NGS and traditional expressed sequence tag (EST) technologies. RNA extraction methods were optimised to improve the yield and quality of RNA extracted from lichen tissue. The effects of rehydration and desiccation on C. rangiferina gene expression on whole transcriptome level were studied and the most differentially expressed genes were identified. The secondary metabolites present in C. rangiferina decreased the quality – integrity, optical characteristics and utility for sensitive molecular biological applications – of the extracted RNA requiring an optimised RNA extraction method for isolating sufficient quantities of high-quality RNA from lichen tissue in a time- and cost-efficient manner. The de novo assembly of the transcriptome of C. rangiferina was used to produce a set of contiguous unigene sequences that were used to investigate the biological functions and pathways active in a hydrated lichen thallus. The de novo assembly of the genome yielded an assembly containing mostly genes derived from the fungal partner. The assembly was of sufficient quality, in size similar to other lichen-forming fungal genomes and included most of the core eukaryotic genes. Differences in gene expression were detected in all studied stages of desiccation and rehydration, but the largest changes occurred during the early stages of rehydration. The most differentially expressed genes did not have any annotations, making them potentially lichen-specific genes, but several genes known to participate in environmental stress tolerance in other organisms were also identified as differentially expressed.
Resumo:
Since the alkyl esters of p-hydroxybenzoic acid (parabens) can be measured intact in the human breast and possess oestrogenic properties, it has been suggested that they could contribute to an aberrant burden of oestrogen signalling in the human breast and so play a role in the rising incidence of breast cancer. However, although parabens have been shown to regulate a few single genes (reporter genes, pS2, progesterone receptor) in a manner similar to that of 17 beta-oestradiol, the question remains as to the full extent of the similarity in the overall gene profile induced in response to parabens compared with 17 beta-oestradiol. The GE-Amersham CodeLink 20 K human expression microarray system was used to profile the expression of 19881 genes in MCF7 human breast cancer cells following a 7-day exposure to 5 x 10(-4) m methylparaben, 10(-5) m n-butylparaben and 10(-8) m 17 beta-oestradiol. At these concentrations, the parabens gave growth responses in MCF7 cells of similar magnitude to 17 beta-oestradiol. The study identified genes which are upregulated or downregulated to a similar extent by methylparaben, n-butylparaben and 17 beta-oestradiol. However, the majority of genes were not regulated in the same way by all three treatments. Some genes responded differently to parabens from 17 beta-oestradiol, and furthermore, differences in expression of some genes could be detected even between the two individual parabens. Therefore, although parabens possess oestrogenic properties, their mimicry in terms of global gene expression patterns is not perfect and differences in gene expression profiles could result in consequences to the cells that are not identical to those following exposure to 17 beta-oestradiol. Copyright (c) 2006 John Wiley & Sons, Ltd.