996 resultados para Human Transcriptome
Resumo:
Only a small proportion of the mouse genome is transcribed into mature messenger RNA transcripts. There is an international collaborative effort to identify all full-length mRNA transcripts from the mouse, and to ensure that each is represented in a physical collection of clones. Here we report the manual annotation of 60,770 full-length mouse complementary DNA sequences. These are clustered into 33,409 'transcriptional units', contributing 90.1% of a newly established mouse transcriptome database. Of these transcriptional units, 4,258 are new protein-coding and 11,665 are new non-coding messages, indicating that non-coding RNA is a major component of the transcriptome. 41% of all transcriptional units showed evidence of alternative splicing. In protein-coding transcripts, 79% of splice variations altered the protein product. Whole-transcriptome analyses resulted in the identification of 2,431 sense-antisense pairs. The present work, completely supported by physical clones, provides the most comprehensive survey of a mammalian transcriptome so far, and is a valuable resource for functional genomics.
Resumo:
The classic organization of a gene structure has followed the Jacob and Monod bacterial gene model proposed more than 50 years ago. Since then, empirical determinations of the complexity of the transcriptomes found in yeast to human has blurred the definition and physical boundaries of genes. Using multiple analysis approaches we have characterized individual gene boundaries mapping on human chromosomes 21 and 22. Analyses of the locations of the 5' and 3' transcriptional termini of 492 protein coding genes revealed that for 85% of these genes the boundaries extend beyond the current annotated termini, most often connecting with exons of transcripts from other well annotated genes. The biological and evolutionary importance of these chimeric transcripts is underscored by (1) the non-random interconnections of genes involved, (2) the greater phylogenetic depth of the genes involved in many chimeric interactions, (3) the coordination of the expression of connected genes and (4) the close in vivo and three dimensional proximity of the genomic regions being transcribed and contributing to parts of the chimeric RNAs. The non-random nature of the connection of the genes involved suggest that chimeric transcripts should not be studied in isolation, but together, as an RNA network.
Resumo:
2 Abstract2.1 En françaisLe séquençage du génome humain est un pré-requis fondamental à la compréhension de la biologie de l'être humain. Ce projet achevé, les scientifiques ont dû faire face à une tâche aussi importante, comprendre cette suite de 3 milliards de lettres qui compose notre génome. Le consortium ENCODE (ENCyclopedia Of Dna Elements) fût formé comme une suite logique au projet du génome humain. Son rôle est d'identifier tous les éléments fonctionnels de notre génome incluant les régions transcrites, les sites d'attachement des facteurs de transcription, les sites hypersensibles à la DNAse I ainsi que les marqueurs de modification des histones. Dans le cadre de ma thèse doctorale, j'ai participé à 2 sous-projets d'ENCODE. En premier lieu, j'ai eu la tâche de développer et d'optimiser une technique de validation expérimentale à haut rendement de modèles de gènes qui m'a permis d'estimer la qualité de la plus récente annotation manuelle. Ce nouveau processus de validation est bien plus efficace que la technique RNAseq qui est actuellement en train de devenir la norme. Cette technique basée sur la RT-PCR, m'a notamment permis de découvrir de nouveaux exons dans 10% des régions interrogées. En second lieu j'ai participé à une étude ayant pour but d'identifier les extrémités de tous les gènes des chromosomes humains 21 et 22. Cette étude à permis l'identification à large échelle de transcrits chimères comportant des séquences provenant de deux gènes distincts pouvant être à une grande distance l'un de autre.2.2 In EnglishThe completion of the human genome sequence js the prerequisite to fully understand the biology of human beings. This project achieved, scientists had to face another challenging task, understanding the meaning of the 3 billion letters composing this genome. As a logical continuation of the human genome project, the ENCODE (ENCyclopedia Of DNA Elements) consortium was formed with the aim of annotating all its functional elements. These elements include transcribed regions, transcription binding sites, DNAse I hypersensitive sites and histone modification marks. In the frame of my PhD thesis, I was involved in two sub-projects of ENCODE. Firstly I developed and optimized an high throughput method to validate gene models, which allowed me to assess the quality of the most recent manually-curated annotation. This novel experimental validation pipeline is extremely effective, far more so than transcriptome profiling through RNA sequencing, which is becoming the norm. This RT-PCR-seq targeted-approach is likewise particularly efficient in identifying novel exons, as we discovered about 10% of loci with unannotated exons. Secondly, I participated to a study aiming to identify the gene boundaries of all genes in the human chromosome 21 and 22. This study led to the identification of chimeric transcripts that are composed of sequences coming form two distinct genes that can be map far away from each other.
Resumo:
Human RNA polymerase (Pol) III-transcribed genes are thought to share a simple termination signal constituted by four or more consecutive thymidine residues in the coding DNA strand, just downstream of the RNA 3'-end sequence. We found that a large set of human tRNA genes (tDNAs) do not display any T(≥4) stretch within 50 bp of 3'-flanking region. In vitro analysis of tDNAs with a distanced T(≥4) revealed the existence of non-canonical terminators resembling degenerate T(≥5) elements, which ensure significant termination but at the same time allow for the production of Pol III read-through pre-tRNAs with unusually long 3' trailers. A panel of such non-canonical signals was found to direct transcription termination of unusual Pol III-synthesized viral pre-miRNA transcripts in gammaherpesvirus 68-infected cells. Genome-wide location analysis revealed that human Pol III tends to trespass into the 3'-flanking regions of tDNAs, as expected from extensive terminator read-through. The widespread occurrence of partial termination suggests that the Pol III primary transcriptome in mammals is unexpectedly enriched in 3'-trailer sequences with the potential to contribute novel functional ncRNAs.
Resumo:
BACKGROUND: A characteristic SYT-SSX fusion gene resulting from the chromosomal translocation t(X;18)(p11;q11) is detectable in almost all synovial sarcomas, a malignant soft tissue tumor widely believed to originate from as yet unidentified pluripotent stem cells. The resulting fusion protein has no DNA binding motifs but possesses protein-protein interaction domains that are believed to mediate association with chromatin remodeling complexes. Despite recent advances in the identification of molecules that interact with SYT-SSX and with the corresponding wild type SYT and SSX proteins, the mechanisms whereby the SYT-SSX might contribute to neoplastic transformation remain unclear. Epigenetic deregulation has been suggested to be one possible mechanism. METHODOLOGY/PRINCIPAL FINDINGS: We addressed the effect of SYT/SSX expression on the transcriptome of four independent isolates of primary human bone marrow mesenchymal stem cells (hMSC). We observed transcriptional changes similar to the gene expression signature of synovial sarcoma, principally involving genes whose regulation is linked to epigenetic factors, including imprinted genes, genes with transcription start sites within a CpG island and chromatin related genes. Single population analysis revealed hMSC isolate-specific transcriptional changes involving genes that are important for biological functions of stem cells as well as genes that are considered to be molecular markers of synovial sarcoma including IGF2, EPHRINS, and BCL2. Methylation status analysis of sequences at the H19/IGF2 imprinted locus indicated that distinct epigenetic features characterize hMSC populations and condition the transcriptional effects of SYT-SSX expression. CONCLUSIONS/SIGNIFICANCE: Our observations suggest that epigenetic features may define the cellular microenvironment in which SYT-SSX displays its functional effects.
Resumo:
The publication of a draft of the human genome and of large collections of transcribed sequences has made it possible to study the complex relationship between the transcriptome and the genome. In the work presented here, we have focused on mapping mRNA 3' ends onto the genome by use of the raw data generated by the expressed sequence tag (EST) sequencing projects. We find that at least half of the human genes encode multiple transcripts whose polyadenylation is driven by multiple signals. The corresponding transcript 3' ends are spread over distances in the kilobase range. This finding has profound implications for our understanding of gene expression regulation and of the diversity of human transcripts, for the design of cDNA microarray probes, and for the interpretation of gene expression profiling experiments.
Hypothalamus transcriptome profile suggests an anorexia-cachexia syndrome in the anx/anx mouse model
Resumo:
The anx/anx mouse displays poor appetite and lean appearance and is considered a good model for the study of anorexia nervosa. To identify new genes involved in feeding behavior and body weight regulation we performed an expression profiling in the hypothalamus of the anx/anx mice. Using commercial microarrays we detected 156 differentially expressed genes and validated 92 of those using TaqMan low-density arrays. The expression of a set of 87 candidate genes selected based on literature evidences was also quantified by TaqMan low-density arrays. Our results showed enrichment in deregulated genes involved in cell death, cell morphology and cancer as well as an alteration of several signaling circuits involved in energy balance including neuropeptide Y and melanocortin signaling. The expression profile along with the phenotype led us to conclude that anx/anx mice resemble the anorexia-cachexia syndrome typically observed in cancer, infection with human immunodeficiency virus or chronic diseases, rather than starvation, and that anx/anx mice could be considered a good model for the treatment and investigation of this condition.
Resumo:
Eukaryotic cells make many types of primary and processed RNAs that are found either in specific subcellular compartments or throughout the cells. A complete catalogue of these RNAs is not yet available and their characteristic subcellular localizations are also poorly understood. Because RNA represents the direct output of the genetic information encoded by genomes and a significant proportion of a cell's regulatory capabilities are focused on its synthesis, processing, transport, modification and translation, the generation of such a catalogue is crucial for understanding genome function. Here we report evidence that three-quarters of the human genome is capable of being transcribed, as well as observations about the range and levels of expression, localization, processing fates, regulatory regions and modifications of almost all currently annotated and thousands of previously unannotated RNAs. These observations, taken together, prompt a redefinition of the concept of a gene.
Resumo:
Cancer is one of the world's leading causes of death with a rising trend in incidence. These epidemiologic observations underline the need for novel treatment strategies. In this regard, a promising approach takes advantage of the adaptive effector mechanisms of the immune system, using T lymphocytes to specifically target and destroy tumour cells. However, whereas current approaches mainly depend on short-lived, terminally differentiated effector T cells, increasing evidence suggests that long lasting and maximum efficient immune responses are mediated by low differentiated memory T cells. These memory T cells should display characteristics of stem cells, such as longevity, self-renewal capacity and the ability to continuously give rise to further differentiated effectors. These stem celllike memory T (TSCM) cells are thought to be of key therapeutic value as they might not only attack differentiated tumour cells, but also eradicate the root cause of cancer, the cancer stem cells themselves. Thus, efforts are made to characterize TSCM cells and to identify the signalling pathways which mediate their induction. Recently, a human TSCM cell subset was described and the activation of the Wnt-ß-catenin signalling pathway by the drug TWS119 during naive CD8+ T (TN) cell priming was suggested to mediate their induction. However, a precise deciphering of the signalling pathways leading to TSCM cell induction and an in-depth characterization of in vitro induced and in vivo occurring TSCM cells remain to be performed. Here, evidence is presented that the induction of human and mouse CD8+ and CD4+ TSCM cells may be triggered by inhibition of mechanistic/mammalian target of rapamycin (mTOR) complex 1 with simultaneously active mTOR complex 2. This molecular mechanism arrests a fraction of activated TN cells in a stem cell-like differentiation state independently of the Wnt-ß-catenin signalling pathway. Of note, TWS119 was found to also inhibit mTORCl, thereby mediating the induction of TSCM cells. Suggesting an immunostimulatory effect, the acquired data broaden the therapeutic range of mTORCl inhibitors like rapamycin, which are, at present, exclusively used due to their immunosuppressive function. Furthermore, by performing broad metabolic analyses, a well-orchestrated interplay between intracellular signalling pathways and the T cells' metabolic programmes could be identified as important regulator of the T cells' differentiation fate. Moreover, in vitro induced CD4+ TSCM cells possess superior functional capacities and share fate-determining key factors with their naturally occurring counterparts, assessed by a first-time full transcriptome analysis of in vivo occurring CD4+ TN cell, TSCM cells and central memory (TCM) cells and in vitro induced CD4+ TSCM cells. Of interest, a group of 56 genes, with a unique expression profile in TSCM cells could be identified. Thus, a pharmacological mechanism allowing to confer sternness to activated TN cells has been found which might be highly relevant for the design of novel T cell-based cancer immunotherapies.
Resumo:
The transcriptional coactivator peroxisome proliferator-activated receptor-gamma coactivator 1 alpha (PGC-1α) is a chief activator of mitochondrial and metabolic programs and protects against atrophy in skeletal muscle (skm). Here we tested whether PGC-1α overexpression could restructure the transcriptome and metabolism of primary cultured human skm cells, which display a phenotype that resembles the atrophic phenotype. An oligonucleotide microarray analysis was used to reveal the effects of PGC-1α on the whole transcriptome. Fifty-three different genes showed altered expression in response to PGC-1α: 42 upregulated and 11 downregulated. The main gene ontologies (GO) associated with the upregulated genes were mitochondrial components and processes and this was linked with an increase in COX activity, an indicator of mitochondrial content. Furthermore, PGC-1α enhanced mitochondrial oxidation of palmitate and lactate to CO2, but not glucose oxidation. The other most significantly associated GOs for the upregulated genes were chemotaxis and cytokine activity, and several cytokines, including IL-8/CXCL8, CXCL6, CCL5 and CCL8, were within the most highly induced genes. Indeed, PGC-1α highly increased IL-8 cell protein content. The most upregulated gene was PVALB, which is related to calcium signaling. Potential metabolic regulators of fatty acid and glucose storage were among mainly regulated genes. The mRNA and protein level of FITM1/FIT1, which enhances the formation of lipid droplets, was raised by PGC-1α, while in oleate-incubated cells PGC-1α increased the number of smaller lipid droplets and modestly triglyceride levels, compared to controls. CALM1, the calcium-modulated δ subunit of phosphorylase kinase, was downregulated by PGC-1α, while glycogen phosphorylase was inactivated and glycogen storage was increased by PGC-1α. In conclusion, of the metabolic transcriptome deficiencies of cultured skm cells, PGC-1α rescued the expression of genes encoding mitochondrial proteins and FITM1. Several myokine genes, including IL-8 and CCL5, which are known to be constitutively expressed in human skm cells, were induced by PGC-1α.
Resumo:
High levels of HIV-1 replication during the chronic phase of infection usually correlate with rapid progression to severe immunodeficiency. However, a minority of highly viremic individuals remains asymptomatic and maintains high CD4+ T cell counts. This tolerant profile is poorly understood and reminiscent of the widely studied nonprogressive disease model of SIV infection in natural hosts. Here, we identify transcriptome differences between rapid progressors (RPs) and viremic nonprogressors (VNPs) and highlight several genes relevant for the understanding of HIV-1-induced immunosuppression. RPs were characterized by a specific transcriptome profile of CD4+ and CD8+ T cells similar to that observed in pathogenic SIV-infected rhesus macaques. In contrast, VNPs exhibited lower expression of interferon-stimulated genes and shared a common gene regulation profile with nonpathogenic SIV-infected sooty mangabeys. A short list of genes associated with VNP, including CASP1, CD38, LAG3, TNFSF13B, SOCS1, and EEF1D, showed significant correlation with time to disease progression when evaluated in an independent set of CD4+ T cell expression data. This work characterizes 2 minimally studied clinical patterns of progression to AIDS, whose analysis may inform our understanding of HIV pathogenesis.
Resumo:
The transcriptome is the readout of the genome. Identifying common features in it across distant species can reveal fundamental principles. To this end, the ENCODE and modENCODE consortia have generated large amounts of matched RNA-sequencing data for human, worm and fly. Uniform processing and comprehensive annotation of these data allow comparison across metazoan phyla, extending beyond earlier within-phylum transcriptome comparisons and revealing ancient, conserved features. Specifically, we discover co-expression modules shared across animals, many of which are enriched in developmental genes. Moreover, we use expression patterns to align the stages in worm and fly development and find a novel pairing between worm embryo and fly pupae, in addition to the embryo-to-embryo and larvae-to-larvae pairings. Furthermore, we find that the extent of non-canonical, non-coding transcription is similar in each organism, per base pair. Finally, we find in all three organisms that the gene-expression levels, both coding and non-coding, can be quantitatively predicted from chromatin features at the promoter using a 'universal model' based on a single set of organism-independent parameters.
Resumo:
AIM: Heart disease is recognized as a consequence of dysregulation of cardiac gene regulatory networks. Previously, unappreciated components of such networks are the long non-coding RNAs (lncRNAs). Their roles in the heart remain to be elucidated. Thus, this study aimed to systematically characterize the cardiac long non-coding transcriptome post-myocardial infarction and to elucidate their potential roles in cardiac homoeostasis. METHODS AND RESULTS: We annotated the mouse transcriptome after myocardial infarction via RNA sequencing and ab initio transcript reconstruction, and integrated genome-wide approaches to associate specific lncRNAs with developmental processes and physiological parameters. Expression of specific lncRNAs strongly correlated with defined parameters of cardiac dimensions and function. Using chromatin maps to infer lncRNA function, we identified many with potential roles in cardiogenesis and pathological remodelling. The vast majority was associated with active cardiac-specific enhancers. Importantly, oligonucleotide-mediated knockdown implicated novel lncRNAs in controlling expression of key regulatory proteins involved in cardiogenesis. Finally, we identified hundreds of human orthologues and demonstrate that particular candidates were differentially modulated in human heart disease. CONCLUSION: These findings reveal hundreds of novel heart-specific lncRNAs with unique regulatory and functional characteristics relevant to maladaptive remodelling, cardiac function and possibly cardiac regeneration. This new class of molecules represents potential therapeutic targets for cardiac disease. Furthermore, their exquisite correlation with cardiac physiology renders them attractive candidate biomarkers to be used in the clinic.
Resumo:
Our understanding of the pathogenesis of organ‐specific autoinflammation has been restricted by limited access to the target organs. Peripheral blood, however, as a preferred transportation route for immune cells, provides a window to assess the entire immune system throughout the body. Transcriptional profiling with RNA stabilizing blood collection tubes reflects in vivo expression profiles at the time the blood is drawn, allowing detection of the disease activity in different samples or within the same sample over time. The main objective of this Ph.D. study was to apply gene‐expression microarrays in the characterization of peripheral blood transcriptional profiles in patients with autoimmune diseases. To achieve this goal a custom cDNA microarray targeted for gene‐expression profiling of human immune system was designed and produced. Sample collection and preparation was then optimized to allow gene‐expression profiling from whole‐blood samples. To overcome challenges resulting from minute amounts of sample material, RNA amplification was successfully applied to study pregnancy related immunosuppression in patients with multiple sclerosis (MS). Furthermore, similar sample preparation was applied to characterize longitudinal genome‐wide expression profiles in children with type 1 diabetes (T1D) associated autoantibodies and eventually clinical T1D. Blood transcriptome analyses, using both the ImmunoChip cDNA microarray with targeted probe selection and genome‐wide Affymetrix U133 Plus 2.0 oligonucleotide array, enabled monitoring of autoimmune activity. Novel disease related genes and general autoimmune signatures were identified. Notably, down‐regulation of the HLA class Ib molecules in peripheral blood was associated with disease activity in both MS and T1D. Taken together, these studies demonstrate the potential of peripheral blood transcriptional profiling in biomedical research and diagnostics. Imbalances in peripheral blood transcriptional activity may reveal dynamic changes that are relevant for the disease but might be completely missed in conventional cross‐sectional studies.
Resumo:
Activated T helper (Th) cells have ability to differentiate into functionally distinct Th1, Th2 and Th17 subsets through a series of overlapping networks that include signaling and transcriptional control and the epigenetic mechanisms to direct immune responses. However, inappropriate execution in the differentiation process and abnormal function of these Th cells can lead to the development of several immune mediated diseases. Therefore, the thesis aimed at identifying genes and gene regulatory mechanisms responsible for Th17 differentiation and to study epigenetic changes associated with early stage of Th1/Th2 cell differentiation. Genome wide transcriptional profiling during early stages of human Th17 cell differentiation demonstrated differential regulation of several novel and currently known genes associated with Th17 differentiation. Selected candidate genes were further validated at protein level and their specificity for Th17 as compared to other T helper subsets was analyzed. Moreover, combination of RNA interference-mediated downregulation of gene expression, genome-wide transcriptome profiling and chromatin immunoprecipitation followed by massive parallel sequencing (ChIP-seq), combined with computational data integration lead to the identification of direct and indirect target genes of STAT3, which is a pivotal upstream transcription factor for Th17 cell polarization. Results indicated that STAT3 directly regulates the expression of several genes that are known to play a role in activation, differentiation, proliferation, and survival of Th17 cells. These results provide a basis for constructing a network regulating gene expression during early human Th17 differentiation. Th1 and Th2 lineage specific enhancers were identified from genome-wide maps of histone modifications generated from the cells differentiating towards Th1 and Th2 lineages at 72h. Further analysis of lineage-specific enhancers revealed known and novel transcription factors that potentially control lineage-specific gene expression. Finally, we found an overlap of a subset of enhancers with SNPs associated with autoimmune diseases through GWASs suggesting a potential role for enhancer elements in the disease development. In conclusion, the results obtained have extended our knowledge of Th differentiation and provided new mechanistic insights into dysregulation of Th cell differentiation in human immune mediated diseases.