32 resultados para transcriptome
Resumo:
Large-scale gene discovery has been performed for the grass fungal endophytes Neotyphodium coenophialum, Neotyphodium lolii, and Epichloe festucae. The resulting sequences have been annotated by comparison with public DNA and protein sequence databases and using intermediate gene ontology annotation tools. Endophyte sequences have also been analysed for the presence of simple sequence repeat and single nucleotide polymorphism molecular genetic markers. Sequences and annotation are maintained within a MySQL database that may be queried using a custom web interface. Two cDNA-based microarrays have been generated from this genome resource, They permit the interrogation of 3806 Neotyphodium genes (Nchip (TM) rnicroarray), and 4195 Neotyphodium and 920 Epichloe genes (EndoChip (TM) microarray), respectively. These microarrays provide tools for high-throughput transcriptome analysis, including genome-specific gene expression studies, profiling of novel endophyte genes, and investigation of the host grass-symbiont interaction. Comparative transcriptome analysis in Neotyphodium and Epichloe was performed. (c) 2006 Elsevier
Resumo:
Traditional treatment of infectious diseases is based on compounds that kill or inhibit growth of bacteria. A major concern with this approach is the frequent development of resistance to antibiotics. The discovery of communication systems (quorum sensing systems) regulating bacterial virulence has afforded a novel opportunity to control infectious bacteria without interfering with growth. Compounds that can override communication signals have been found in the marine environment. Using Pseudomonas aeruginosa PAO1 as an example of an opportunistic human pathogen, we show that a synthetic derivate of natural furanone compounds can act as a potent antagonist of bacterial quorum sensing. We employed GeneChip((R)) microarray technology to identify furanone target genes and to map the quorum sensing regulon. The transcriptome analysis showed that the furanone drug specifically targeted quorum sensing systems and inhibited virulence factor expression. Application of the drug to P.aeruginosa biofilms increased bacterial susceptibility to tobramycin and SDS. In a mouse pulmonary infection model, the drug inhibited quorum sensing of the infecting bacteria and promoted their clearance by the mouse immune response.
Resumo:
Background: A major goal in the post-genomic era is to identify and characterise disease susceptibility genes and to apply this knowledge to disease prevention and treatment. Rodents and humans have remarkably similar genomes and share closely related biochemical, physiological and pathological pathways. In this work we utilised the latest information on the mouse transcriptome as revealed by the RIKEN FANTOM2 project to identify novel human disease-related candidate genes. We define a new term patholog to mean a homolog of a human disease-related gene encoding a product ( transcript, anti-sense or protein) potentially relevant to disease. Rather than just focus on Mendelian inheritance, we applied the analysis to all potential pathologs regardless of their inheritance pattern. Results: Bioinformatic analysis and human curation of 60,770 RIKEN full-length mouse cDNA clones produced 2,578 sequences that showed similarity ( 70 - 85% identity) to known human-disease genes. Using a newly developed biological information extraction and annotation tool ( FACTS) in parallel with human expert analysis of 17,051 MEDLINE scientific abstracts we identified 182 novel potential pathologs. Of these, 36 were identified by computational tools only, 49 by human expert analysis only and 97 by both methods. These pathologs were related to neoplastic ( 53%), hereditary ( 24%), immunological ( 5%), cardio-vascular (4%), or other (14%), disorders. Conclusions: Large scale genome projects continue to produce a vast amount of data with potential application to the study of human disease. For this potential to be realised we need intelligent strategies for data categorisation and the ability to link sequence data with relevant literature. This paper demonstrates the power of combining human expert annotation with FACTS, a newly developed bioinformatics tool, to identify novel pathologs from within large-scale mouse transcript datasets.
Resumo:
Transcripts that lack any protein-coding potential represent at least half of the identified elements transcriptome. We review the evidence for the existence of such transcripts in the mammalian transcriptome, and argue that there may be many more noncoding RNAs (ncRNAs) still to be discovered. Relatively few ncRNA “genes” have been ascribed a function based upon mutation analysis. The review discusses possible roles of ncRNAs as cis-acting and trans-acting elements in epigenetic transcriptional control, including monoallelic gene silencing and imprinting. We also consider the evidence that the production of ncRNAs is a common feature of transcriptional enhancers.
Resumo:
A general overview of the protein sequence set for the mouse transcriptome produced during the FANTOM2 sequencing project is presented here. We applied different algorithms to characterize protein sequences derived from a nonredundant representative protein set (RPS) and a variant protein set (VPS) of the mouse transcriptome. The functional characterization and assignment of Gene Ontology terms was done by analysis of the proteome using InterPro. The Superfamily database analyses gave a detailed structural classification according to SCOP and provide additional evidence for the functional characterization of the proteome data. The MDS database analysis revealed new domains which are not presented in existing protein domain databases. Thus the transcriptome gives us a unique source of data for the detection of new functional groups. The data obtained for the RPS and VPS sets facilitated the comparison of different patterns of protein expression. A comparison of other existing mouse and human protein sequence sets (e.g., the International Protein Index) demonstrates the common patterns in mammalian proteornes. The analysis of the membrane organization within the transcriptome of multiple eukaryotes provides valuable statistics about the distribution of secretory and transmembrane proteins
Resumo:
With the completion of the human and mouse genome sequences, the task now turns to identifying their encoded transcripts and assigning gene function. In this study, we have undertaken a computational approach to identify and classify all of the protein kinases and phosphatases present in the mouse gene complement. A nonredundant set of these sequences was produced by mining Ensembl gene predictions and publicly available cDNA sequences with a panel of InterPro domains. This approach identified 561 candidate protein kinases and 162 candidate protein phosphatases. This cohort was then analyzed using TribeMCL protein sequence similarity clustering followed by CLUSTALV alignment and hierarchical tree generation. This approach allowed us to (1) distinguish between true members of the protein kinase and phosphatase families and enzymes of related biochemistry, (2) determine the structure of the families, and (3) suggest functions for previously uncharacterized members. The classifications obtained by this approach were in good agreement with previous schemes and allowed us to demonstrate domain associations with a number of clusters. Finally, we comment on the complementary nature of cDNA and genome-based gene detection and the impact of the FANTOM2 transcriptome project.
Resumo:
The current RIKEN transcript set represents a significant proportion of the mouse transcriptome but transcripts expressed in the innate and acquired immune systems are poorly represented. In the present study we have assessed the complexity of the transcriptome expressed in mouse macrophages before and after treatment with lipopolysaccharide, a global regulator of macrophage gene expression, using existing RIKEN 19K arrays. By comparison to array profiles of other cells and tissues, we identify a large set of macrophage-enriched genes, many of which have obvious functions in endocytosis and phagocytosis. In addition, a significant number of LPS-inducible genes were identified. The data suggest that macrophages are a complex source of mRNA for transcriptome studies. To assess complexity and identify additional macrophage expressed genes, cDNA libraries were created from purified populations of macrophage and dendritic cells, a functionally related cell type. Sequence analysis revealed a high incidence of novel mRNAs within these cDNA libraries. These studies provide insights into the depths of transcriptional complexity still untapped amongst products of inducible genes, and identify macrophage and dendritic cell populations as a starting point for sampling the inducible mammalian transcriptome.
Resumo:
Large numbers of noncoding RNA transcripts (ncRNAS) are being revealed by complementary DNA cloning and genome tiling array studies in animals. The big and as yet largely unanswered question is whether these transcripts are relevant. A paper by Willingham et al. shows the way forward by developing a strategy for large-scale functional screening of ncRNAs, involving small interfering RNA knockdowns in cell-based screens, which identified a previously unidentified ncRNA repressor of the transcription factor NFAT. It appears likely that ncRNAs constitute a critical hidden layer of gene regulation in complex organisms, the understanding of which requires new approaches in functional genomics.
Resumo:
Proteins secreted by and anchored on the surfaces of parasites are in intimate contact with host tissues. The transcriptome of infective cercariae of the blood fluke, Schistosoma mansoni, was screened using signal sequence trap to isolate cDNAs encoding predicted proteins with an N-terminal signal peptide. Twenty cDNA fragments were identified, most of which contained predicted signal peptides or transmembrane regions, including a novel putative seven-transmembrane receptor and a membrane-associated mitogen-activated protein kinase. The developmental expression pattern within different life-cycle stages ranged from ubiquitous to a transcript that was highly upregulated in the cercaria. A bioinformatics-based comparison of 100 signal peptides from each of schistosomes, humans, a parasitic nematode and Escherichia coli showed that differences in the sequence composition of signal peptides, notably the residues flanking the predicted cleavage site, might account for the negative bias exhibited in the processing of schistosome signal peptides in mammalian cells. (c) 2005 Federation of European Microbiological Societies. Published by Elsevier B.V. All rights reserved.
Resumo:
Expression of the mouse transcription factor EC (Tfec) is restricted to the myeloid compartment, suggesting a function for Tfec in the development or function of these cells. However, mice lacking Tfec develop normally, indicating a redundant role for Tfec in myeloid cell development. We now report that Tfec is specifically induced in bone marrow-derived macrophages upon stimulation with the Th2 cytokines, IL-4 and IL-13, or LPS. LPS induced a rapid and transient up-regulation of Tfec mRNA expression and promoter activity, which was dependent on a functional NF-kappa B site. IL-4, however, induced a rapid, but long-lasting, increase in Tfec mRNA, which, in contrast to LPS stimulation, also resulted in detectable levels of Tfec protein. IL-4-induced transcription of Tfec was absent in macrophages lacking Stat6, and its promoter depended on two functional Stat6-binding sites. A global comparison of IL-4-induced genes in both wild-type and Tfec mutant macrophages revealed a surprisingly mild phenotype with only a few genes affected by Tfec deficiency. These included the G-CSFR (Csf3r) gene that was strongly up-regulated by IL-4 in wild-type macrophages and, to a lesser extent, in Tfec mutant macrophages. Our study also provides a general definition of the transcriptome in alternatively activated mouse macrophages and identifies a large number of novel genes characterizing this cell type.
Resumo:
The mammalian transcriptome contains many nonprotein-coding RNAs (ncRNAs), but most of these are of unclear significance and lack strong sequence conservation, prompting suggestions that they might be non-functional. However, certain long functional ncRNAs such as Air and Xist are also poorly conserved. In this article, we systematically analyzed the conservation of several groups of functional ncRNAs, including miRNAs, snoRNAs and longer ncRNAs whose function has been either documented or confidently predicted. As expected, miRNAs and snoRNAs were highly conserved. By contrast, the longer functional non-micro, non-sno ncRNAs were much less conserved with many displaying rapid sequence evolution. Our findings suggest that longer ncRNAs are under the influence of different evolutionary constraints and that the lack of conservation displayed by the thousands of candidate ncRNAs does not necessarily signify an absence of function.
Resumo:
Sulfate plays an essential role in human growth and development, and its circulating levels are maintained by the renal Na+-SO42- cotransporter, NaS1. We previously generated a NaS1 knockout ( Nas1(-/-)) mouse, an animal model for hyposulfatemia, that exhibits reduced growth and liver abnormalities including hepatomegaly. In this study, we investigated the hepatic gene expression profile of Nas1(-/-) mice using oligonucleotide microarrays. The mRNA expression levels of 92 genes with known functional roles in metabolism, cell signaling, cell defense, immune response, cell structure, transcription, or protein synthesis were increased ( n = 51) or decreased ( n = 41) in Nas1(-/-) mice when compared with Nas1(-/-) mice. The most upregulated transcript levels in Nas1(-/-) mice were found for the sulfotransferase genes, Sult3a1 ( approximate to 500% increase) and Sult2a2 ( 100% increase), whereas the metallothionein-1 gene, Mt1, was among the most downregulated genes ( 70% decrease). Several genes involved in lipid and cholesterol metabolism, including Scd1, Acly, Gpam, Elov16, Acsl5, Mvd, Insig1, and Apoa4, were found to be upregulated ( >= 30% increase) in Nas1(+/+) mice. In addition, Nas1(+/+) mice exhibited increased levels of hepatic lipid ( approximate to 16% increase), serum cholesterol ( approximate to 20% increase), and low-density lipoprotein ( approximate to 100% increase) and reduced hepatic glycogen ( approximate to 50% decrease) levels. In conclusion, these data suggest an altered lipid and cholesterol metabolism in the hyposulfatemic Nas1(-/-) mouse and provide new insights into the metabolic state of the liver in Nas1(-/-) mice.
Resumo:
Membrane organization describes the orientation of a protein with respect to the membrane and can be determined by the presence, or absence, and organization within the protein sequence of two features: endoplasmic reticulum signal peptides and alpha-helical transmembrane domains. These features allow protein sequences to be classified into one of five membrane organization categories: soluble intracellular proteins, soluble secreted proteins, type I membrane proteins, type II membrane proteins, and multi- spanning membrane proteins. Generation of protein isoforms with variable membrane organizations can change a protein's subcellular localization or association with the membrane. Application of MemO, a membrane organization annotation pipeline, to the FANTOM3 Isoform Protein Sequence mouse protein set revealed that within the 8,032 transcriptional units ( TUs) with multiple protein isoforms, 573 had variation in their use of signal peptides, 1,527 had variation in their use of transmembrane domains, and 615 generated protein isoforms from distinct membrane organization classes. The mechanisms underlying these transcript variations were analyzed. While TUs were identified encoding all pairwise combinations of membrane organization categories, the most common was conversion of membrane proteins to soluble proteins. Observed within our highconfidence set were 156 TUs predicted to generate both extracellular soluble and membrane proteins, and 217 TUs generating both intracellular soluble and membrane proteins. The differential use of endoplasmic reticulum signal peptides and transmembrane domains is a common occurrence within the variable protein output of TUs. The generation of protein isoforms that are targeted to multiple subcellular locations represents a major functional consequence of transcript variation within the mouse transcriptome.
Resumo:
Recent large-scale analyses of mainly full-length cDNA libraries generated from a variety of mouse tissues indicated that almost half of all representative cloned sequences did flat contain ail apparent protein-coding sequence, and were putatively derived from non-protein-coding RNA (ncRNA) genes. However, many of these clones were singletons and the majority were unspliced, raising the possibility that they may be derived from genomic DNA or unprocessed pre-rnRNA contamination during library construction, or alternatively represent nonspecific transcriptional noise. Here we Show, using reverse transcriptase-dependent PCR, microarray, and Northern blot analyses, that many of these clones were derived from genuine transcripts Of unknown function whose expression appears to be regulated. The ncRNA transcripts have larger exons and fewer introns than protein-coding transcripts. Analysis of the genomic landscape around these sequences indicates that some cDNA clones were produced not from terminal poly(A) tracts but internal priming sites within longer transcripts, only a minority of which is encompassed by known genes. A significant proportion of these transcripts exhibit tissue-specific expression patterns, as well as dynamic changes in their expression in macrophages following lipopolysaccharide Stimulation. Taken together, the data provide strong support for the conclusion that ncRNAs are an important, regulated component of the mammalian transcriptome.
Resumo:
Increasing evidence suggests that the development and function of the nervous system is heavily dependent on RNA editing and the intricate spatiotemporal expression of a wide repertoire of non-coding RNAs, including micro RNAs, small nucleolar RNAs and longer non-coding RNAs. Non-coding RNAs may provide the key to understanding the multi-tiered links between neural development, nervous system function, and neurological diseases.