977 resultados para Transcriptome analysis
Resumo:
The transcriptome is the readout of the genome. Identifying common features in it across distant species can reveal fundamental principles. To this end, the ENCODE and modENCODE consortia have generated large amounts of matched RNA-sequencing data for human, worm and fly. Uniform processing and comprehensive annotation of these data allow comparison across metazoan phyla, extending beyond earlier within-phylum transcriptome comparisons and revealing ancient, conserved features. Specifically, we discover co-expression modules shared across animals, many of which are enriched in developmental genes. Moreover, we use expression patterns to align the stages in worm and fly development and find a novel pairing between worm embryo and fly pupae, in addition to the embryo-to-embryo and larvae-to-larvae pairings. Furthermore, we find that the extent of non-canonical, non-coding transcription is similar in each organism, per base pair. Finally, we find in all three organisms that the gene-expression levels, both coding and non-coding, can be quantitatively predicted from chromatin features at the promoter using a 'universal model' based on a single set of organism-independent parameters.
Resumo:
Turbot (Scophthalmus maximus L.) is an important aquacultural resource both in Europe and Asia. However, there is little information on gene sequences available in public databases. Currently, one of the main problems affecting the culture of this flatfish is mortality due to several pathogens, especially viral diseases which are not treatable. In order to identify new genes involved in immune defense, we conducted 454-pyrosequencing of the turbot transcriptome after different immune stimulations.
Resumo:
Background: Serine proteases are major components of viper venom and target various stages of the blood coagulation system in victims and prey. A better understanding of the diversity of serine proteases and other enzymes present in snake venom will help to understand how the complexity of snake venom has evolved and will aid the development of novel therapeutics for treating snake bites. Methodology and Principal Findings: Four serine protease-encoding genes from the venom gland transcriptome of Bitis gabonica rhinoceros were amplified and sequenced. Mass spectrometry suggests the four enzymes corresponding to these genes are present in the venom of B. g. rhinoceros. Two of the enzymes, rhinocerases 2 and 3 have substitutions to two of the serine protease catalytic triad residues and are thus unlikely to be catalytically active, though they may have evolved other toxic functions. The other two enzymes, rhinocerases 4 and 5, have classical serine protease catalytic triad residues and thus are likely to be catalytically active, however they have glycine rather than the more typical aspartic acid at the base of the primary specificity pocket (position 189). Based on a detailed analysis of these sequences we suggest that alternative splicing together with individual amino acid mutations may have been involved in their evolution. Changes within amino acid segments which were previously proposed to undergo accelerated change in venom serine proteases have also been observed. Conclusions and Significance: Our study provides further insight into the diversity of serine protease isoforms present within snake venom and discusses their possible functions and how they may have evolved. These multiple serine protease isoforms with different substrate specificities may enhance the envenomation effects and help the snake to adapt to new habitats and diets. Our findings have potential for helping the future development of improved therapeutics for snake bites.
Resumo:
Snake venom glands are a rich source of bioactive molecules such as peptides, proteins and enzymes that show important pharmacological activity leading to in local and systemic effects as pain, edema, bleeding and muscle necrosis. Most studies on pharmacologically active peptides and proteins from snake venoms have been concerned with isolation and structure elucidation through methods of classical biochemistry. As an attempt to examine the transcripts expressed in the venom gland of Bothrops jararacussu and to unveil the toxicological and pharmacological potential of its products at the molecular level, we generated 549 expressed sequence tags (ESTs) from a directional cDNA library. Sequences obtained from single-pass sequencing of randomly selected cDNA clones could be identified by similarities searches on existing databases, resulting in 197 sequences with significant similarity to phospholipase A(2) (PLA(2)), of which 83.2% were Lys49-PLA(2) homologs (BOJU-1), 0.1% were basic Asp49-PLA(2)s (BOJU-II) and 0.6% were acidic Asp49-PLA(2)s (BOJU-III). Adjoining this very abundant class of proteins we found 88 transcripts codifying for putative sequences of metalloproteases, which after clustering and assembling resulted in three full-length sequences: BOJUMET-I, BOJUMET-II and BOJUMET-III; as well as 25 transcripts related to C-type lectin like protein including a full-length cDNA of a putative galactose binding C-type lectin and a cluster of eight serine-proteases transcripts including a full-length cDNA of a putative serine protease. Among the full-length sequenced clones we identified a nerve growth factor (Bj-NGF) with 92% identity with a human NGF (NGHUBM) and an acidic phospholipase A2 (BthA-I-PLA(2)) displaying 85-93% identity with other snake venom toxins. Genetic distance among PLA(2)s from Bothrops species were evaluated by phylogenetic analysis. Furthermore, analysis of full-length putative Lys49-PLA(2) through molecular modeling showed conserved structural domains, allowing the characterization of those proteins as group II PLA(2)s. The constructed cDNA library provides molecular clones harboring sequences that can be used to probe directly the genetic material from gland venom of other snake species. Expression of complete cDNAs or their modified derivatives will be useful for elucidation of the structure-function relationships of these toxins and peptides of biotechnological interest. (C) 2004 Elsevier SAS. All rights reserved.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Vampire bats are notorious for being the sole mammals that strictly feed on fresh blood for their survival. While their saliva has been historically associated with anticoagulants, only one antihemostatic (plasminogen activator) has been molecularly and functionally characterized. Here, RNAs from both principal and accessory submaxillary (submandibular) salivary glands of Desmodus rotundus were extracted, and ~. 200. million reads were sequenced by Illumina. The principal gland was enriched with plasminogen activators with fibrinolytic properties, members of lipocalin and secretoglobin families, which bind prohemostatic prostaglandins, and endonucleases, which cleave neutrophil-derived procoagulant NETs. Anticoagulant (tissue factor pathway inhibitor, TFPI), vasodilators (PACAP and C-natriuretic peptide), and metalloproteases (ADAMTS-1) were also abundantly expressed. Members of the TSG-6 (anti-inflammatory), antigen 5/CRISP, and CCL28-like (antimicrobial) protein families were also sequenced. Apyrases (which remove platelet agonist ADP), phosphatases (which degrade procoagulant polyphosphates), and sphingomyelinase were found at lower transcriptional levels. Accessory glands were enriched with antimicrobials (lysozyme, defensin, lactotransferrin) and protease inhibitors (TIL-domain, cystatin, Kazal). Mucins, heme-oxygenase, and IgG chains were present in both glands. Proteome analysis by nano LC-MS/MS confirmed that several transcripts are expressed in the glands. The database presented herein is accessible online at http://exon.niaid.nih.gov/transcriptome/D_rotundus/Supplemental-web.xlsx. These results reveal that bat saliva emerges as a novel source of modulators of vascular biology. Biological significance: Vampire bat saliva emerges as a novel source of antihemostatics which modulate several aspects of vascular biology. © 2013.
Resumo:
Abstract Background The ongoing efforts to sequence the honey bee genome require additional initiatives to define its transcriptome. Towards this end, we employed the Open Reading frame ESTs (ORESTES) strategy to generate profiles for the life cycle of Apis mellifera workers. Results Of the 5,021 ORESTES, 35.2% matched with previously deposited Apis ESTs. The analysis of the remaining sequences defined a set of putative orthologs whose majority had their best-match hits with Anopheles and Drosophila genes. CAP3 assembly of the Apis ORESTES with the already existing 15,500 Apis ESTs generated 3,408 contigs. BLASTX comparison of these contigs with protein sets of organisms representing distinct phylogenetic clades revealed a total of 1,629 contigs that Apis mellifera shares with different taxa. Most (41%) represent genes that are in common to all taxa, another 21% are shared between metazoans (Bilateria), and 16% are shared only within the Insecta clade. A set of 23 putative genes presented a best match with human genes, many of which encode factors related to cell signaling/signal transduction. 1,779 contigs (52%) did not match any known sequence. Applying a correction factor deduced from a parallel analysis performed with Drosophila melanogaster ORESTES, we estimate that approximately half of these no-match ESTs contigs (22%) should represent Apis-specific genes. Conclusions The versatile and cost-efficient ORESTES approach produced minilibraries for honey bee life cycle stages. Such information on central gene regions contributes to genome annotation and also lends itself to cross-transcriptome comparisons to reveal evolutionary trends in insect genomes.
Resumo:
We used 2D protein gel electrophoresis and DNA microarray technologies to systematically analyze genes under glucose repression in Bacillus subtilis. In particular, we focused on genes expressed after the shift from glycolytic to gluconeogenic at the middle logarithmic phase of growth in a nutrient sporulation medium, which remained repressed by the addition of glucose. We also examined whether or not glucose repression of these genes was mediated by CcpA, the catabolite control protein of this bacterium. The wild-type and ccpA1 cells were grown with and without glucose, and their proteomes and transcriptomes were compared. 2D gel electrophoresis allowed us to identify 11 proteins, the synthesis of which was under glucose repression. Of these proteins, the synthesis of four (IolA, I, S and PckA) was under CcpA-independent control. Microarray analysis enabled us to detect 66 glucose-repressive genes, 22 of which (glmS, acoA, C, yisS, speD, gapB, pckA, yvdR, yxeF, iolA, B, C, D, E, F, G, H, I, J, R, S and yxbF ) were at least partially under CcpA-independent control. Furthermore, we found that CcpA and IolR, a repressor of the iol divergon, were involved in the glucose repression of the synthesis of inositol dehydrogenase encoded by iolG included in the above list. The CcpA-independent glucose repression of the iol genes appeared to be explained by inducer exclusion.
Resumo:
The chromodomain is 40-50 amino acids in length and is conserved in a wide range of chromatic and regulatory proteins involved in chromatin remodeling. Chromodomain-containing proteins can be classified into families based on their broader characteristics, in particular the presence of other types of domains, and which correlate with different subclasses of the chromodomains themselves. Hidden Markov model (HMM)-generated profiles of different subclasses of chromodomains were used here to identify sequences encoding chromodomain-containing proteins in the mouse transcriptome and genome. A total of 36 different loci encoding proteins containing chromodomains, including 17 novel loci, were identified. Six of these loci (including three apparent pseudogenes, a novel HP1 ortholog, and two novel Msl-3 transcription factor-like proteins) are not present in the human genome, whereas the human genome contains four loci (two CDY orthologs and two apparent CDY pseuclogenes) that are not present in mouse. A number of these loci exhibit alternative splicing to produce different isoforms, including 43 novel variants, some of which lack the chromodomain. The likely functions of these proteins are discussed in relation to the known functions of other chromodomain-containing proteins within the same family.
Resumo:
The number of mammalian transcripts identified by full-length cDNA projects and genome sequencing projects is increasing remarkably. Clustering them into a strictly nonredundant and comprehensive set provides a platform for functional analysis of the transcriptome and proteome, but the quality of the clustering and predictive usefulness have previously required manual curation to identify truncated transcripts and inappropriate clustering of closely related sequences. A Representative Transcript and Protein Sets (RTPS) pipeline was previously designed to identify the nonredundant and comprehensive set of mouse transcripts based on clustering of a large mouse full-length cDNA set (FANTOM2). Here we propose an alternative method that is more robust, requires less manual curation, and is applicable to other organisms in addition to mouse. RTPSs of human, mouse, and rat have been produced by this method and used for validation. Their comprehensiveness and quality are discussed by comparison with other clustering approaches. The RTPSs are available at ftp://fantom2.gsc.riken.go.jp/RTPS/. (C). 2004 Elsevier Inc. All rights reserved.
Resumo:
Of the ~1.7 million SINE elements in the human genome, only a tiny number are estimated to be active in transcription by RNA polymerase (Pol) III. Tracing the individual loci from which SINE transcripts originate is complicated by their highly repetitive nature. By exploiting RNA-Seq datasets and unique SINE DNA sequences, we devised a bioinformatic pipeline allowing us to identify Pol III-dependent transcripts of individual SINE elements. When applied to ENCODE transcriptomes of seven human cell lines, this search strategy identified ~1300 Alu loci and ~1100 MIR loci corresponding to detectable transcripts, with ~120 and ~60 respectively Alu and MIR loci expressed in at least three cell lines. In vitro transcription of selected SINEs did not reflect their in vivo expression properties, and required the native 5’-flanking region in addition to internal promoter. We also identified a cluster of expressed AluYa5-derived transcription units, juxtaposed to snaR genes on chromosome 19, formed by a promoter-containing left monomer fused to an Alu-unrelated downstream moiety. Autonomous Pol III transcription was also revealed for SINEs nested within Pol II-transcribed genes raising the possibility of an underlying mechanism for Pol II gene regulation by SINE transcriptional units. Moreover the application of our bioinformatic pipeline to both RNA-seq data of cells subjected to an in vitro pro-oncogenic stimulus and of in vivo matched tumor and non-tumor samples allowed us to detect increased Alu RNA expression as well as the source loci of such deregulation. The ability to investigate SINE transcriptomes at single-locus resolution will facilitate both the identification of novel biologically relevant SINE RNAs and the assessment of SINE expression alteration under pathological conditions.
Resumo:
The carotid body (CB) is a major arterial chemoreceptor containing glomus cells that are activated by changes in arterial blood contents including oxygen. Despite significant advancement in the characterization of their physiological properties, our understanding on the underlying molecular machinery and signaling pathway in CB glomus cells is still limited.
To overcome these limitations, in chapter 1, I demonstrated the first transcriptome profile of CB glomus cells using single cell sequencing technology, which allowed us to uncover a set of abundantly expressed genes, including novel glomus cell-specific transcripts. These results revealed involvement of G protein-coupled receptor (GPCR) signaling pathway, various types of ion channels, as well as atypical mitochondrial subunits in CB function. I also identified ligands for the mostly highly expressed GPCR (Olfr78) in CB glomus cells and examined this receptor’s role in CB mediated hypoxic ventilatory response.
Current knowledge of CB suggest glomus cells rely on unusual mitochondria for their sensitivity to hypoxia. I previously identified the atypical mitochondrial subunit Ndufa4l2 as a highly over-represented gene in CB glomus cells. In chapter 2, to investigate the functional significance of Ndufa4l2 in CB function, I phenotyped both Ndufa4l2 knockout mice and mice with conditional Ndufa4l2 deletion in CB glomus cells. I found that Ndufa4l2 is essential to the establishment of regular breathing after birth. Ablating Ndufa4l2 in postnatal CB glomus cells resulted in defective CB sensitivity to hypoxia as well as CB mediated hypoxic ventilatory response. Together, our data showed that Ndufa4l2 is critical to respiratory control and the oxygen sensitivity of CB glomus cells.