25 results for "Annotation de génomes"


Relevance: 10.00%

Abstract:

Schistosoma mansoni is one of the agents of schistosomiasis, a chronic and debilitating disease. Here we present a transcriptome-wide characterization of adult S. mansoni males by high-throughput RNA sequencing. We obtained 1,620,432 high-quality ESTs from a directional strand-specific cDNA library, resulting in 26% higher coverage of genome bases than that of the public ESTs available at NCBI. With 15x-deep coverage of transcribed genomic regions, our data were able to (i) confirm for the first time 990 predictions without previous evidence of transcription; (ii) correct gene predictions; and (iii) discover 989 and 1,196 RNA-seq contigs that map to intergenic and intronic genomic regions, respectively, where no gene had been predicted before. These contigs could represent new protein-coding genes or non-coding RNAs (ncRNAs). Interestingly, we identified 11 novel micro-exon genes (MEGs). These data reveal new features of the S. mansoni transcriptional landscape and significantly advance our understanding of the parasite transcriptome. (c) 2011 Elsevier Inc. All rights reserved.

Relevance: 10.00%

Abstract:

The endemic marine sponge Arenosclera brasiliensis (Porifera, Demospongiae, Haplosclerida) is a known source of secondary metabolites such as arenosclerins A-C. In the present study, we established the composition of the A. brasiliensis microbiome and the metabolic pathways associated with this community. We used 454 shotgun pyrosequencing to generate approximately 640,000 high-quality sponge-derived sequences (~150 Mb). Clustering analysis including sponge, seawater and twenty-three other metagenomes derived from marine animal microbiomes shows that A. brasiliensis contains a specific microbiome. Fourteen bacterial phyla (including Proteobacteria, Cyanobacteria, Actinobacteria, Bacteroidetes, Firmicutes and Chloroflexi) were consistently found in the A. brasiliensis metagenomes. The A. brasiliensis microbiome is enriched for Betaproteobacteria (e.g., Burkholderia) and Gammaproteobacteria (e.g., Pseudomonas and Alteromonas) compared with the surrounding planktonic microbial communities. Functional analysis based on Rapid Annotation using Subsystem Technology (RAST) indicated that the A. brasiliensis microbiome is enriched for sequences associated with membrane transport and one-carbon metabolism. In addition, there was an overrepresentation of sequences associated with aerobic and anaerobic metabolism as well as the synthesis and degradation of secondary metabolites. This study represents the first analysis of sponge-associated microbial communities via shotgun pyrosequencing, a strategy commonly applied in similar analyses in other marine invertebrate hosts, such as corals and algae. We demonstrate that A. brasiliensis has a unique microbiome that is distinct from that of the surrounding planktonic microbes and from other marine organisms, indicating a species-specific microbiome.

Relevance: 10.00%

Abstract:

Background: The ongoing efforts to sequence the honey bee genome require additional initiatives to define its transcriptome. Towards this end, we employed the Open Reading Frame ESTs (ORESTES) strategy to generate profiles for the life cycle of Apis mellifera workers. Results: Of the 5,021 ORESTES, 35.2% matched previously deposited Apis ESTs. The analysis of the remaining sequences defined a set of putative orthologs, most of which had their best-match hits with Anopheles and Drosophila genes. CAP3 assembly of the Apis ORESTES with the already existing 15,500 Apis ESTs generated 3,408 contigs. BLASTX comparison of these contigs with protein sets of organisms representing distinct phylogenetic clades revealed a total of 1,629 contigs that Apis mellifera shares with different taxa. Most (41%) represent genes that are common to all taxa, another 21% are shared between metazoans (Bilateria), and 16% are shared only within the Insecta clade. A set of 23 putative genes presented a best match with human genes, many of which encode factors related to cell signaling/signal transduction. A total of 1,779 contigs (52%) did not match any known sequence. Applying a correction factor deduced from a parallel analysis performed with Drosophila melanogaster ORESTES, we estimate that approximately half of these no-match EST contigs (22%) should represent Apis-specific genes. Conclusions: The versatile and cost-efficient ORESTES approach produced minilibraries for honey bee life cycle stages. Such information on central gene regions contributes to genome annotation and also lends itself to cross-transcriptome comparisons to reveal evolutionary trends in insect genomes.
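
The contig counts reported in the abstract above can be cross-checked with a few lines of arithmetic. The sketch below is only an illustration: the variable names are hypothetical, and the only inputs are the figures quoted in the abstract (3,408 contigs in total, 1,629 shared with other taxa, 1,779 without a match).

```python
# Figures quoted in the abstract; the variable names are illustrative only.
total_contigs = 3408      # contigs produced by the CAP3 assembly
shared_contigs = 1629     # contigs with a BLASTX match in other taxa
no_match_contigs = 1779   # contigs without any known match

# The two categories should account for every assembled contig.
assert shared_contigs + no_match_contigs == total_contigs

# Share of contigs without a match, rounded to a whole percentage.
no_match_pct = round(100 * no_match_contigs / total_contigs)
print(f"No-match contigs: {no_match_pct}% of {total_contigs}")
```

The two categories add up to the assembly total, and the no-match share rounds to the 52% stated in the abstract.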

Relevance: 10.00%

Abstract:

Background: The mitochondrial DNA of kinetoplastid flagellates is distinctive in the eukaryotic world due to its massive size, complex form and large sequence content. Composed of catenated maxicircles that contain rRNA and protein-coding genes and thousands of heterogeneous minicircles encoding small guide RNAs, the kinetoplast network has evolved along with an extreme form of mRNA processing in the form of uridine insertion and deletion RNA editing. Many maxicircle-encoded mRNAs cannot be translated without this post-transcriptional sequence modification. Results: We present the complete sequence and annotation of the Trypanosoma cruzi maxicircles for the CL Brener and Esmeraldo strains. Gene order is syntenic with Trypanosoma brucei and Leishmania tarentolae maxicircles. The non-coding components have strain-specific repetitive regions and a variable region that is unique for each strain with the exception of a conserved sequence element that may serve as an origin of replication, but shows no sequence identity with L. tarentolae or T. brucei. Alternative assemblies of the variable region demonstrate intra-strain heterogeneity of the maxicircle population. The extent of mRNA editing required for particular genes approximates that seen in T. brucei. Extensively edited genes were more divergent among the genera than non-edited and rRNA genes. Esmeraldo contains a unique 236-bp deletion that removes the 5'-ends of ND4 and CR4 and the intergenic region. Esmeraldo shows additional insertions and deletions outside of areas edited in other species in ND5, MURF1, and MURF2, while CL Brener has a distinct insertion in MURF2. Conclusion: The CL Brener and Esmeraldo maxicircles represent two of three previously defined maxicircle clades and promise utility as taxonomic markers. Restoration of the disrupted reading frames might be accomplished by strain-specific RNA editing. Elements in the non-coding region may be important for replication, transcription, and anchoring of the maxicircle within the kinetoplast network.

Relevance: 10.00%

Abstract:

Background: Plasmodium vivax is the most widely distributed human malaria parasite, responsible for 70–80 million clinical cases each year and a large socioeconomic burden for countries such as Brazil, where it is the most prevalent species. Unfortunately, because this parasite cannot be grown in continuous in vitro culture, research on P. vivax remains largely neglected. Methods: A pilot survey of expressed sequence tags (ESTs) from the asexual blood stages of P. vivax was performed. To do so, 1,184 clones from a cDNA library constructed with parasites obtained from 10 different human patients in the Brazilian Amazon were sequenced. Sequences were automatically processed to remove contaminants and low-quality reads. A total of 806 sequences with an average length of 586 bp met these criteria, and their clustering revealed 666 distinct events. The consensus sequence of each cluster and the unique sequences of the singlets were used in similarity searches against different databases that included P. vivax, Plasmodium falciparum, Plasmodium yoelii, Plasmodium knowlesi, Apicomplexa and the GenBank non-redundant database. An E-value of <10^-30 was used to define a significant database match. ESTs were manually assigned a gene ontology (GO) terminology. Results: A total of 769 ESTs could be assigned a putative identity based upon sequence similarity to known proteins in GenBank. Moreover, 292 ESTs were annotated and a GO terminology was assigned to 164 of them. Conclusion: These are the first ESTs reported for P. vivax and, as such, they represent a valuable resource to assist in the annotation of the P. vivax genome currently being sequenced. Moreover, since the GC content of the P. vivax genome is strikingly different from that of P. falciparum, these ESTs will help in the validation of gene predictions for P. vivax and in the creation of a gene index of this malaria parasite.
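
The significance cutoff described in the Methods above can be illustrated with a minimal sketch. It assumes similarity-search hits are available in BLAST tabular format (the 12-column `-outfmt 6` layout, where the 11th column is the E-value). The abstract does not state the file format or tooling actually used, so the function name, the sample identifiers and the data below are hypothetical.

```python
import csv
import io

E_VALUE_CUTOFF = 1e-30  # threshold quoted in the abstract


def significant_hits(tabular_text, cutoff=E_VALUE_CUTOFF):
    """Return {query_id: (subject_id, evalue)} with the best hit below cutoff.

    Assumes BLAST tabular output with 12 tab-separated columns
    (query id in column 1, subject id in column 2, E-value in column 11).
    """
    best = {}
    for row in csv.reader(io.StringIO(tabular_text), delimiter="\t"):
        if len(row) < 12:
            continue  # skip blank or malformed lines
        query, subject, evalue = row[0], row[1], float(row[10])
        if evalue >= cutoff:
            continue  # not significant under the stated threshold
        if query not in best or evalue < best[query][1]:
            best[query] = (subject, evalue)
    return best


# Hypothetical hits: est_001 has two hits, est_002 fails the cutoff.
example = "\n".join(
    "\t".join(fields)
    for fields in [
        ["est_001", "protA", "98.0", "200", "4", "0", "1", "200", "1", "200", "1e-45", "180"],
        ["est_001", "protB", "99.0", "200", "2", "0", "1", "200", "1", "200", "1e-60", "240"],
        ["est_002", "protC", "60.0", "150", "60", "2", "1", "150", "5", "154", "1e-10", "55"],
    ]
)

hits = significant_hits(example)
print(hits)
```

Only `est_001` survives the filter here, with its lower-E-value hit retained. Keeping a single best hit per query is a design choice of this sketch, not something the abstract specifies.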

Relevance: 10.00%

Abstract:

Background: The CHD7 (Chromodomain Helicase DNA binding protein 7) gene encodes a member of the chromodomain family of ATP-dependent chromatin remodeling enzymes. Mutations in the CHD7 gene are found in individuals with CHARGE, a syndrome characterized by multiple birth malformations in several tissues. CHD7 was identified as a binding partner of the PBAF complex (Polybromo and BRG Associated Factor containing complex), playing a central role in the transcriptional reprogramming process associated with the formation of the multipotent migratory neural crest, a transient cell population associated with the genesis of various tissues. CHD7 is a large gene containing 38 annotated exons and spanning 200 kb of genomic sequence. Although genes containing such a number of exons are expected to have several alternative transcripts, there is very little evidence of alternative transcripts associated with CHD7 to date, indicating that alternative splicing of this gene is poorly characterized. Findings: Here, we report the cloning and characterization, by experimental and computational studies, of a novel alternative transcript of human CHD7 (named CHD7 CRA_e), which lacks most of its coding exons. We confirmed by overexpression of the CHD7 CRA_e alternative transcript that it is translated into a protein isoform lacking most of the domains displayed by the canonical isoform. Expression of the CHD7 CRA_e transcript was detected in normal liver, in addition to the DU145 human prostate carcinoma cell line from which it was originally isolated. Conclusions: Our findings indicate that the splicing event associated with the CHD7 CRA_e alternative transcript is functional. The characterization of the novel CHD7 CRA_e isoform presented here not only sets the basis for more detailed functional studies of this isoform, but also contributes to the alternative splicing annotation of the CHD7 gene and the design of future functional studies aimed at elucidating the molecular functions of its gene products.

Relevance: 10.00%

Abstract:

Background: T. harzianum is well known for its biocontrol activity against many plant pathogens. However, there is a lack of studies concerning its use as a biological control agent against F. solani, a pathogen involved in several crop diseases. In this study, we used subtractive library hybridization (SSH) and quantitative real-time PCR (RT-qPCR) techniques to explore changes in T. harzianum gene expression during growth on cell wall of F. solani (FSCW) or glucose. RT-qPCR was also used to examine the regulation of 18 genes, potentially involved in biocontrol, during confrontation between T. harzianum and F. solani. Results: Data obtained from two subtractive libraries were compared after annotation using the Blast2GO suite. A total of 417 and 78 readable EST sequences were annotated in the FSCW and glucose libraries, respectively. Functional annotation of these genes identified diverse biological processes and molecular functions required during T. harzianum growth on FSCW or glucose. We identified various genes of biotechnological value encoding proteins with functions such as transport, hydrolytic activity, adherence, appressorium development and pathogenesis. Fifteen genes were up-regulated and sixteen were down-regulated at one or more time points during growth of T. harzianum in FSCW. During the confrontation assay most of the genes were up-regulated, mainly after contact, when the interaction had been established. Conclusions: This study demonstrates that T. harzianum expressed different genes when grown on FSCW compared to glucose. It provides insights into the mechanisms of gene expression involved in mycoparasitism of T. harzianum against F. solani. The identification and evaluation of these genes may contribute to the development of an efficient biological control agent.

Relevance: 10.00%

Abstract:

Background: The insect exoskeleton provides shape, waterproofing, and locomotion via attached somatic muscles. The exoskeleton is renewed during molting, a process regulated by ecdysteroid hormones. The holometabolous pupa transforms into an adult during the imaginal molt, when the epidermis synthesizes the definitive exoskeleton that then differentiates progressively. An important issue in insect development concerns how the exoskeletal regions are constructed to provide their morphological, physiological and mechanical functions. We used whole-genome oligonucleotide microarrays to screen for genes involved in exoskeletal formation in the honeybee thoracic dorsum. Our analysis included three sampling times during the pupal-to-adult molt, i.e., before, during and after the ecdysteroid-induced apolysis that triggers synthesis of the adult exoskeleton. Results: Gene ontology annotation based on orthologous relationships with Drosophila melanogaster genes placed the honeybee differentially expressed genes (DEGs) into distinct categories of Biological Process and Molecular Function, depending on developmental time, revealing the functional elements required for adult exoskeleton formation. Of the 1,253 unique DEGs, 547 were upregulated in the thoracic dorsum after apolysis, suggesting induction by the ecdysteroid pulse. The upregulated gene set included 20 of the 47 cuticular protein (CP) genes that were previously identified in the honeybee genome, and three novel putative CP genes that do not belong to a known CP family. In situ hybridization showed that two of the novel genes were abundantly expressed in the epidermis during adult exoskeleton formation, strongly implicating them as genuine CP genes. Conserved sequence motifs identified the CP genes as members of the CPR, Tweedle, Apidermin, CPF, CPLCP1 and Analogous-to-Peritrophins families. Furthermore, 28 of the 36 muscle-related DEGs were upregulated during the de novo formation of striated fibers attached to the exoskeleton. A search for cis-regulatory motifs in the 5′-untranslated region of the DEGs revealed potential binding sites for known transcription factors. Construction of a regulatory network showed that various upregulated CP- and muscle-related genes (15 and 21 genes, respectively) share common elements, suggesting co-regulation during thoracic exoskeleton formation. Conclusions: These findings help reveal molecular aspects of rigid thoracic exoskeleton formation during the ecdysteroid-coordinated pupal-to-adult molt in the honeybee.

Relevance: 10.00%

Abstract:

Genome-wide association studies have failed to establish common variant risk for the majority of common human diseases. The underlying reasons for this failure are explained by recent studies of resequencing and comparison of over 1,200 human genomes and 10,000 exomes, together with the delineation of DNA methylation patterns (epigenome) and full characterization of the coding and noncoding RNAs being transcribed (transcriptome). These studies have provided the most comprehensive catalogues of functional elements and genetic variants that are now available for global integrative analysis and experimental validation in prospective cohort studies. With these datasets, researchers will have unparalleled opportunities for the alignment, mining, and testing of hypotheses for the roles of specific genetic variants, including copy number variations, single nucleotide polymorphisms, and indels, as the cause of specific phenotypes and diseases. Through the use of next-generation sequencing technologies for genotyping and standardized ontological annotation to systematically analyze the effects of genomic variation on human and model organism phenotypes, we will be able to find candidate genes and new clues to disease etiology and treatment. This article describes essential concepts in genetics and genomic technologies as well as the emerging computational framework to comprehensively search websites and platforms available for the analysis and interpretation of genomic data.

Relevance: 10.00%

Abstract:

Surprisingly little is known of the toxic arsenal of cnidarian nematocysts compared to other venomous animals. Here we investigate the toxins of nematocysts isolated from the jellyfish Olindias sambaquiensis. A total of 29 unique MS/MS events were annotated as potential toxins homologous to the toxic proteins from diverse animal phyla, including cone snails, snakes, spiders, scorpions, wasps, bees, parasitic worms and other Cnidaria. These potential toxins include cytolysins, neurotoxins, phospholipases and toxic peptidases. The presence of several toxic enzymes is intriguing, such as sphingomyelin phosphodiesterase B (SMase B), which has only been described in certain spider venoms, and a prepro-haystatin P-IIId snake venom metalloproteinase (SVMP) that activates coagulation factor X, which is very rare even in snake venoms. Our annotation reveals sequence orthologs to many representatives of the most important superfamilies of peptide venoms, suggesting that their origins in higher organisms arise from deep eumetazoan innovations. Accordingly, cnidarian venoms may possess unique biological properties that might generate new leads in the discovery of novel pharmacologically active drugs.