647 resultados para Transcriptome
Resumo:
Insect oocytes grow in close association with the ovarian follicular epithelium (OFE), which escorts the oocyte during oogenesis and is responsible for synthesis and secretion of the eggshell. We describe a transcriptome of OFE of the triatomine bug Rhodnius prolixus, a vector of Chagas disease, to increase our knowledge of the role of FE in egg development. Random clones were sequenced from a cDNA library of different stages of follicle development. The transcriptome showed high commitment to transcription, protein synthesis, and secretion. The most abundant cDNA was a secreted (S) small, proline-rich protein with maximal expression in the vitellogenic follicle, suggesting a role in oocyte maturation. We also found Rp45, a chorion protein already described, and a putative chitin-associated cuticle protein that was an eggshell component candidate. Six transcripts coding for proteins related to the unfolded-protein response (UPR) by were chosen and their expression analyzed. Surprisingly, transcripts related to UPR showed higher expression during early stages of development and downregulation during late stages, when transcripts coding for S proteins participating in chorion formation were highly expressed. Several transcripts with potential roles in oogenesis and embryo development are also discussed. We propose that intense protein synthesis at the FE results in reticulum stress (RS) and that lowering expression of a set of genes related to cell survival should lead to degeneration of follicular cells at oocyte maturation. This paradoxical suppression of UPR suggests that ovarian follicles may represent an interesting model for studying control of RS and cell survival in professional S cell types. (C) 2011 Elsevier Ltd. All rights reserved.
Resumo:
The genome sequence of Aedes aegypti was recently reported. A significant amount of Expressed Sequence Tags (ESTs) were sequenced to aid in the gene prediction process. In the present work we describe an integrated analysis of the genomic and EST data, focusing on genes with preferential expression in larvae (LG), adults (AG) and in both stages (SG). A total of 913 genes (5.4% of the transcript complement) are LG, including ion transporters and cuticle proteins that are important for ion homeostasis and defense. From a starting set of 245 genes encoding the trypsin domain, we identified 66 putative LG, AG, and SG trypsins by manual curation. Phylogenetic analyses showed that AG trypsins are divergent from their larval counterparts (LG), grouping with blood-induced trypsins from Anopheles gambiae and Simulium vittatum. These results support the hypothesis that blood-feeding arose only once, in the ancestral Culicomorpha. Peritrophins are proteins that interlock chitin fibrils to form the peritrophic membrane (PM) that compartmentalizes the food in the midgut. These proteins are recognized by having chitin-binding domains with 6 conserved Cys and may also present mucin-like domains (regions expected to be highly O-glycosylated). PM may be formed by a ring of cells (type 2, seen in Ae. aegypti larvae and Drosophila melanogaster) or by most midgut cells (type 1, found in Ae. aegypti adult and Tribolium castaneum). LG and D. melanogaster peritrophins have more complex domain structures than AG and T. castaneum peritrophins. Furthermore, mucin-like domains of peritrophins from T. castaneum (feeding on rough food) are lengthier than those of adult Ae. aegypti (blood-feeding). This suggests, for the first time, that type 1 and type 2 PM may have variable molecular architectures determined by different peritrophins and/or ancillary proteins, which may be partly modulated by diet.
Resumo:
Schistosomiasis affects more than 200 million people worldwide; another 600 million are at risk of infection. The schistosomulum stage is believed to be the target of protective immunity in the attenuated cercaria vaccine model. In an attempt to identify genes up-regulated in the schistosomulum stage in relation to cercaria, we explored the Schistosoma mansoni transcriptome by looking at the relative frequency of reads in EST libraries from both stages. The 400 genes potentially up-regulated in schistosomula were analyzed as to their Gene Ontology categorization, and we have focused on those encoding-predicted proteins with no similarity to proteins of other organisms, assuming they could be parasite-specific proteins important for survival in the host. Up-regulation in schistosomulum relative to cercaria was validated with real-time reverse transcription polymerase chain reaction (RT-PCR) for five out of nine selected genes (56%). We tested their protective potential in mice through immunization with DNA vaccines followed by a parasite challenge. Worm burden reductions of 16-17% were observed for one of them, indicating its protective potential. Our results demonstrate the value and caveats of using stage-associated frequency of ESTs as an indication of differential expression coupled to DNA vaccine screening in the identification of novel proteins to be further investigated as potential vaccine candidates.
Resumo:
Background: Human infection by the pork tapeworm Taenia solium affects more than 50 million people worldwide, particularly in underdeveloped and developing countries. Cysticercosis which arises from larval encystation can be life threatening and difficult to treat. Here, we investigate for the first time the transcriptome of the clinically relevant cysticerci larval form. Results: Using Expressed Sequence Tags (ESTs) produced by the ORESTES method, a total of 1,520 high quality ESTs were generated from 20 ORESTES cDNA mini-libraries and its analysis revealed fragments of genes with promising applications including 51 ESTs matching antigens previously described in other species, as well as 113 sequences representing proteins with potential extracellular localization, with obvious applications for immune-diagnosis or vaccine development. Conclusion: The set of sequences described here will contribute to deciphering the expression profile of this important parasite and will be informative for the genome assembly and annotation, as well as for studies of intra- and inter-specific sequence variability. Genes of interest for developing new diagnostic and therapeutic tools are described and discussed.
Resumo:
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)
Resumo:
A detailed genome mapping analysis of 213,636 expressed sequence tags (EST) derived from nontumor and tumor tissues of the oral cavity, larynx, pharynx, and thyroid was done. Transcripts matching known human genes were identified; potential new splice variants were flagged and subjected to manual curation, pointing to 788 putatively new alternative splicing isoforms, the majority (75%) being insertion events. A subset of 34 new splicing isoforms (5% of 788 events) was selected and 23 (68%) were confirmed by reverse transcription-PCR and DNA sequencing. Putative new genes were revealed, including six transcripts mapped to well-studied chromosomes such as 22, as well as transcripts that mapped to 253 intergenic regions. In addition, 2,251 noncoding intronic RNAs, eventually involved in transcriptional regulation, were found. A set of 250 candidate markers for loss of heterozygosis or gene amplification was selected by identifying transcripts that mapped to genomic regions previously known to be frequently amplified or deleted in head, neck, and thyroid tumors. Three of these markers were evaluated by quantitative reverse transcription-PCR in an independent set of individual samples. Along with detailed clinical data about tumor origin, the information reported here is now publicly available on a dedicated Web site as a resource for further biological investigation. This first in silico reconstruction of the head, neck, and thyroid transcriptomes points to a wealth of new candidate markers that can be used for future studies on the molecular basis of these tumors. Similar analysis is warranted for a number of other tumors for which large EST data sets are available.
Resumo:
open reading frame expressed sequences tags (ORESTES) differ from conventional ESTs by providing sequence data from the central protein coding portion of transcripts. We generated a total of 696,745 ORESTES sequences from 24 human tissues and used a subset of the data that correspond to a set of 15,095 full-length mRNAs as a means of assessing the efficiency of the strategy and its potential contribution to the definition of the human transcriptome. We estimate that ORESTES sampled over 80% of all highly and moderately expressed, and between 40% and 50% of rarely expressed, human genes. In our most thoroughly sequenced tissue, the breast, the 130,000 ORESTES generated are derived from transcripts from an estimated 70% of all genes expressed in that tissue, with an equally efficient representation of both highly and poorly expressed genes. In this respect, we find that the capacity of the ORESTES strategy both for gene discovery and shotgun transcript sequence generation significantly exceeds that of conventional ESTs. The distribution of ORESTES is such that many human transcripts are now represented by a scaffold of partial sequences distributed along the length of each gene product. The experimental joining of the scaffold components, by reverse transcription-PCR, represents a direct route to transcript finishing that may represent a useful alternative to full-length cDNA cloning.
Resumo:
Snake venom glands are a rich source of bioactive molecules such as peptides, proteins and enzymes that show important pharmacological activity leading to in local and systemic effects as pain, edema, bleeding and muscle necrosis. Most studies on pharmacologically active peptides and proteins from snake venoms have been concerned with isolation and structure elucidation through methods of classical biochemistry. As an attempt to examine the transcripts expressed in the venom gland of Bothrops jararacussu and to unveil the toxicological and pharmacological potential of its products at the molecular level, we generated 549 expressed sequence tags (ESTs) from a directional cDNA library. Sequences obtained from single-pass sequencing of randomly selected cDNA clones could be identified by similarities searches on existing databases, resulting in 197 sequences with significant similarity to phospholipase A(2) (PLA(2)), of which 83.2% were Lys49-PLA(2) homologs (BOJU-1), 0.1% were basic Asp49-PLA(2)s (BOJU-II) and 0.6% were acidic Asp49-PLA(2)s (BOJU-III). Adjoining this very abundant class of proteins we found 88 transcripts codifying for putative sequences of metalloproteases, which after clustering and assembling resulted in three full-length sequences: BOJUMET-I, BOJUMET-II and BOJUMET-III; as well as 25 transcripts related to C-type lectin like protein including a full-length cDNA of a putative galactose binding C-type lectin and a cluster of eight serine-proteases transcripts including a full-length cDNA of a putative serine protease. Among the full-length sequenced clones we identified a nerve growth factor (Bj-NGF) with 92% identity with a human NGF (NGHUBM) and an acidic phospholipase A2 (BthA-I-PLA(2)) displaying 85-93% identity with other snake venom toxins. Genetic distance among PLA(2)s from Bothrops species were evaluated by phylogenetic analysis. Furthermore, analysis of full-length putative Lys49-PLA(2) through molecular modeling showed conserved structural domains, allowing the characterization of those proteins as group II PLA(2)s. The constructed cDNA library provides molecular clones harboring sequences that can be used to probe directly the genetic material from gland venom of other snake species. Expression of complete cDNAs or their modified derivatives will be useful for elucidation of the structure-function relationships of these toxins and peptides of biotechnological interest. (C) 2004 Elsevier SAS. All rights reserved.
Resumo:
Over 40,000 sugarcane (Saccharum officinarum) consensus sequences assembled from 237,954 expressed sequence tags were compared with the protein and DNA sequences from other angiosperms, including the genomes of Arabidopsis and rice (Oryza sativa). Approximately two-thirds of the sugarcane transcriptome have similar sequences in Arabidopsis. These sequences may represent a core set of proteins or protein domains that are conserved among monocots and eudicots and probably encode for essential angiosperm. functions. The remaining sequences represent putative monocot-specific genetic material, one-half of which were found only in sugarcane. These monocot-specific cDNAs represent either novelties or, in many cases, fast-evolving sequences that diverged substantially from their eudicot homologs. The wide comparative genome analysis presented here provides information on the evolutionary changes that underlie the divergence of monocots and eudicots. Our comparative analysis also led to the identification of several not yet annotated putative genes and possible gene loss events in Arabidopsis.
Resumo:
We report the results of a transcript finishing initiative, undertaken for the purpose of identifying and characterizing novel human transcripts, in which RT-PCR was used to bridge gaps between paired EST Clusters, mapped against the genomic sequence. Each pair of EST Clusters selected for experimental validation was designated a transcript finishing unit (TFU). A total of 489 TFUs were selected for validation, and an overall efficiency of 43.1% was achieved. We generated a total of 59,975 bp of transcribed sequences organized into 432 exons, contributing to the definition of the structure of 211 human transcripts. The structure of several transcripts reported here was confirmed during the course of this project, through the generation of their corresponding full-length cDNA sequences. Nevertheless, for 21% of the validated TFUs, a full-length cDNA sequence is not yet available in public databases, and the structure of 69.2% of these TFUs was not correctly predicted by computer programs. The TF strategy provides a significant contribution to the definition of the complete catalog of human genes and transcripts, because it appears to be particularly useful for identification of low abundance transcripts expressed in a restricted Set of tissues as well as for the delineation of gene boundaries and alternatively spliced isoforms.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Paracoccidioides brasiliensis is a fungal human pathogen with a wide distribution in Latin America. It causes paracoccidioidomycosis, the most widespread systemic mycosis in Latin America. Although gene expression in P. brasiliensis had been studied, little is known about the genome sequences expressed by this species during the infection process. To better understand the infection process, 4934 expressed sequence tags (ESTs) derived from a non-normalized cDNA library from P. brasiliensis (isolate Pb01) yeast-phase cells recovered from the livers of infected mice were annotated and clustered to a UniGene (clusters containing sequences that represent a unique gene) set with 1602 members. A large-scale comparative analysis was performed between the UniGene sequences of P. brasiliensis yeast-phase cells recovered from infected mice and a database constructed with sequences of the yeast-phase and mycelium transcriptome (isolate Pb01) (https://dna.biomol.unb.br/Pb/), as well as with all public ESTs available at GenBank, including sequences of the P. brasiliensis yeast-phase transcriptome (isolate Pb18) (http:// www.ncbi.nlm.nih.gov/). The focus was on the overexpressed and novel genes. From the total, 3184 ESTs (64.53%) were also present in the previously described transcriptome of yeast-form and mycelium cells obtained from in vitro cultures (https://dna.biomol.unb.br/Pb/) and of those, 1172 ESTs (23.75% of the described sequences) represented transcripts overexpressed during the infection process. Comparative analysis identified 1750 ESTs (35.47% of the total), comprising 649 UniGene sequences representing novel transcripts of P. brasiliensis, not previously described for this isolate or for other isolates in public databases. KEGG pathway mapping showed that the novel and overexpressed transcripts represented standard metabolic pathways, including glycolysis, amino acid biosynthesis, lipid and sterol metabolism. The unique and divergent representation of transcripts in the cDNA library of yeast cells recovered from infected mice suggests differential gene expression in response to the host milieu.
Resumo:
Whereas genome sequencing defines the genetic potential of an organism, transcript sequencing defines the utilization of this potential and links the genome with most areas of biology. To exploit the information within the human genome in the fight against cancer, we have deposited some two million expressed sequence tags (ESTs) from human tumors and their corresponding normal tissues in the public databases. The data currently define approximate to23,500 genes, of which only approximate to1,250 are still represented only by ESTs. Examination of the EST coverage of known cancer-related (CR) genes reveals that <1% do not have corresponding ESTs, indicating that the representation of genes associated with commonly studied tumors is high. The careful recording of the origin of all ESTs we have produced has enabled detailed definition of where the genes they represent are expressed in the human body. More than 100,000 ESTs are available for seven tissues, indicating a surprising variability of gene usage that has led to the discovery of a significant number of genes with restricted expression, and that may thus be therapeutically useful. The ESTs also reveal novel nonsynonymous germline variants (although the one-pass nature of the data necessitates careful validation) and many alternatively spliced transcripts. Although widely exploited by the scientific community, vindicating our totally open source policy, the EST data generated still provide extensive information that remains to be systematically explored, and that may further facilitate progress toward both the understanding and treatment of human cancers.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Background: Artificial selection has resulted in animal breeds with extreme phenotypes. As an organism is made up of many different tissues and organs, each with its own genetic programme, it is pertinent to ask: How relevant is tissue in terms of total transcriptome variability? Which are the genes most distinctly expressed between tissues? Does breed or sex equally affect the transcriptome across tissues?Results: In order to gain insight on these issues, we conducted microarray expression profiling of 16 different tissues from four animals of two extreme pig breeds, Large White and Iberian, two males and two females. Mixed model analysis and neighbor - joining trees showed that tissues with similar developmental origin clustered closer than those with different embryonic origins. Often a sound biological interpretation was possible for overrepresented gene ontology categories within differentially expressed genes between groups of tissues. For instance, an excess of nervous system or muscle development genes were found among tissues of ectoderm or mesoderm origins, respectively. Tissue accounted for similar to 11 times more variability than sex or breed. Nevertheless, we were able to confidently identify genes with differential expression across tissues between breeds (33 genes) and between sexes (19 genes). The genes primarily affected by sex were overall different than those affected by breed or tissue. Interaction with tissue can be important for differentially expressed genes between breeds but not so much for genes whose expression differ between sexes.Conclusion: Embryonic development leaves an enduring footprint on the transcriptome. The interaction in gene x tissue for differentially expressed genes between breeds suggests that animal breeding has targeted differentially each tissue's transcriptome.