980 results for Expressed sequence tags
Abstract:
Crotalus durissus rattlesnakes are responsible for the most lethal cases of snakebite in Brazil. The subspecies Crotalus durissus collilineatus is associated with a great number of accidents in the Southeast and Central West regions, but few studies on its venom composition have been carried out to date. In an attempt to describe the transcriptional profile of the C. durissus collilineatus venom gland, we generated a cDNA library and identified the sequences obtained by similarity searches against existing databases. Out of 673 expressed sequence tags (ESTs), 489 produced readable sequences comprising 201 singletons and 47 clusters of two or more ESTs. One hundred and fifty reads (60.5%) produced significant hits to known sequences. The results showed a predominance of toxin-coding ESTs rather than transcripts coding for proteins involved in all cellular functions. The most frequent toxin was crotoxin, comprising 88% of toxin-coding sequences. Crotoxin B, a basic phospholipase A(2) (PLA(2)) subunit of crotoxin, was represented in more variable forms than the non-enzymatic subunit (crotoxin A), and most sequences coding for this molecule were identified as the CB1 isoform from Crotalus durissus terrificus venom. Four percent of toxin-related sequences in this study were identified as growth factors, comprising five sequences for vascular endothelial growth factor (VEGF) and one for nerve growth factor (NGF) that showed 100% identity with C. durissus terrificus NGF. We also identified two clusters for PII-class metalloprotease, comprising 3% of the toxins, and two for serine proteases, including gyroxin (2.5%). The remaining 2.5% of toxin-coding ESTs represent singletons identified as homologous to cardiotoxin, convulxin, angiotensin-converting enzyme inhibitor and C-type natriuretic peptide, ohanin, crotamine and PLA(2) inhibitor. These results allowed the identification of the most common classes of toxins in C. durissus collilineatus snake venom, also showing some classes previously unknown for this subspecies and even for the C. durissus species, such as cardiotoxins and VEGF. (C) 2009 Published by Elsevier Masson SAS.
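The counts quoted in this abstract can be cross-checked with a few lines of arithmetic. The sketch below assumes that the 60.5% figure is taken over unique sequences (201 singletons plus 47 clusters); the abstract does not state the denominator explicitly, so that reading is an assumption, and the variable names are illustrative only.

```python
# Counts quoted in the abstract above.
total_ests = 673
readable_ests = 489
singletons = 201
clusters = 47
significant_hits = 150

# Assumption: hits are counted over unique sequences (singletons + clusters),
# not over all readable ESTs.
unique_sequences = singletons + clusters
hit_fraction = significant_hits / unique_sequences
readable_fraction = readable_ests / total_ests

print(f"unique sequences: {unique_sequences}")
print(f"significant hits among unique sequences: {hit_fraction:.1%}")
print(f"readable ESTs among all ESTs: {readable_fraction:.1%}")
```

Under this reading the reported percentage is reproduced, which suggests that "reads" in that sentence refers to the 248 unique sequences.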
Abstract:
A polyclonal antibody (C4), raised against the head domain of chicken myosin Va, reacted strongly towards a 65 kDa polypeptide (p65) on Western blots of extracts from squid optic lobes but did not recognize the heavy chain of squid myosin V. This peptide was not recognized by other myosin Va antibodies, nor by an antibody specific for squid myosin V. In an attempt to identify it, p65 was purified from optic lobes of Loligo plei by cationic exchange and reverse phase chromatography. Several peptide sequences were obtained by mass spectroscopy from p65 cut from sodium dodecyl sulphate polyacrylamide gel electrophoresis (SDS-PAGE) gels. BLAST analysis and partial matching with expressed sequence tags (ESTs) from a Loligo pealei data bank indicated that p65 contains consensus signatures for the heterogeneous nuclear ribonucleoprotein (hnRNP) A/B family of RNA-binding proteins. Centrifugation of post mitochondrial extracts from optic lobes on sucrose gradients after treatment with RNase gave biochemical evidence that p65 associates with cytoplasmic RNP complexes in an RNA-dependent manner. Immunohistochemistry and immunofluorescence studies using the C4 antibody showed partial co-labeling with an antibody against squid synaptotagmin in bands within the outer plexiform layer of the optic lobes and at the presynaptic zone of the stellate ganglion. Also, punctate labeling by the C4 antibody was observed within isolated optic lobe synaptosomes. The data indicate that p65 is a novel RNA-binding protein located to the presynaptic terminal within squid neurons and may have a role in synaptic localization of RNA and its translation or processing. (C) 2010 IBRO. Published by Elsevier Ltd. All rights reserved.
Abstract:
The molecular mechanism that controls the response to phosphate shortage in Neurospora crassa involves four regulatory genes - nuc-2, preg, pgov, and nuc-1. Phosphate shortage is sensed by the nuc-2 gene, the product of which inhibits the functioning of the PREG-PGOV complex. This allows the translocation of the transcriptional factor NUC-1 into the nucleus, which activates the transcription of phosphate-repressible phosphatases. The nuc-2A mutant strain of N. crassa carries a loss-of-function mutation in the nuc-2 gene, which encodes an ankyrin-like repeat protein. In this study, we identified transcripts that are downregulated in the nuc-2A mutant strain. Functional grouping of these expressed sequence tags allowed the identification of genes that play essential roles in different cellular processes such as transport, transcriptional regulation, signal transduction, metabolism, protein synthesis, protein fate, and development. These results reveal novel aspects of the phosphorus-sensing network in N. crassa. (C) 2009 Elsevier GmbH. All rights reserved.
Abstract:
The success of plant reproduction depends on pollen-pistil interactions occurring at the stigma/style. These interactions vary depending on the stigma type: wet or dry. Tobacco (Nicotiana tabacum) represents a model of wet stigma, and its stigmas/styles express genes to accomplish the appropriate functions. For a large-scale study of gene expression during tobacco pistil development and preparation for pollination, we generated 11,216 high-quality expressed sequence tags (ESTs) from stigmas/styles and created the TOBEST database. These ESTs were assembled in 6,177 clusters, from which 52.1% are pistil transcripts/genes of unknown function. The 21 clusters with the highest number of ESTs (putative higher expression levels) correspond to genes associated with defense mechanisms or pollen-pistil interactions. The database analysis unraveled tobacco sequences homologous to the Arabidopsis (Arabidopsis thaliana) genes involved in specifying pistil identity or determining normal pistil morphology and function. Additionally, 782 independent clusters were examined by macroarray, revealing 46 stigma/style preferentially expressed genes. Real-time reverse transcription-polymerase chain reaction experiments validated the pistil-preferential expression for nine out of 10 genes tested. A search for these 46 genes in the Arabidopsis pistil data sets demonstrated that only 11 sequences, with putative equivalent molecular functions, are expressed in this dry stigma species. The reverse search for the Arabidopsis pistil genes in the TOBEST exposed a partial overlap between these dry and wet stigma transcriptomes. The TOBEST represents the most extensive survey of gene expression in the stigmas/styles of wet stigma plants, and our results indicate that wet and dry stigmas/styles express common as well as distinct genes in preparation for the pollination process.
Abstract:
Background: The continued increase in tuberculosis (TB) rates and the appearance of extremely resistant Mycobacterium tuberculosis strains (XDR-TB) worldwide are some of the great problems of public health. In this context, DNA immunotherapy has been proposed as an effective alternative that could circumvent the limitations of conventional drugs. Nonetheless, the molecular events underlying these therapeutic effects are poorly understood. Methods: We characterized the transcriptional signature of lungs from mice infected with M. tuberculosis and treated with heat shock protein 65 as a genetic vaccine (DNAhsp65), combining microarray and real-time polymerase chain reaction analysis. The gene expression data were correlated with the histopathological analysis of lungs. Results: The differential modulation of a high number of genes allowed us to distinguish DNAhsp65-treated from nontreated animals (saline and vector-injected mice). Functional analysis of this group of genes suggests that DNAhsp65 therapy could not only boost the T helper (Th)1 immune response, but could also inhibit Th2 cytokines and regulate the intensity of inflammation through fine tuning of the expression of various genes, including those of interleukin-17, lymphotoxin A, tumour necrosis factor-alpha, interleukin-6, transforming growth factor-beta, inducible nitric oxide synthase and Foxp3. In addition, a large number of genes and expressed sequence tags previously unrelated to DNA therapy were identified. All these findings were well correlated with the histopathological lesions presented in the lungs. Conclusions: The effects of DNA therapy are reflected in gene expression modulation; therefore, the genes identified as differentially expressed could be considered as transcriptional biomarkers of DNAhsp65 immunotherapy against TB. The data have important implications for achieving a better understanding of gene-based therapies. Copyright (C) 2008 John Wiley & Sons, Ltd.
Abstract:
Among the largest resources for biological sequence data is the large amount of expressed sequence tags (ESTs) available in public and proprietary databases. ESTs provide information on transcripts but for technical reasons they often contain sequencing errors. Therefore, when analyzing EST sequences computationally, such errors must be taken into account. Earlier attempts to model error prone coding regions have shown good performance in detecting and predicting these while correcting sequencing errors using codon usage frequencies. In the research presented here, we improve the detection of translation start and stop sites by integrating a more complex mRNA model with codon usage bias based error correction into one hidden Markov model (HMM), thus generalizing this error correction approach to more complex HMMs. We show that our method maintains the performance in detecting coding sequences.
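To make the hidden Markov model idea in this abstract more concrete, the following is a minimal sketch of Viterbi decoding over a toy two-state model (noncoding versus coding). It is not the authors' model: it has no codon structure, no start or stop site states, and no error correction, and every state name and probability below is an invented assumption chosen only for illustration.

```python
import math


def viterbi(observations, states, start_p, trans_p, emit_p):
    """Return the most probable state path for a non-empty observation sequence."""
    # Work in log space to avoid numerical underflow on long sequences.
    scores = [{
        s: math.log(start_p[s]) + math.log(emit_p[s][observations[0]])
        for s in states
    }]
    back = [{}]
    for t in range(1, len(observations)):
        scores.append({})
        back.append({})
        for s in states:
            # Pick the predecessor state that maximises the path score into s.
            best_prev = max(
                states,
                key=lambda p: scores[t - 1][p] + math.log(trans_p[p][s]),
            )
            scores[t][s] = (
                scores[t - 1][best_prev]
                + math.log(trans_p[best_prev][s])
                + math.log(emit_p[s][observations[t]])
            )
            back[t][s] = best_prev
    # Trace back from the best final state.
    path = [max(states, key=lambda s: scores[-1][s])]
    for t in range(len(observations) - 1, 0, -1):
        path.append(back[t][path[-1]])
    path.reverse()
    return path


# Hypothetical toy parameters: the "coding" state is GC-rich, "noncoding" is AT-rich.
STATES = ("noncoding", "coding")
START_P = {"noncoding": 0.5, "coding": 0.5}
TRANS_P = {
    "noncoding": {"noncoding": 0.9, "coding": 0.1},
    "coding": {"noncoding": 0.1, "coding": 0.9},
}
EMIT_P = {
    "noncoding": {"A": 0.4, "C": 0.1, "G": 0.1, "T": 0.4},
    "coding": {"A": 0.1, "C": 0.4, "G": 0.4, "T": 0.1},
}

if __name__ == "__main__":
    sequence = "ATATATGCGCGCGCGCATATAT"
    for base, state in zip(sequence, viterbi(sequence, STATES, START_P, TRANS_P, EMIT_P)):
        print(base, state)
```

A model of the kind the abstract describes would replace the two states with codon-position, start, stop and untranslated-region states, and would add transitions that absorb insertions and deletions; the decoding step itself would remain the same dynamic programme.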
Abstract:
We have used massively parallel signature sequencing (MPSS) to sample the transcriptomes of 32 normal human tissues to an unprecedented depth, thus documenting the patterns of expression of almost 20,000 genes with high sensitivity and specificity. The data confirm the widely held belief that differences in gene expression between cell and tissue types are largely determined by transcripts derived from a limited number of tissue-specific genes, rather than by combinations of more promiscuously expressed genes. Expression of a little more than half of all known human genes seems to account for both the common requirements and the specific functions of the tissues sampled. A classification of tissues based on patterns of gene expression largely reproduces classifications based on anatomical and biochemical properties. The unbiased sampling of the human transcriptome achieved by MPSS supports the idea that most human genes have been mapped, if not functionally characterized. This data set should prove useful for the identification of tissue-specific genes, for the study of global changes induced by pathological conditions, and for the definition of a minimal set of genes necessary for basic cell maintenance. The data are available on the Web at http://mpss.licr.org and http://sgb.lynxgen.com.
Abstract:
BACKGROUND: Cancer/testis (CT) genes are normally expressed only in germ cells, but can be activated in the cancer state. This unusual property, together with the finding that many CT proteins elicit an antigenic response in cancer patients, has established a role for this class of genes as targets in immunotherapy regimes. Many families of CT genes have been identified in the human genome, but their biological function for the most part remains unclear. While it has been shown that some CT genes are under diversifying selection, this question has not been addressed before for the class as a whole. RESULTS: To shed more light on this interesting group of genes, we exploited the generation of a draft chimpanzee (Pan troglodytes) genomic sequence to examine CT genes in an organism that is closely related to human, and generated a high-quality, manually curated set of human:chimpanzee CT gene alignments. We find that the chimpanzee genome contains homologues to most of the human CT families, and that the genes are located on the same chromosome and at a similar copy number to those in human. Comparison of putative human:chimpanzee orthologues indicates that CT genes located on chromosome X are diverging faster and are undergoing stronger diversifying selection than those on the autosomes or than a set of control genes on either chromosome X or autosomes. CONCLUSION: Given their high level of diversifying selection, we suggest that CT genes are primarily responsible for the observed rapid evolution of protein-coding genes on the X chromosome.
Abstract:
High throughput genome (HTG) and expressed sequence tag (EST) sequences are currently the most abundant nucleotide sequence classes in the public database. The large volume, high degree of fragmentation and lack of gene structure annotations prevent efficient and effective searches of HTG and EST data for protein sequence homologies by standard search methods. Here, we briefly describe three newly developed resources that should make discovery of interesting genes in these sequence classes easier in the future, especially to biologists not having access to a powerful local bioinformatics environment. trEST and trGEN are regularly regenerated databases of hypothetical protein sequences predicted from EST and HTG sequences, respectively. Hits is a web-based data retrieval and analysis system providing access to precomputed matches between protein sequences (including sequences from trEST and trGEN) and patterns and profiles from Prosite and Pfam. The three resources can be accessed via the Hits home page (http://hits.isb-sib.ch).
Abstract:
"The host-parasite relationship" is a vast and diverse research field which, despite huge human and financial input over many years, remains largely shrouded in mystery. Clearly, the adaptation of parasites to their different host species, and to the different environmental stresses that they represent, depends on interactions with, and responses to, various molecules of host and/or parasite origin. The schistosome genome project is a primary strategy to reach the goal; this systematic research project has successfully developed novel technologies for qualitative and quantitative characterization of schistosome genes and genome organization by extensive international collaboration between top quality laboratories. Schistosomes are a family of parasitic blood flukes (Phylum Platyhelminthes), which have seven pairs of autosomal chromosomes and one pair of sex chromosomes (ZZ for a male worm and ZW for a female), with a haploid genome size of 2.7 × 10^8 base pairs (Simpson et al. 1982). Schistosomes are ideal model organisms for the development of genome mapping strategies since they have a small genome size comparable to that of well-characterized model organisms such as Caenorhabditis elegans (100 Mb) and Drosophila (165 Mb), and contain functional genes with a high level of homology to the host mammalian genes. Here we summarize the current progress in the schistosome genome project: the information on 3,047 transcribed genes (Expressed Sequence Tags; EST) and complete sets of cDNA and genomic DNA libraries (including YAC and cosmid libraries), with a mapping technique to the well defined schistosome chromosomes. In the near future, the schistosome genome project will further identify and characterize the key molecules that are responsible for host-parasite adaptation, i.e., successful growth, development, maturation and reproduction of the parasite within its host.
Abstract:
Strategies to construct the physical map of the Trypanosoma cruzi nuclear genome have to capitalize on three main advantages of the parasite genome, namely (a) its small size, (b) the fact that all chromosomes can be defined, and many of them can be isolated by pulsed-field gel electrophoresis, and (c) the fact that simple Southern blots of electrophoretic karyotypes can be used to map sequence tagged sites and expressed sequence tags to chromosomal bands. A major drawback to cope with is the complexity of T. cruzi genetics, which hinders the construction of a comprehensive genetic map. As a first step towards physical mapping, we report the construction and partial characterization of a T. cruzi CL-Brener genomic library in yeast artificial chromosomes (YACs) that consists of 2,770 individual YACs with a mean insert size of 365 kb encompassing around 10 genomic equivalents. Two libraries in bacterial artificial chromosomes (BACs) have been constructed, BACI and BACII. Both libraries represent about three genome equivalents. A third BAC library (BAC III) is being constructed. YACs and BACs are invaluable tools for physical mapping. More generally, they have to be considered as a common resource for research in Chagas disease.
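The YAC library figures in this abstract can be related to one another with a short calculation. The sketch below multiplies the number of clones by the mean insert size and divides by the stated coverage to obtain the genome size those numbers imply. The abstract says "around 10" genomic equivalents, so the result is only a rough inference, not a value reported by the authors.

```python
# Figures quoted in the abstract above.
num_yacs = 2770
mean_insert_kb = 365
genome_equivalents = 10  # "around 10" in the abstract, so treat as approximate

# Total cloned DNA in the library, in kilobases.
total_cloned_kb = num_yacs * mean_insert_kb

# Genome size implied by total cloned DNA / coverage, converted to megabases.
implied_genome_mb = total_cloned_kb / genome_equivalents / 1000

print(f"total cloned DNA: {total_cloned_kb:,} kb")
print(f"implied genome size: about {implied_genome_mb:.0f} Mb")
```

The same relation (clones × mean insert size ÷ genome size) gives the coverage of any clone library once a genome size estimate is available.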
Abstract:
Random single pass sequencing of cDNA fragments, also known as generation of Expressed Sequence Tags (ESTs), has been highly successful in the study of the gene content of higher organisms, and forms an integral part of most genome projects, with the objective to identify new genes and targets for disease control and prevention and to generate mapping probes. In the Trypanosoma cruzi genome project, EST sequencing has also been a starting point, and here we report data on the first 797 sequences obtained, partly from a CL Brener epimastigote non-normalized library and partly from a normalized library. Only around 30% of the sequences obtained showed similarity with sequences in the GenBank and dbEST databases, half of which were sequences already reported for T. cruzi.
Abstract:
The identification of all human chromosome 21 (HC21) genes is a necessary step in understanding the molecular pathogenesis of trisomy 21 (Down syndrome). The first analysis of the sequence of 21q included 127 previously characterized genes and predicted an additional 98 novel anonymous genes. Recently we evaluated the quality of this annotation by characterizing a set of HC21 open reading frames (C21orfs) identified by mapping spliced expressed sequence tags (ESTs) and predicted genes (PREDs), identified only in silico. This study underscored the limitations of in silico-only gene prediction, as many PREDs were incorrectly predicted. To refine the HC21 annotation, we have developed a reliable algorithm to extract and stringently map sequences that contain bona fide 3' transcript ends to the genome. We then created a specific 21q graphical display allowing an integrated view of the data that incorporates new ESTs as well as features such as CpG islands, repeats, and gene predictions. Using these tools we identified 27 new putative genes. To validate these, we sequenced previously cloned cDNAs and carried out RT-PCR, 5'- and 3'-RACE procedures, and comparative mapping. These approaches substantiated 19 new transcripts, thus increasing the HC21 gene count by 9.5%. These transcripts were likely not previously identified because they are small and encode small proteins. We also identified four transcriptional units that are spliced but contain no obvious open reading frame. The HC21 data presented here further emphasize that current gene prediction algorithms miss a substantial number of transcripts that nevertheless can be identified using a combination of experimental approaches and multiple refined algorithms.
Abstract:
During my PhD, I used model species of vertebrates, such as mouse and zebrafish, to study factors affecting the evolution of genes and their expression. More precisely, I have shown that anatomy and development are key factors to take into account, influencing the rate of gene sequence evolution, the impact of mutations (i.e. is the deletion of a gene lethal?), and the propensity of a gene to duplicate. Where and when genes are expressed imposes constraints, or on the contrary leaves them some opportunity to evolve. We analyzed these patterns in relation to classical models of morphological evolution in vertebrates, which were previously thought to directly reflect constraints on the genomes. We showed that the patterns of evolution at these two levels of organization do not translate smoothly: there is no direct link between the conservation of genotype and that of phenotypes such as morphology. This work was made possible by the development of bioinformatics tools. Notably, I worked on the development of the database Bgee, which aims at comparing gene expression between different species in an automated and large-scale way. This involves the formalization of anatomy, development, and concepts related to homology, through the use of ontologies. A coherent integration of heterogeneous expression data (microarray, expressed sequence tags, in situ hybridizations) is also required. This database is regularly updated and freely available. It should help extend the possibilities for comparison of gene expression between species in evo-devo and genomics studies.