995 resultados para expressed sequence tags (EST)


Relevância:

100.00% 100.00%

Publicador:

Resumo:

The skin of fish is the first line of defense against pathogens and parasites. The skin transcriptome of the Atlantic salmon is poorly characterized, and currently only 2,089 expressed sequence tags (ESTs) out of a total of half a million sequences are generated from skin-derived cDNA libraries. The primary aim of this study was to enhance the transcriptomic knowledge of salmon skin by using next-generation sequencing (NGS) technology, namely the Roche-454 platform. An equimolar mixture of high-quality RNA from skin and epidermal samples of salmon reared in either freshwater or seawater was used for 454-sequencing. This technique yielded over 600,000 reads, which were assembled into 34,696 isotigs using Newbler. Of these isotigs, 12 % had not been sequenced in Atlantic salmon, hence representing previously unreported salmon mRNAs that can potentially be skin-specific. Many full-length genes have been acquired, representing numerous biological processes. Mucin proteins are the main structural component of mucus and we examined in greater detail the sequences we obtained for these genes. Several isotigs exhibited homology to mammalian mucins (MUC2, MUC5AC and MUC5B). Mucin mRNAs are generally > 10 kbp and contain large repetitive units, which pose a challenge towards full-length sequence discovery. To date, we have not unearthed any full-length salmon mucin genes with this dataset, but have both N- and C-terminal regions of a mucin type 5. This highlights the fact that, while NGS is indeed a formidable tool for sequence data mining of non-model species, it must be complemented with additional experimental and bioinformatic work to characterize some mRNA sequences with complex features.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The European sea bass, Dicentrarchus labrax, is one of the most important marine species cultivated in Southern Europe and has not benefited from selective breeding. One of the major goals in the sea bass (D. labrax) aquaculture industry is to understand and control the complexity of growth associated traits. The aim of the methodology developed for the studies reported in the thesis was not only to establish genetic and genomic resources for sea bass, but to also develop a conceptual strategy to efficiently create knowledge in a research environment that can easily be transferred to the aquaculture industry. The strategy involved; i) establishing an annotated sea bass transcriptome and then using it to, ii) identify new genetic markers for target QTL regions so that, iii) new QTL analysis could be performed and marker based resolution of the DNA regions of interest increased, and then iv) to merge the linkage map and the physical map in order to map the QTL confidence intervals to the sea bass genome and identify genes underlying the targeted traits. Finally to test if genes in the QTL regions that are candidates for divergent growth phenotypes have modified patterns of transcription that reflects the modified whole organism physiology SuperSAGE-SOLiD4 gene expression was used with sea bass with high growth heterogeneity. The SuperSAGE contributed to significantly increase the transcriptome information for sea bass muscle, brain and liver and also led to the identification of putative candidate genes lying in the genomic region of growth related QTL. Lastly all differentially expressed transcripts in brain, liver and muscle of the European sea bass with divergent specific growth rates were mapped to gene pathways and networks and the regulatory pathways most affected identified and established the tissue specific changes underlying the divergent SGR. Owing to the importance of European sea bass to Mediterranean aquaculture and the developed genomics resources from the present thesis and from other studies it should be possible to implement genetic selection programs using marker assisted selection.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Among the largest resources for biological sequence data is the large amount of expressed sequence tags (ESTs) available in public and proprietary databases. ESTs provide information on transcripts but for technical reasons they often contain sequencing errors. Therefore, when analyzing EST sequences computationally, such errors must be taken into account. Earlier attempts to model error prone coding regions have shown good performance in detecting and predicting these while correcting sequencing errors using codon usage frequencies. In the research presented here, we improve the detection of translation start and stop sites by integrating a more complex mRNA model with codon usage bias based error correction into one hidden Markov model (HMM), thus generalizing this error correction approach to more complex HMMs. We show that our method maintains the performance in detecting coding sequences.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We have used massively parallel signature sequencing (MPSS) to sample the transcriptomes of 32 normal human tissues to an unprecedented depth, thus documenting the patterns of expression of almost 20,000 genes with high sensitivity and specificity. The data confirm the widely held belief that differences in gene expression between cell and tissue types are largely determined by transcripts derived from a limited number of tissue-specific genes, rather than by combinations of more promiscuously expressed genes. Expression of a little more than half of all known human genes seems to account for both the common requirements and the specific functions of the tissues sampled. A classification of tissues based on patterns of gene expression largely reproduces classifications based on anatomical and biochemical properties. The unbiased sampling of the human transcriptome achieved by MPSS supports the idea that most human genes have been mapped, if not functionally characterized. This data set should prove useful for the identification of tissue-specific genes, for the study of global changes induced by pathological conditions, and for the definition of a minimal set of genes necessary for basic cell maintenance. The data are available on the Web at http://mpss.licr.org and http://sgb.lynxgen.com.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

BACKGROUND: Cancer/testis (CT) genes are normally expressed only in germ cells, but can be activated in the cancer state. This unusual property, together with the finding that many CT proteins elicit an antigenic response in cancer patients, has established a role for this class of genes as targets in immunotherapy regimes. Many families of CT genes have been identified in the human genome, but their biological function for the most part remains unclear. While it has been shown that some CT genes are under diversifying selection, this question has not been addressed before for the class as a whole. RESULTS: To shed more light on this interesting group of genes, we exploited the generation of a draft chimpanzee (Pan troglodytes) genomic sequence to examine CT genes in an organism that is closely related to human, and generated a high-quality, manually curated set of human:chimpanzee CT gene alignments. We find that the chimpanzee genome contains homologues to most of the human CT families, and that the genes are located on the same chromosome and at a similar copy number to those in human. Comparison of putative human:chimpanzee orthologues indicates that CT genes located on chromosome X are diverging faster and are undergoing stronger diversifying selection than those on the autosomes or than a set of control genes on either chromosome X or autosomes. CONCLUSION: Given their high level of diversifying selection, we suggest that CT genes are primarily responsible for the observed rapid evolution of protein-coding genes on the X chromosome.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Le rôle important joué par la mitochondrie dans la cellule eucaryote est admis depuis longtemps. Cependant, la composition exacte des mitochondries, ainsi que les processus biologiques qui sy déroulent restent encore largement inconnus. Deux facteurs principaux permettent dexpliquer pourquoi létude des mitochondries progresse si lentement : le manque defficacité des méthodes didentification des protéines mitochondriales et le manque de précision dans lannotation de ces protéines. En conséquence, nous avons développé un nouvel outil informatique, YimLoc, qui permet de prédire avec succès les protéines mitochondriales à partir des séquences génomiques. Cet outil intègre plusieurs indicateurs existants, et sa performance est supérieure à celle des indicateurs considérés individuellement. Nous avons analysé environ 60 génomes fongiques avec YimLoc afin de lever la controverse concernant la localisation de la bêta-oxydation dans ces organismes. Contrairement à ce qui était généralement admis, nos résultats montrent que la plupart des groupes de Fungi possèdent une bêta-oxydation mitochondriale. Ce travail met également en évidence la diversité des processus de bêta-oxydation chez les champignons, en corrélation avec leur utilisation des acides gras comme source dénergie et de carbone. De plus, nous avons étudié le composant clef de la voie de bêta-oxydation mitochondriale, lacyl-CoA déshydrogénase (ACAD), dans 250 espèces, couvrant les 3 domaines de la vie, en combinant la prédiction de la localisation subcellulaire avec la classification en sous-familles et linférence phylogénétique. Notre étude suggère que les gènes ACAD font partie dune ancienne famille qui a adopté des stratégies évolutionnaires innovatrices afin de générer un large ensemble denzymes susceptibles dutiliser la plupart des acides gras et des acides aminés. Finalement, afin de permettre la prédiction de protéines mitochondriales à partir de données autres que les séquences génomiques, nous avons développé le logiciel TESTLoc qui utilise comme données des Expressed Sequence Tags (ESTs). La performance de TESTLoc est significativement supérieure à celle de tout autre outil de prédiction connu. En plus de fournir deux nouveaux outils de prédiction de la localisation subcellulaire utilisant différents types de données, nos travaux démontrent comment lassociation de la prédiction de la localisation subcellulaire à dautres méthodes danalyse in silico permet daméliorer la connaissance des protéines mitochondriales. De plus, ces travaux proposent des hypothèses claires et faciles à vérifier par des expériences, ce qui présente un grand potentiel pour faire progresser nos connaissances des métabolismes mitochondriaux.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Chez les angiospermes, la reproduction passe par la double fécondation. Le tube pollinique délivre deux cellules spermatiques au sein du gamétophyte femelle. Une cellule féconde la cellule œuf pour produire un zygote; l’autre féconde la cellule centrale pour produire l’endosperme. Pour assurer un succès reproductif, le développement du gamétophyte femelle au sein de l’ovule doit établir un patron cellulaire qui favorise les interactions avec le tube pollinique et les cellules spermatiques. Pour ce faire, un dialogue doit s’établir entre les différentes cellules de l’ovule lors de son développement, de même que lors de la fécondation. D’ailleurs, plusieurs types de communications intercellulaires sont supposées suite à la caractérisation de plusieurs mutants développementaux. De même, ces communications semblent persister au sein du zygote et de l’endosperme pour permettre la formation d’un embryon viable au sein de la graine. Malgré les développements récents qui ont permis de trouver des molécules de signalisation supportant les modèles d’interactions cellulaires avancés par la communauté scientifique, les voies de signalisation sont de loin très incomplètes. Dans le but de caractériser des gènes encodant des protéines de signalisation potentiellement impliqués dans la reproduction chez Solanum chacoense, l’analyse d’expression des gènes de type RALF présents dans une banque d’ESTs (Expressed Sequence Tags) spécifiques à l’ovule après fécondation a été entreprise. RALF, Rapid Alcalinization Factor, est un peptide de 5 kDa qui fait partie de la superfamille des «protéines riches en cystéines (CRPs)», dont les rôles physiologiques au sein de la plante sont multiples. Cette analyse d’expression a conduit à une analyse approfondie de ScRALF3, dont l’expression au sein de la plante se limite essentiellement à l’ovule. L’analyse de plantes transgéniques d’interférence pour le gène ScRALF3 a révélé un rôle particulier lors de la mégagamétogénèse. Les plantes transgéniques présentent des divisions mitotiques anormales qui empêchent le développement complet du sac embryonnaire. Le positionnement des noyaux, de même que la synchronisation des divisions au sein du syncytium, semblent responsables de cette perte de progression lors de la mégagamétogénèse. L’isolement du promoteur de même que l’analyse plus précise d’expression au sein de l’ovule révèle une localisation sporophytique du transcrit. La voie de signalisation de l’auxine régule également la transcription de ScRALF3. De surcroît, ScRALF3 est un peptide empruntant la voie de sécrétion médiée par le réticulum endoplasmique et l’appareil de Golgi. En somme, ScRALF3 est un important facteur facilitant la communication entre le sporophyte et le gamétophyte pour amener à maturité le sac embryonnaire. L’identification d’un orthologue potentiel chez Arabidopsis thaliana a conduit à la caractérisation de AtRALF34. L’absence de phénotype lors du développement du sac embryonnaire suggère, cependant, de la redondance génétique au sein de la grande famille des gènes de type RALF. Néanmoins, les peptides RALFs apparaissent comme d’importants régulateurs lors de la reproduction chez Solanum chacoense et Arabidopsis thaliana.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Coconut, Cocos nucifera L. is a major plantation crop, which ensures income for millions of people in the tropical region. Detailed molecular studies on zygotic embryo development would provide valuable clues for the identification of molecular markers to improve somatic embryogenesis. Since there is no ongoing genome project for this species, coconut expressed sequence tags (EST) would be an interesting technique to identify important coconut embryo specific genes as well as other functional genes in different biochemical pathways. The goal of this study was to analyse the ESTs by examining the transcriptome data of the different embryo tissue types together with one somatic tissue. Here, four cDNA libraries from immature embryo, mature embryo, microspore derived embryo and mature leaves were constructed. cDNA was sequenced by the Roche-454 GS-FLX system and assembled into 32621 putative unigenes and 155017 singletons. Of these unigenes, 18651 had significant sequence similarities to non-redundant protein database, from which 16153 were assigned to one or more gene ontology categories. Homologue genes, which are responsible for embryo development such as chitinase, beta-1,3-glucanase, ATP synthase CF0 subunit, thaumatin-like protein and metallothionein-like protein were identified among the embryo EST collection. Of the unigenes, 6694 were mapped into 139 KEGG pathways including carbohydrate metabolism, energy metabolism, lipid metabolism, amino acid metabolism and nucleotide metabolism. This collection of 454-derived EST data generated from different tissue types provides a significant resource for genome wide studies and gene discovery of coconut, a non-model species.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The sporulation stage of the aquatic fungus Blastocladiella emersonii culminates with the formation and release to the medium of a number of zoospores, which are motile cells responsible for the dispersal of the fungus. The presence in the sporulation solution of 1H-[1,2,4]Oxadiazolo[4,3-a]quinoxalin-1-one (ODQ), a potent and selective inhibitor of nitric oxide-sensitive guanylyl cyclases, completely prevented biogenesis of the zoospores. In addition, this compound was able to significantly reduce cGMP levels, which increase drastically during late sporulation, suggesting the existence of a nitric oxide-dependent mechanism for cGMP synthesis. Furthermore, increased levels of nitric oxide-derived products were detected during sporulation by fluorescence assays using DAF-2 DA, whose signal was drastically reduced in the presence of the nitric oxide synthase inhibitor N omega-Nitro-L-arginine methyl ester (L-NAME). These results were confirmed by quantitative chemiluminescent determination of the intracellular levels of nitric oxide-derived products. A putative nitric oxide synthase (NOS) activity was detected throughout sporulation, and this enzyme activity decreased significantly when L-NAME and 1-[2-(Trifluoromethyl)phenyl]imidazole (TRIM) were added to the assays. NOS assays carried out in the presence of EGTA showed decreased enzyme activity, suggesting the involvement of calcium ions in enzyme activation. Additionally, expressed sequence tags (ESTs) encoding putative guanylyl cyclases and a cGMP-phosphodiesterase were found in B. emersonii EST database (http://blasto.iq.usp.br), and the mRNA levels of the corresponding genes were observed to increase during sporulation. Altogether, data presented here revealed the presence and expression of guanylyl cyclase and cGMP phosphodiesterase genes in B. emersonii and provided evidence of a Ca(2+)-(center dot)NO-cGMP signaling pathway playing a role in zoospore biogenesis. (C) 2009 Elsevier Inc. All rights reserved.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The genome sequence of Aedes aegypti was recently reported. A significant amount of Expressed Sequence Tags (ESTs) were sequenced to aid in the gene prediction process. In the present work we describe an integrated analysis of the genomic and EST data, focusing on genes with preferential expression in larvae (LG), adults (AG) and in both stages (SG). A total of 913 genes (5.4% of the transcript complement) are LG, including ion transporters and cuticle proteins that are important for ion homeostasis and defense. From a starting set of 245 genes encoding the trypsin domain, we identified 66 putative LG, AG, and SG trypsins by manual curation. Phylogenetic analyses showed that AG trypsins are divergent from their larval counterparts (LG), grouping with blood-induced trypsins from Anopheles gambiae and Simulium vittatum. These results support the hypothesis that blood-feeding arose only once, in the ancestral Culicomorpha. Peritrophins are proteins that interlock chitin fibrils to form the peritrophic membrane (PM) that compartmentalizes the food in the midgut. These proteins are recognized by having chitin-binding domains with 6 conserved Cys and may also present mucin-like domains (regions expected to be highly O-glycosylated). PM may be formed by a ring of cells (type 2, seen in Ae. aegypti larvae and Drosophila melanogaster) or by most midgut cells (type 1, found in Ae. aegypti adult and Tribolium castaneum). LG and D. melanogaster peritrophins have more complex domain structures than AG and T. castaneum peritrophins. Furthermore, mucin-like domains of peritrophins from T. castaneum (feeding on rough food) are lengthier than those of adult Ae. aegypti (blood-feeding). This suggests, for the first time, that type 1 and type 2 PM may have variable molecular architectures determined by different peritrophins and/or ancillary proteins, which may be partly modulated by diet.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Background: Human infection by the pork tapeworm Taenia solium affects more than 50 million people worldwide, particularly in underdeveloped and developing countries. Cysticercosis which arises from larval encystation can be life threatening and difficult to treat. Here, we investigate for the first time the transcriptome of the clinically relevant cysticerci larval form. Results: Using Expressed Sequence Tags (ESTs) produced by the ORESTES method, a total of 1,520 high quality ESTs were generated from 20 ORESTES cDNA mini-libraries and its analysis revealed fragments of genes with promising applications including 51 ESTs matching antigens previously described in other species, as well as 113 sequences representing proteins with potential extracellular localization, with obvious applications for immune-diagnosis or vaccine development. Conclusion: The set of sequences described here will contribute to deciphering the expression profile of this important parasite and will be informative for the genome assembly and annotation, as well as for studies of intra- and inter-specific sequence variability. Genes of interest for developing new diagnostic and therapeutic tools are described and discussed.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)