974 resultados para Expressed sequence tag analysis
Resumo:
The pipeline for macro- and microarray analyses (PMmA) is a set of scripts with a web interface developed to analyze DNA array data generated by array image quantification software. PMmA is designed for use with single- or double-color array data and to work as a pipeline in five classes (data format, normalization, data analysis, clustering, and array maps). It can also be used as a plugin in the BioArray Software Environment, an open-source database for array analysis, or used in a local version of the web service. All scripts in PMmA were developed in the PERL programming language and statistical analysis functions were implemented in the R statistical language. Consequently, our package is a platform-independent software. Our algorithms can correctly select almost 90% of the differentially expressed genes, showing a superior performance compared to other methods of analysis. The pipeline software has been applied to 1536 expressed sequence tags macroarray public data of sugarcane exposed to cold for 3 to 48 h. PMmA identified thirty cold-responsive genes previously unidentified in this public dataset. Fourteen genes were up-regulated, two had a variable expression and the other fourteen were down-regulated in the treatments. These new findings certainly were a consequence of using a superior statistical analysis approach, since the original study did not take into account the dependence of data variability on the average signal intensity of each gene. The web interface, supplementary information, and the package source code are available, free, to non-commercial users at http://ipe.cbmeg.unicamp.br/pub/PMmA.
Resumo:
Chez les angiospermes, la reproduction passe par la double fécondation. Le tube pollinique délivre deux cellules spermatiques au sein du gamétophyte femelle. Une cellule féconde la cellule œuf pour produire un zygote; l’autre féconde la cellule centrale pour produire l’endosperme. Pour assurer un succès reproductif, le développement du gamétophyte femelle au sein de l’ovule doit établir un patron cellulaire qui favorise les interactions avec le tube pollinique et les cellules spermatiques. Pour ce faire, un dialogue doit s’établir entre les différentes cellules de l’ovule lors de son développement, de même que lors de la fécondation. D’ailleurs, plusieurs types de communications intercellulaires sont supposées suite à la caractérisation de plusieurs mutants développementaux. De même, ces communications semblent persister au sein du zygote et de l’endosperme pour permettre la formation d’un embryon viable au sein de la graine. Malgré les développements récents qui ont permis de trouver des molécules de signalisation supportant les modèles d’interactions cellulaires avancés par la communauté scientifique, les voies de signalisation sont de loin très incomplètes. Dans le but de caractériser des gènes encodant des protéines de signalisation potentiellement impliqués dans la reproduction chez Solanum chacoense, l’analyse d’expression des gènes de type RALF présents dans une banque d’ESTs (Expressed Sequence Tags) spécifiques à l’ovule après fécondation a été entreprise. RALF, Rapid Alcalinization Factor, est un peptide de 5 kDa qui fait partie de la superfamille des «protéines riches en cystéines (CRPs)», dont les rôles physiologiques au sein de la plante sont multiples. Cette analyse d’expression a conduit à une analyse approfondie de ScRALF3, dont l’expression au sein de la plante se limite essentiellement à l’ovule. L’analyse de plantes transgéniques d’interférence pour le gène ScRALF3 a révélé un rôle particulier lors de la mégagamétogénèse. Les plantes transgéniques présentent des divisions mitotiques anormales qui empêchent le développement complet du sac embryonnaire. Le positionnement des noyaux, de même que la synchronisation des divisions au sein du syncytium, semblent responsables de cette perte de progression lors de la mégagamétogénèse. L’isolement du promoteur de même que l’analyse plus précise d’expression au sein de l’ovule révèle une localisation sporophytique du transcrit. La voie de signalisation de l’auxine régule également la transcription de ScRALF3. De surcroît, ScRALF3 est un peptide empruntant la voie de sécrétion médiée par le réticulum endoplasmique et l’appareil de Golgi. En somme, ScRALF3 est un important facteur facilitant la communication entre le sporophyte et le gamétophyte pour amener à maturité le sac embryonnaire. L’identification d’un orthologue potentiel chez Arabidopsis thaliana a conduit à la caractérisation de AtRALF34. L’absence de phénotype lors du développement du sac embryonnaire suggère, cependant, de la redondance génétique au sein de la grande famille des gènes de type RALF. Néanmoins, les peptides RALFs apparaissent comme d’importants régulateurs lors de la reproduction chez Solanum chacoense et Arabidopsis thaliana.
Resumo:
BACKGROUND: Chronic fatigue syndrome (CFS) is an increasing medical phenomenon of unknown aetiology leading to high levels of chronic morbidity. Of the many hypotheses that purport to explain this disease, immune system activation, as a central feature, has remained prominent but unsubstantiated. Supporting this, a number of important cytokines have previously been shown to be over-expressed in disease subjects. The diagnosis of CFS is highly problematic since no biological markers specific to this disease have been identified. The discovery of genes relating to this condition is an important goal in seeking to correctly categorize and understand this complex syndrome. OBJECTIVE: The aim of this study was to screen for changes in gene expression in the lymphocytes of CFS patients. METHODS: 'Differential Display' is a method for comparing mRNA populations for the induction or suppression of genes. In this technique, mRNA populations from control and test subjects can be 'displayed' by gel electrophoresis and screened for differing banding patterns. These differences are indicative of altered gene expression between samples, and the genes that correspond to these bands can be cloned and identified. Differential display has been used to compare expression levels between four control subjects and seven CFS patients. RESULTS: Twelve short expressed sequence tags have been identified that were over-expressed in lymphocytes from CFS patients. Two of these correspond to cathepsin C and MAIL1 - genes known to be upregulated in activated lymphocytes. The expression level of seven of the differentially displayed sequences have been verified by quantifying relative level of these transcripts using TAQman quantitative PCR. CONCLUSION: Taken as a whole, the identification of novel gene tags up-regulated in CFS patients adds weight to the idea that CFS is a disease characterized by subtle changes in the immune system.
Resumo:
The scarcity and stochastic nature of genetic mutations presents a significant challenge for scientists seeking to characterise de novo mutation frequency at specific loci. Such mutations can be particularly numerous during regeneration of plants from in vitro culture and can undermine the value of germplasm conservation efforts. We used cleaved amplified polymorphic sequence (CAPS) analysis to characterise new mutations amongst a clonal population of cocoa plants regenerated via a somatic embryogenesis protocol used previously for cocoa cryopreservation. Efficacy of the CAPS system for mutation detection was greatly improved after an ‘a priori’ in silico screen of reference target sequences for actual and potential restriction enzyme recognition sites using a new freely available software called Artbio. Artbio surveys known sequences for existing restriction enzyme recognition sites but also identifies all single nucleotide polymorphism (SNP) deviations from such motifs. Using this software, we performed an in silico screen of seven loci for restriction sites and their potential mutant SNP variants that were possible from 21 restriction enzymes. The four most informative locus-enzyme combinations were then used to survey the regenerant populations for de novo mutants. We characterised the pattern of point mutations and, using the outputs of Artbio, calculated the ratio of base substitution in 114 somatic embryo-derived cocoa regenerants originating from two explant genotypes. We found 49 polymorphisms, comprising 26.3% of the samples screened, with an inferred rate of 2.8 × 10−3 substitutions/screened base. This elevated rate is of a similar order of magnitude to previous reports of de novo microsatellite length mutations arising in the crop and suggests caution should be exercised when applying somatic embryogenesis for the conservation of plant germplasm.
Resumo:
The genome sequence of Aedes aegypti was recently reported. A significant amount of Expressed Sequence Tags (ESTs) were sequenced to aid in the gene prediction process. In the present work we describe an integrated analysis of the genomic and EST data, focusing on genes with preferential expression in larvae (LG), adults (AG) and in both stages (SG). A total of 913 genes (5.4% of the transcript complement) are LG, including ion transporters and cuticle proteins that are important for ion homeostasis and defense. From a starting set of 245 genes encoding the trypsin domain, we identified 66 putative LG, AG, and SG trypsins by manual curation. Phylogenetic analyses showed that AG trypsins are divergent from their larval counterparts (LG), grouping with blood-induced trypsins from Anopheles gambiae and Simulium vittatum. These results support the hypothesis that blood-feeding arose only once, in the ancestral Culicomorpha. Peritrophins are proteins that interlock chitin fibrils to form the peritrophic membrane (PM) that compartmentalizes the food in the midgut. These proteins are recognized by having chitin-binding domains with 6 conserved Cys and may also present mucin-like domains (regions expected to be highly O-glycosylated). PM may be formed by a ring of cells (type 2, seen in Ae. aegypti larvae and Drosophila melanogaster) or by most midgut cells (type 1, found in Ae. aegypti adult and Tribolium castaneum). LG and D. melanogaster peritrophins have more complex domain structures than AG and T. castaneum peritrophins. Furthermore, mucin-like domains of peritrophins from T. castaneum (feeding on rough food) are lengthier than those of adult Ae. aegypti (blood-feeding). This suggests, for the first time, that type 1 and type 2 PM may have variable molecular architectures determined by different peritrophins and/or ancillary proteins, which may be partly modulated by diet.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
A detailed genome mapping analysis of 213,636 expressed sequence tags (EST) derived from nontumor and tumor tissues of the oral cavity, larynx, pharynx, and thyroid was done. Transcripts matching known human genes were identified; potential new splice variants were flagged and subjected to manual curation, pointing to 788 putatively new alternative splicing isoforms, the majority (75%) being insertion events. A subset of 34 new splicing isoforms (5% of 788 events) was selected and 23 (68%) were confirmed by reverse transcription-PCR and DNA sequencing. Putative new genes were revealed, including six transcripts mapped to well-studied chromosomes such as 22, as well as transcripts that mapped to 253 intergenic regions. In addition, 2,251 noncoding intronic RNAs, eventually involved in transcriptional regulation, were found. A set of 250 candidate markers for loss of heterozygosis or gene amplification was selected by identifying transcripts that mapped to genomic regions previously known to be frequently amplified or deleted in head, neck, and thyroid tumors. Three of these markers were evaluated by quantitative reverse transcription-PCR in an independent set of individual samples. Along with detailed clinical data about tumor origin, the information reported here is now publicly available on a dedicated Web site as a resource for further biological investigation. This first in silico reconstruction of the head, neck, and thyroid transcriptomes points to a wealth of new candidate markers that can be used for future studies on the molecular basis of these tumors. Similar analysis is warranted for a number of other tumors for which large EST data sets are available.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Over 40,000 sugarcane (Saccharum officinarum) consensus sequences assembled from 237,954 expressed sequence tags were compared with the protein and DNA sequences from other angiosperms, including the genomes of Arabidopsis and rice (Oryza sativa). Approximately two-thirds of the sugarcane transcriptome have similar sequences in Arabidopsis. These sequences may represent a core set of proteins or protein domains that are conserved among monocots and eudicots and probably encode for essential angiosperm. functions. The remaining sequences represent putative monocot-specific genetic material, one-half of which were found only in sugarcane. These monocot-specific cDNAs represent either novelties or, in many cases, fast-evolving sequences that diverged substantially from their eudicot homologs. The wide comparative genome analysis presented here provides information on the evolutionary changes that underlie the divergence of monocots and eudicots. Our comparative analysis also led to the identification of several not yet annotated putative genes and possible gene loss events in Arabidopsis.
Resumo:
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
Resumo:
Paracoccidioides brasiliensis is a fungal human pathogen with a wide distribution in Latin America. It causes paracoccidioidomycosis, the most widespread systemic mycosis in Latin America. Although gene expression in P. brasiliensis had been studied, little is known about the genome sequences expressed by this species during the infection process. To better understand the infection process, 4934 expressed sequence tags (ESTs) derived from a non-normalized cDNA library from P. brasiliensis (isolate Pb01) yeast-phase cells recovered from the livers of infected mice were annotated and clustered to a UniGene (clusters containing sequences that represent a unique gene) set with 1602 members. A large-scale comparative analysis was performed between the UniGene sequences of P. brasiliensis yeast-phase cells recovered from infected mice and a database constructed with sequences of the yeast-phase and mycelium transcriptome (isolate Pb01) (https://dna.biomol.unb.br/Pb/), as well as with all public ESTs available at GenBank, including sequences of the P. brasiliensis yeast-phase transcriptome (isolate Pb18) (http:// www.ncbi.nlm.nih.gov/). The focus was on the overexpressed and novel genes. From the total, 3184 ESTs (64.53%) were also present in the previously described transcriptome of yeast-form and mycelium cells obtained from in vitro cultures (https://dna.biomol.unb.br/Pb/) and of those, 1172 ESTs (23.75% of the described sequences) represented transcripts overexpressed during the infection process. Comparative analysis identified 1750 ESTs (35.47% of the total), comprising 649 UniGene sequences representing novel transcripts of P. brasiliensis, not previously described for this isolate or for other isolates in public databases. KEGG pathway mapping showed that the novel and overexpressed transcripts represented standard metabolic pathways, including glycolysis, amino acid biosynthesis, lipid and sterol metabolism. The unique and divergent representation of transcripts in the cDNA library of yeast cells recovered from infected mice suggests differential gene expression in response to the host milieu.
Resumo:
We have analyzed 16 missense mutations of the tissue-nonspecific AP (TNAP) gene found in patients with hypophosphatasia. These mutations span the phenotypic spectrum of the disease, from the lethal perinatal/infantile forms to the less severe adult and odontohypophosphatasia. Site-directed mutagenesis was used to introduce a sequence tag into the TNAP cDNA and eliminate the glycosylphosphatidylinositol (GPI)-anchor recognition sequence to produce a secreted epitope-tagged TNAP (setTNAP). The properties of GPI-anchored TNAP (gpiTNAP) and setTNAP were found comparable. After introducing each single hypophosphatasia mutation, the setTNAP and mutant TNAP cDNAs were expressed in COS-1 cells and the recombinant flagged enzymes were affinity purified. We characterized the kinetic behavior, inhibition, and heat stability properties of each mutant using the artificial substrate p-nitrophenylphosphate (pNPP) at pH 9.8. We also determined the ability of the mutants to metabolize two natural substrates of TNAP, that is, pyridoxal-5'-phosphate (PLP) and inorganic pyrophosphate (PPi), at physiological pH. Six of the mutant enzymes were completely devoid of catalytic activity (R54C, R54P, A94T, R206W, G317D, and V365I), and 10 others (A16V, A115V, A160T, A162T, E174K, E174G, D277A, E281K, D361V, and G439R) showed various levels of residual activity. The A160T substitution was found to decrease the catalytic efficiency of the mutant enzyme toward pNPP to retain normal activity toward PPi and to display increased activity toward PLP. The A162T substitution caused a considerable reduction in the pNPPase, PPiase, and PLPase activities of the mutant enzyme. The D277A mutant was found to maintain high catalytic efficiency toward pNPP as substrate but not against PLP or PPi. Three mutations ( E174G, E174K, and E281K) were found to retain normal or slightly subnormal catalytic efficiency toward pNPP and PPi but not against PLP. Because abnormalities in PLP metabolism have been shown to cause epileptic seizures in mice null for the TNAP gene, these kinetic data help explain the variable expressivity of epileptic seizures in hypophosphatasia patients.
Resumo:
Molecular chaperones perform folding assistance in newly synthesized polypeptides preventing aggregation processes, recovering proteins from aggregates, among other important cellular functions. Thus their study presents great biotechnological importance. The present work discusses the mining for chaperone-related sequences within the sugarcane EST genome project database, which resulted in approximately 300 different sequences. Since molecular chaperones are highly conserved in most organisms studied so far, the number of sequences related to these proteins in sugarcane was very similar to the number found in the Arabidopsis thaliana genome. The Hsp70 family was the main molecular chaperone system present in the sugarcane expressome. However, many other relevant molecular chaperones systems were also present. A digital RNA blot analysis showed that 5'ESTs from all molecular chaperones were found in every sugarcane library, despite their heterogeneous expression profiles. The results presented here suggest the importance of molecular chaperones to polypeptide metabolism in sugarcane cells, based on their abundance and variability. Finally, these data have being used to guide more in deep analysis, permitting the choice of specific targets to study. (c) 2006 Elsevier GmbH. All rights reserved.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
The Coleoptera order is the richest group among Metazoa, but its phylogenetics remains incompletely understood. Among Coleoptera, bioluminescence is found within the Elateroidea, but the evolution of this character remains a mystery. Mitochondrial DNA has been used extensively to reconstruct phylogenetic relationships, however, the evolution of a single gene does not always correspond to the species evolutionary history and the molecular marker choice is a key step in this type of analysis. To create a solid basis to better understand the evolutionary history of Coleoptera and its bioluminescence, we sequenced and comparatively analyzed the mitochondrial genome of the Brazilian luminescent click beetle Pyrophorus divergens (Coleoptera: Elateridae). © 2007 Elsevier B.V. All rights reserved.