106 resultados para Genomic sequence database
em Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP)
Resumo:
Pulp softening is one of the most remarkable changes during ripening of papaya (Carica papaya) fruit and it is a major cause for post-harvest losses. Although cell wall catabolism has a major influence on papaya fruit, quality information on the gene products involved in this process is limited. A full-length polygalacturonase cDNA (cpPG) was isolated from papaya pulp and used to study gene expression and enzyme activity during normal and ethylene-induced ripening and after exposure of the fruit to 1-MCP. Northern-blot analysis demonstrated that cpPG transcription was strongly induced during ripening and was highly ethylene-dependent. The accumulation of cpPG transcript was paralleled by enzyme activity, and inversely correlated to the pulp firmness. Preliminary in silica analysis of the cpPG genomic sequence revealed the occurrence of putative regulatory motifs in the promoter region that may help to explain the effects of plant hormones and non-abiotic stresses on papaya fruit firmness. This newly isolated cpPG is an important candidate for functional characterization and manipulation to control the process of pulp softening during papaya ripening. (C) 2009 Elsevier Masson SAS. All rights reserved.
Resumo:
The mitochondrial genome of the chytrid Blastocladiella emersonii was sequenced and annotated, revealing the complete set of oxidative phosphorylation genes and tRNAs/rRNAs necessary for the translation process. Phylogenetic reconstructions reinforce the proposal of the new phylum Blastocladiomycota. Evidences of gene duplication due to inserted elements suggest shared susceptibility to gene invasion/exchange between chytrids and zygomycetes. The gene content of B. emersonii is very similar to Allomyces macrogynus but the content of intronic and changeable elements is much lower suggesting a stronger resistance to this kind of exchange. in addition, a total of 401 potential nuclear transcripts encoding mitochondrial proteins were obtained after B. emersonii EST database scanning using Saccharomyces cerevisiae, Homo sapiens and Arabidopsis thaliana data as probes and TargetP tool to find N-terminal mitochondrial signal in translated sequences. (c) 2008 Elsevier B.V. All rights reserved.
Resumo:
Xylella fastidiosa is a xylem-restricted plant pathogen that causes a range of diseases in several and important crops. Through comparative genomic sequence analysis many genes were identified and, among them, several potentially involved in plant-pathogen interaction. The experimental determination of the primary sequence of some markedly expressed proteins for X fastidiosa and the comparison with the nucleic acids sequence of genome identified one of them as being SCJ21.16 (XFa0032) gene product. The comparative analysis of this protein against SWISSPROT database, in special, resulted in similarity with a-hydroxynitrile lyase enzyme (HNL) from Arabidopsis thaliana, causing interest for being one of the most abundant proteins both in the whole cell extract as well as in the extracellular protein fraction. It is known that HNL enzyme are involved in a process termed ""cyanogenesis"", which catalyzes the dissociation of alpha-hydroxinitrile into carbonyle and HCN when plant tissue is damaged. Although the complete genome sequences of X.fastidiosa are available and the cyanogenesis process is well known, the biological role of this protein in this organism is not yet functionally characterized. In this study we presented the cloning, expression, characterization of recombinant HNL from X fastidiosa, and its probable function in the cellular metabolism. The successful cloning and heterologous expression in Escherichia coli resulted in a satisfactory amount of the recombinant HNL expressed in a soluble, and active form giving convenient access to pure enzyme for biochemical and structural studies. Finally, our results confirmed that the product of the gene XFa0032 can be positively assigned as FAD-independent HNLs. (C) 2009 Elsevier Ltd. All rights reserved.
Resumo:
An important topic in genomic sequence analysis is the identification of protein coding regions. In this context, several coding DNA model-independent methods based on the occurrence of specific patterns of nucleotides at coding regions have been proposed. Nonetheless, these methods have not been completely suitable due to their dependence on an empirically predefined window length required for a local analysis of a DNA region. We introduce a method based on a modified Gabor-wavelet transform (MGWT) for the identification of protein coding regions. This novel transform is tuned to analyze periodic signal components and presents the advantage of being independent of the window length. We compared the performance of the MGWT with other methods by using eukaryote data sets. The results show that MGWT outperforms all assessed model-independent methods with respect to identification accuracy. These results indicate that the source of at least part of the identification errors produced by the previous methods is the fixed working scale. The new method not only avoids this source of errors but also makes a tool available for detailed exploration of the nucleotide occurrence.
Resumo:
The genome of the most virulent among 22 Brazilian geographical isolates of Spodoptera frugiperda nucleopolyhedrovirus, isolate 19 (SfMNPV-1 9), was completely sequenced and shown to comprise 132 565 bp and 141 open reading frames (ORFs). A total of 11 ORFs with no homology to genes in the GenBank database were found. Of those, four had typical baculovirus; promoter motifs and polyadenylation sites. Computer-simulated restriction enzyme cleavage patterns of SfMNPV-1 9 were compared with published physical maps of other SfMNPV isolates. Differences were observed in terms of the restriction profiles and genome size. Comparison of SfMNPV-1 9 with the sequence of the SfMNPV isolate 3AP2 indicated that they differed due to a 1427 bp deletion, as well as by a series of smaller deletions and point mutations. The majority of genes of SfMNPV-1 9 were conserved in the closely related Spodoptera exigua NPV (SeMNPV) and Agrotis segetum NPV (AgseMNPV-A), but a few regions experienced major changes and rearrangements. Synthenic maps for the genomes of group 11 NPVs revealed that gene collinearity was observed only within certain clusters. Analysis of the dynamics of gene gain and loss along the phylogenetic tree of the NPVs showed that group 11 had only five defining genes and supported the hypothesis that these viruses form ten highly divergent ancient lineages. Crucially, more than 60% of the gene gain events followed a power-law relation to genetic distance among baculoviruses, indicative of temporal organization in the gene accretion process.
Resumo:
The availaibilty of chloroplast genome (cpDNA) sequences of Atropa belladonna, Nicotiana sylvestris, N tabacum, N tomentosiformis, Solanum bulbocastanum, S lycopersicum and S tuberosum, which are Solanaceae species, allowed us to analyze the organization of cpSSRs in their genic and intergenic regions In general, the number of cpSSRs in cpDNA ranged from 161 in S tuberosum to 226 in N tabacum, and the number of intergenic cpSSRs was higher than genic cpSSRs The mononucleotide repeats were the most frequent in studied species, but we also identified di-, tri-, tetra-, penta- and hexanucleotide repeats Multiple alignments of all cpSSRs sequence from Solanaceae species made the identification of nucleotide variability possible and the phylogeny was estimated by maximum parsimony Our study showed that the plastome database can be exploited for phylogenetic analyses and biotechnological approaches
Resumo:
Background: High-throughput SNP genotyping has become an essential requirement for molecular breeding and population genomics studies in plant species. Large scale SNP developments have been reported for several mainstream crops. A growing interest now exists to expand the speed and resolution of genetic analysis to outbred species with highly heterozygous genomes. When nucleotide diversity is high, a refined diagnosis of the target SNP sequence context is needed to convert queried SNPs into high-quality genotypes using the Golden Gate Genotyping Technology (GGGT). This issue becomes exacerbated when attempting to transfer SNPs across species, a scarcely explored topic in plants, and likely to become significant for population genomics and inter specific breeding applications in less domesticated and less funded plant genera. Results: We have successfully developed the first set of 768 SNPs assayed by the GGGT for the highly heterozygous genome of Eucalyptus from a mixed Sanger/454 database with 1,164,695 ESTs and the preliminary 4.5X draft genome sequence for E. grandis. A systematic assessment of in silico SNP filtering requirements showed that stringent constraints on the SNP surrounding sequences have a significant impact on SNP genotyping performance and polymorphism. SNP assay success was high for the 288 SNPs selected with more rigorous in silico constraints; 93% of them provided high quality genotype calls and 71% of them were polymorphic in a diverse panel of 96 individuals of five different species. SNP reliability was high across nine Eucalyptus species belonging to three sections within subgenus Symphomyrtus and still satisfactory across species of two additional subgenera, although polymorphism declined as phylogenetic distance increased. Conclusions: This study indicates that the GGGT performs well both within and across species of Eucalyptus notwithstanding its nucleotide diversity >= 2%. The development of a much larger array of informative SNPs across multiple Eucalyptus species is feasible, although strongly dependent on having a representative and sufficiently deep collection of sequences from many individuals of each target species. A higher density SNP platform will be instrumental to undertake genome-wide phylogenetic and population genomics studies and to implement molecular breeding by Genomic Selection in Eucalyptus.
Resumo:
In Brazil, human T-lymphotropic virus type 2 (HTLV-2) is endemic in Amerindians and epidemic in intravenous drug users (IDUs). The long terminal repeat (LTR) is the most divergent genomic region of HTLV-2, therefore useful to characterize subtypes. Nucleotide sequence and restriction fragment length polymorphism (RFLP) analysis of LTR genomic segments of fourteen HTLV-2 strains isolated from HIV-infected patients of Londrina, Southern Brazil, were carried out. Molecular analysis disclosed that all HTLV-2 strains belonged to 2a subtype, and RFLP detected the presence of the a4, a5, and a6 subgroups according to Switzer's nomenclature. RFLP correlated with nucleotide sequence, and phylogenetic analysis clustered HTLV-2 sequences of IDUs into subgroups a5 and a6. HTLV-2 sequences from individuals of sexual risk factor clustered into the a4 subgroup. These results extend the knowledge of the genetic diversity of HTLV-2 circulating in Brazil and provide insights into HTLV-2 transmission and virus movement in this geographic area.
Resumo:
We present the genome sequences of a new clinical isolate of the important human pathogen, Aspergillus fumigatus, A1163, and two closely related but rarely pathogenic species, Neosartorya fischeri NRRL181 and Aspergillus clavatus NRRL1. Comparative genomic analysis of A1163 with the recently sequenced A. fumigatus isolate Af293 has identified core, variable and up to 2% unique genes in each genome. While the core genes are 99.8% identical at the nucleotide level, identity for variable genes can be as low 40%. The most divergent loci appear to contain heterokaryon incompatibility ( het) genes associated with fungal programmed cell death such as developmental regulator rosA. Cross-species comparison has revealed that 8.5%, 13.5% and 12.6%, respectively, of A. fumigatus, N. fischeri and A. clavatus genes are species-specific. These genes are significantly smaller in size than core genes, contain fewer exons and exhibit a subtelomeric bias. Most of them cluster together in 13 chromosomal islands, which are enriched for pseudogenes, transposons and other repetitive elements. At least 20% of A. fumigatus-specific genes appear to be functional and involved in carbohydrate and chitin catabolism, transport, detoxification, secondary metabolism and other functions that may facilitate the adaptation to heterogeneous environments such as soil or a mammalian host. Contrary to what was suggested previously, their origin cannot be attributed to horizontal gene transfer ( HGT), but instead is likely to involve duplication, diversification and differential gene loss (DDL). The role of duplication in the origin of lineage-specific genes is further underlined by the discovery of genomic islands that seem to function as designated ""gene dumps'' and, perhaps, simultaneously, as "" gene factories''.
Resumo:
Background: High-throughput molecular approaches for gene expression profiling, such as Serial Analysis of Gene Expression (SAGE), Massively Parallel Signature Sequencing (MPSS) or Sequencing-by-Synthesis (SBS) represent powerful techniques that provide global transcription profiles of different cell types through sequencing of short fragments of transcripts, denominated sequence tags. These techniques have improved our understanding about the relationships between these expression profiles and cellular phenotypes. Despite this, more reliable datasets are still necessary. In this work, we present a web-based tool named S3T: Score System for Sequence Tags, to index sequenced tags in accordance with their reliability. This is made through a series of evaluations based on a defined rule set. S3T allows the identification/selection of tags, considered more reliable for further gene expression analysis. Results: This methodology was applied to a public SAGE dataset. In order to compare data before and after filtering, a hierarchical clustering analysis was performed in samples from the same type of tissue, in distinct biological conditions, using these two datasets. Our results provide evidences suggesting that it is possible to find more congruous clusters after using S3T scoring system. Conclusion: These results substantiate the proposed application to generate more reliable data. This is a significant contribution for determination of global gene expression profiles. The library analysis with S3T is freely available at http://gdm.fmrp.usp.br/s3t/.S3T source code and datasets can also be downloaded from the aforementioned website.
Resumo:
Disruption or loss of tumor suppressor gene TP53 is implicated in the development or progression of almost all different types of human malignancies. Other members of the p53 family have been identified. One member, p73, not only shares a high degree of similarity with p53 in its primary sequence, but also has similar functions. Like p53, p73 can bind to DNA and activate transcription. Using PCR-SSCP and gene sequencing, we analyzed the TP53 and TP73 genes in a case of a grade III anaplastic astrocytoma that progressed to glioblastoma. We found a deletion of AAG at position 595-597 of TP53 (exon 6), resulting in the deletion of Glu 199 in the protein and a genomic polymorphism of TP73, identified as an A-to-G change, at position E8/+15 at intron 8 (IVS8-15A>G). The mutation found at exon 6 of the gene TP53 could be associated with the rapid tumoral progression found in this case, since the mutated p53 may inactivate the wild-type p53 and the p73 alpha protein, which was conserved here, leading to an increase in cellular instability.
Resumo:
Context. A sample of 27 sources, cataloged as pre-main sequence stars by the Pico dos Dias Survey (PDS), is analyzed to investigate a possible contamination by post-AGB stars. The far-infrared excess due to dust present in the circumstellar envelope is typical of both categories: young stars and objects that have already left the main sequence and are suffering severe mass loss. Aims. The two known post-AGB stars in our sample inspired us to seek for other very likely or possible post-AGB objects among PDS sources previously suggested to be Herbig Ae/Be stars, by revisiting the observational database of this sample. Methods. In a comparative study with well known post-AGBs, several characteristics were evaluated: (i) parameters related to the circumstellar emission; (ii) spatial distribution to verify the background contribution from dark clouds; (iii) spectral features; and (iv) optical and infrared colors. Results. These characteristics suggest that seven objects of the studied sample are very likely post-AGBs, five are possible post-AGBs, eight are unlikely post-AGBs, and the nature of seven objects remains unclear.
Resumo:
Background: Citrus canker is a disease that has severe economic impact on the citrus industry worldwide. There are three types of canker, called A, B, and C. The three types have different phenotypes and affect different citrus species. The causative agent for type A is Xanthomonas citri subsp. citri, whose genome sequence was made available in 2002. Xanthomonas fuscans subsp. aurantifolii strain B causes canker B and Xanthomonas fuscans subsp. aurantifolii strain C causes canker C. Results: We have sequenced the genomes of strains B and C to draft status. We have compared their genomic content to X. citri subsp. citri and to other Xanthomonas genomes, with special emphasis on type III secreted effector repertoires. In addition to pthA, already known to be present in all three citrus canker strains, two additional effector genes, xopE3 and xopAI, are also present in all three strains and are both located on the same putative genomic island. These two effector genes, along with one other effector-like gene in the same region, are thus good candidates for being pathogenicity factors on citrus. Numerous gene content differences also exist between the three cankers strains, which can be correlated with their different virulence and host range. Particular attention was placed on the analysis of genes involved in biofilm formation and quorum sensing, type IV secretion, flagellum synthesis and motility, lipopolysacharide synthesis, and on the gene xacPNP, which codes for a natriuretic protein. Conclusion: We have uncovered numerous commonalities and differences in gene content between the genomes of the pathogenic agents causing citrus canker A, B, and C and other Xanthomonas genomes. Molecular genetics can now be employed to determine the role of these genes in plant-microbe interactions. The gained knowledge will be instrumental for improving citrus canker control.
Resumo:
The seeds of Theobroma cacao (cacao) are the source of cocoa, the raw material for the multi-billion dollar chocolate industry. Cacao`s two most important traits are its unique seed storage triglyceride (cocoa butter) and the flavor of its fermented beans (chocolate). The genome of T. cacao is being sequenced, and to expand the utility of the genome sequence to the improvement of cacao, we are evaluating Theobroma grandiflorum, the closest economically important species of Theobroma for its potential use in a comparative genomic study. T. grandiflorum differs from cacao in important agronomic traits such as flavor of the fermented beans, disease resistance to witches` broom and abscission of mature fruits. By comparing genomic sequences and analyzing viable inter-specific hybrids, we hope to identify the key genes that regulate cacao`s most important traits. We have investigated the utility in T. grandiflorum of three types of markers (microsatellite markers, single-strand conformational polymorphism markers and single nucleotide polymorphism (SNP) markers) developed in cacao. Through sequencing of amplicons of 12 diverse individuals of both cacao and T. grandiflorum, we have identified new intra- and inter-specific SNPs. Two markers which had no overlap of alleles between the species were used to genotype putative inter-specific hybrid seedlings. Sequence conservation was significant and species-specific differences numerous enough to suggest that comparative genomics of T. grandiflorum and T. cacao will be useful in elucidating the genetic differences that lead to a variety of important agronomic trait differences.
Resumo:
Expressed sequence tags derived markers have a great potential to be used in functional map construction and QTL tagging. In the present work, sugarcane genomic probes and expressed sequence tags having homology to genes, mostly involved in carbohydrate metabolism were used in RFLP assays to identify putative QTLs as well as their epistatic interactions for fiber content, cane yield, pol and tones of sugar per hectare, at two crop cycles in a progeny derived from a bi-parental cross of sugarcane elite materials. A hundred and twenty marker trait associations were found, of which 26 at both crop cycle and 32 only at first ratoon cane. A sucrose synthase derived marker was associated with a putative QTL having a high negative effect on cane yield and also with a QTL having a positive effect on Pol at both crop cycles. Fifty digenic epistatic marker interactions were identified for the four traits evaluated. Of these, only two were observed at both crop cycles.