956 resultados para Genomic sequence database
Resumo:
In humans, a polymorphic gene encodes the drug-metabolizing enzyme NATI (arylamine N-acetyltransferase Type 1), which is widely expressed throughout the body. While the protein-coding region of NATI is contained within a single exon, examination of the human EST (expressed sequence tag) database at the NCBI revealed the presence of nine separate exons, eight of which were located in the 5'non-coding region of NATI. Differential splicing produced at least eight unique mRNA isoforms that could be grouped according to the location of the first exon, which suggested that NATI expression occurs from three alternative promoters. Using RT (reverse transcriptase)-PCR, we identified one major transcript in various epithelial cells derived from different tissues. In contrast, multiple transcripts were observed in blood-derived cell lines (CEM, THP-1 and Jurkat), with a novel variant, not identified in the EST database, found in CEM cells only. The major splice variant increased gene expression 9-11-fold in a luciferase reporter assay, while the other isoforrns were similar or slightly greater than the control. We examined the upstream region of the most active splice variant in a promoter-reporter assay, and isolated a 257 bp sequence that produced maximal promoter activity. This sequence lacked a TATA box, but contained a consensus Sp1 site and a CAAT box, as well as several other putative transcription-factor-binding sites. Cell-specific expression of the different NATI transcripts may contribute to the variation in NATI activity in vivo.
Resumo:
Recently, we identified a large number of ultraconserved (uc) sequences in noncoding regions of human, mouse, and rat genomes that appear to be essential for vertebrate and amniote ontogeny. Here, we used similar methods to identify ultraconserved genomic regions between the insect species Drosophila melanogaster and Drosophila pseudoobscura, as well as the more distantly related Anopheles gambiae. As with vertebrates, ultraconserved sequences in insects appear to Occur primarily in intergenic and intronic sequences, and at intron-exon junctions. The sequences are significantly associated with genes encoding developmental regulators and transcription factors, but are less frequent and are smaller in size than in vertebrates. The longest identical, nongapped orthologous match between the three genomes was found within the homothorax (hth) gene. This sequence spans an internal exon-intron junction, with the majority located within the intron, and is predicted to form a highly stable stem-loop RNA structure. Real-time quantitative PCR analysis of different hth splice isoforms and Northern blotting showed that the conserved element is associated with a high incidence of intron retention in hth pre-mRNA, suggesting that the conserved intronic element is critically important in the post-transcriptional regulation of hth expression in Diptera.
Resumo:
CyBase is a curated database and information source for backbone-cyclized proteins. The database incorporates naturally occurring cyclic proteins as well as synthetic derivatives, grafted analogues and acyclic permutants. The database provides a centralized repository of information on all aspects of cyclic protein biology and addresses issues pertaining to the management and searching of topologically circular sequences. The database is freely available at http://research.imb.uq.edu.au/cybase.
Resumo:
Replication of eukaryotic chromosomes initiates at multiple sites called replication origins. Replication origins are best understood in the budding yeast Saccharomyces cerevisiae, where several complementary studies have mapped their locations genome-wide. We have collated these datasets, taking account of the resolution of each study, to generate a single list of distinct origin sites. OriDB provides a web-based catalogue of these confirmed and predicted S.cerevisiae DNA replication origin sites. Each proposed or confirmed origin site appears as a record in OriDB, with each record comprising seven pages. These pages provide, in text and graphical formats, the following information: genomic location and chromosome context of the origin site; time of origin replication; DNA sequence of proposed or experimentally confirmed origin elements; free energy required to open the DNA duplex (stress-induced DNA duplex destabilization or SIDD); and phylogenetic conservation of sequence elements. In addition, OriDB encourages community submission of additional information for each origin site through a User Notes facility. Origin sites are linked to several external resources, including the Saccharomyces Genome Database (SGD) and relevant publications at PubMed. Finally, a Chromosome Viewer utility allows users to interactively generate graphical representations of DNA replication data genome-wide. OriDB is available at www.oridb.org.
Resumo:
The phytopathogenic fungus Moniliophthora perniciosa (Stahel) Aime & Philips-Mora, causal agent of witches' broom disease of cocoa, causes countless damage to cocoa production in Brazil. Molecular studies have attempted to identify genes that play important roles in fungal survival and virulence. In this study, sequences deposited in the M. perniciosa Genome Sequencing Project database were analyzed to identify potential biological targets. For the first time, the ergosterol biosynthetic pathway in M. perniciosa was studied and the lanosterol 14α-demethylase gene (ERG11) that encodes the main enzyme of this pathway and is a target for fungicides was cloned, characterized molecularly and its phylogeny analyzed. ERG11 genomic DNA and cDNA were characterized and sequence analysis of the ERG11 protein identified highly conserved domains typical of this enzyme, such as SRS1, SRS4, EXXR and the heme-binding region (HBR). Comparison of the protein sequences and phylogenetic analysis revealed that the M. perniciosa enzyme was most closely related to that of Coprinopsis cinerea.
Resumo:
The availaibilty of chloroplast genome (cpDNA) sequences of Atropa belladonna, Nicotiana sylvestris, N tabacum, N tomentosiformis, Solanum bulbocastanum, S lycopersicum and S tuberosum, which are Solanaceae species, allowed us to analyze the organization of cpSSRs in their genic and intergenic regions In general, the number of cpSSRs in cpDNA ranged from 161 in S tuberosum to 226 in N tabacum, and the number of intergenic cpSSRs was higher than genic cpSSRs The mononucleotide repeats were the most frequent in studied species, but we also identified di-, tri-, tetra-, penta- and hexanucleotide repeats Multiple alignments of all cpSSRs sequence from Solanaceae species made the identification of nucleotide variability possible and the phylogeny was estimated by maximum parsimony Our study showed that the plastome database can be exploited for phylogenetic analyses and biotechnological approaches
Resumo:
Background: High-throughput SNP genotyping has become an essential requirement for molecular breeding and population genomics studies in plant species. Large scale SNP developments have been reported for several mainstream crops. A growing interest now exists to expand the speed and resolution of genetic analysis to outbred species with highly heterozygous genomes. When nucleotide diversity is high, a refined diagnosis of the target SNP sequence context is needed to convert queried SNPs into high-quality genotypes using the Golden Gate Genotyping Technology (GGGT). This issue becomes exacerbated when attempting to transfer SNPs across species, a scarcely explored topic in plants, and likely to become significant for population genomics and inter specific breeding applications in less domesticated and less funded plant genera. Results: We have successfully developed the first set of 768 SNPs assayed by the GGGT for the highly heterozygous genome of Eucalyptus from a mixed Sanger/454 database with 1,164,695 ESTs and the preliminary 4.5X draft genome sequence for E. grandis. A systematic assessment of in silico SNP filtering requirements showed that stringent constraints on the SNP surrounding sequences have a significant impact on SNP genotyping performance and polymorphism. SNP assay success was high for the 288 SNPs selected with more rigorous in silico constraints; 93% of them provided high quality genotype calls and 71% of them were polymorphic in a diverse panel of 96 individuals of five different species. SNP reliability was high across nine Eucalyptus species belonging to three sections within subgenus Symphomyrtus and still satisfactory across species of two additional subgenera, although polymorphism declined as phylogenetic distance increased. Conclusions: This study indicates that the GGGT performs well both within and across species of Eucalyptus notwithstanding its nucleotide diversity >= 2%. The development of a much larger array of informative SNPs across multiple Eucalyptus species is feasible, although strongly dependent on having a representative and sufficiently deep collection of sequences from many individuals of each target species. A higher density SNP platform will be instrumental to undertake genome-wide phylogenetic and population genomics studies and to implement molecular breeding by Genomic Selection in Eucalyptus.
Resumo:
In Brazil, human T-lymphotropic virus type 2 (HTLV-2) is endemic in Amerindians and epidemic in intravenous drug users (IDUs). The long terminal repeat (LTR) is the most divergent genomic region of HTLV-2, therefore useful to characterize subtypes. Nucleotide sequence and restriction fragment length polymorphism (RFLP) analysis of LTR genomic segments of fourteen HTLV-2 strains isolated from HIV-infected patients of Londrina, Southern Brazil, were carried out. Molecular analysis disclosed that all HTLV-2 strains belonged to 2a subtype, and RFLP detected the presence of the a4, a5, and a6 subgroups according to Switzer's nomenclature. RFLP correlated with nucleotide sequence, and phylogenetic analysis clustered HTLV-2 sequences of IDUs into subgroups a5 and a6. HTLV-2 sequences from individuals of sexual risk factor clustered into the a4 subgroup. These results extend the knowledge of the genetic diversity of HTLV-2 circulating in Brazil and provide insights into HTLV-2 transmission and virus movement in this geographic area.
Resumo:
We present the genome sequences of a new clinical isolate of the important human pathogen, Aspergillus fumigatus, A1163, and two closely related but rarely pathogenic species, Neosartorya fischeri NRRL181 and Aspergillus clavatus NRRL1. Comparative genomic analysis of A1163 with the recently sequenced A. fumigatus isolate Af293 has identified core, variable and up to 2% unique genes in each genome. While the core genes are 99.8% identical at the nucleotide level, identity for variable genes can be as low 40%. The most divergent loci appear to contain heterokaryon incompatibility ( het) genes associated with fungal programmed cell death such as developmental regulator rosA. Cross-species comparison has revealed that 8.5%, 13.5% and 12.6%, respectively, of A. fumigatus, N. fischeri and A. clavatus genes are species-specific. These genes are significantly smaller in size than core genes, contain fewer exons and exhibit a subtelomeric bias. Most of them cluster together in 13 chromosomal islands, which are enriched for pseudogenes, transposons and other repetitive elements. At least 20% of A. fumigatus-specific genes appear to be functional and involved in carbohydrate and chitin catabolism, transport, detoxification, secondary metabolism and other functions that may facilitate the adaptation to heterogeneous environments such as soil or a mammalian host. Contrary to what was suggested previously, their origin cannot be attributed to horizontal gene transfer ( HGT), but instead is likely to involve duplication, diversification and differential gene loss (DDL). The role of duplication in the origin of lineage-specific genes is further underlined by the discovery of genomic islands that seem to function as designated ""gene dumps'' and, perhaps, simultaneously, as "" gene factories''.
Resumo:
Background: High-throughput molecular approaches for gene expression profiling, such as Serial Analysis of Gene Expression (SAGE), Massively Parallel Signature Sequencing (MPSS) or Sequencing-by-Synthesis (SBS) represent powerful techniques that provide global transcription profiles of different cell types through sequencing of short fragments of transcripts, denominated sequence tags. These techniques have improved our understanding about the relationships between these expression profiles and cellular phenotypes. Despite this, more reliable datasets are still necessary. In this work, we present a web-based tool named S3T: Score System for Sequence Tags, to index sequenced tags in accordance with their reliability. This is made through a series of evaluations based on a defined rule set. S3T allows the identification/selection of tags, considered more reliable for further gene expression analysis. Results: This methodology was applied to a public SAGE dataset. In order to compare data before and after filtering, a hierarchical clustering analysis was performed in samples from the same type of tissue, in distinct biological conditions, using these two datasets. Our results provide evidences suggesting that it is possible to find more congruous clusters after using S3T scoring system. Conclusion: These results substantiate the proposed application to generate more reliable data. This is a significant contribution for determination of global gene expression profiles. The library analysis with S3T is freely available at http://gdm.fmrp.usp.br/s3t/.S3T source code and datasets can also be downloaded from the aforementioned website.
Resumo:
Disruption or loss of tumor suppressor gene TP53 is implicated in the development or progression of almost all different types of human malignancies. Other members of the p53 family have been identified. One member, p73, not only shares a high degree of similarity with p53 in its primary sequence, but also has similar functions. Like p53, p73 can bind to DNA and activate transcription. Using PCR-SSCP and gene sequencing, we analyzed the TP53 and TP73 genes in a case of a grade III anaplastic astrocytoma that progressed to glioblastoma. We found a deletion of AAG at position 595-597 of TP53 (exon 6), resulting in the deletion of Glu 199 in the protein and a genomic polymorphism of TP73, identified as an A-to-G change, at position E8/+15 at intron 8 (IVS8-15A>G). The mutation found at exon 6 of the gene TP53 could be associated with the rapid tumoral progression found in this case, since the mutated p53 may inactivate the wild-type p53 and the p73 alpha protein, which was conserved here, leading to an increase in cellular instability.
Resumo:
Context. A sample of 27 sources, cataloged as pre-main sequence stars by the Pico dos Dias Survey (PDS), is analyzed to investigate a possible contamination by post-AGB stars. The far-infrared excess due to dust present in the circumstellar envelope is typical of both categories: young stars and objects that have already left the main sequence and are suffering severe mass loss. Aims. The two known post-AGB stars in our sample inspired us to seek for other very likely or possible post-AGB objects among PDS sources previously suggested to be Herbig Ae/Be stars, by revisiting the observational database of this sample. Methods. In a comparative study with well known post-AGBs, several characteristics were evaluated: (i) parameters related to the circumstellar emission; (ii) spatial distribution to verify the background contribution from dark clouds; (iii) spectral features; and (iv) optical and infrared colors. Results. These characteristics suggest that seven objects of the studied sample are very likely post-AGBs, five are possible post-AGBs, eight are unlikely post-AGBs, and the nature of seven objects remains unclear.
Resumo:
Background: Citrus canker is a disease that has severe economic impact on the citrus industry worldwide. There are three types of canker, called A, B, and C. The three types have different phenotypes and affect different citrus species. The causative agent for type A is Xanthomonas citri subsp. citri, whose genome sequence was made available in 2002. Xanthomonas fuscans subsp. aurantifolii strain B causes canker B and Xanthomonas fuscans subsp. aurantifolii strain C causes canker C. Results: We have sequenced the genomes of strains B and C to draft status. We have compared their genomic content to X. citri subsp. citri and to other Xanthomonas genomes, with special emphasis on type III secreted effector repertoires. In addition to pthA, already known to be present in all three citrus canker strains, two additional effector genes, xopE3 and xopAI, are also present in all three strains and are both located on the same putative genomic island. These two effector genes, along with one other effector-like gene in the same region, are thus good candidates for being pathogenicity factors on citrus. Numerous gene content differences also exist between the three cankers strains, which can be correlated with their different virulence and host range. Particular attention was placed on the analysis of genes involved in biofilm formation and quorum sensing, type IV secretion, flagellum synthesis and motility, lipopolysacharide synthesis, and on the gene xacPNP, which codes for a natriuretic protein. Conclusion: We have uncovered numerous commonalities and differences in gene content between the genomes of the pathogenic agents causing citrus canker A, B, and C and other Xanthomonas genomes. Molecular genetics can now be employed to determine the role of these genes in plant-microbe interactions. The gained knowledge will be instrumental for improving citrus canker control.
Resumo:
The seeds of Theobroma cacao (cacao) are the source of cocoa, the raw material for the multi-billion dollar chocolate industry. Cacao`s two most important traits are its unique seed storage triglyceride (cocoa butter) and the flavor of its fermented beans (chocolate). The genome of T. cacao is being sequenced, and to expand the utility of the genome sequence to the improvement of cacao, we are evaluating Theobroma grandiflorum, the closest economically important species of Theobroma for its potential use in a comparative genomic study. T. grandiflorum differs from cacao in important agronomic traits such as flavor of the fermented beans, disease resistance to witches` broom and abscission of mature fruits. By comparing genomic sequences and analyzing viable inter-specific hybrids, we hope to identify the key genes that regulate cacao`s most important traits. We have investigated the utility in T. grandiflorum of three types of markers (microsatellite markers, single-strand conformational polymorphism markers and single nucleotide polymorphism (SNP) markers) developed in cacao. Through sequencing of amplicons of 12 diverse individuals of both cacao and T. grandiflorum, we have identified new intra- and inter-specific SNPs. Two markers which had no overlap of alleles between the species were used to genotype putative inter-specific hybrid seedlings. Sequence conservation was significant and species-specific differences numerous enough to suggest that comparative genomics of T. grandiflorum and T. cacao will be useful in elucidating the genetic differences that lead to a variety of important agronomic trait differences.
Resumo:
Expressed sequence tags derived markers have a great potential to be used in functional map construction and QTL tagging. In the present work, sugarcane genomic probes and expressed sequence tags having homology to genes, mostly involved in carbohydrate metabolism were used in RFLP assays to identify putative QTLs as well as their epistatic interactions for fiber content, cane yield, pol and tones of sugar per hectare, at two crop cycles in a progeny derived from a bi-parental cross of sugarcane elite materials. A hundred and twenty marker trait associations were found, of which 26 at both crop cycle and 32 only at first ratoon cane. A sucrose synthase derived marker was associated with a putative QTL having a high negative effect on cane yield and also with a QTL having a positive effect on Pol at both crop cycles. Fifty digenic epistatic marker interactions were identified for the four traits evaluated. Of these, only two were observed at both crop cycles.