894 results for SEQUENCE DATABASES
Abstract:
SUMMARY Genomic imprinting is an epigenetic mechanism of transcriptional regulation that ensures restriction of expression of a subset of mammalian genes to a single parental allele. The best studied example of imprinted gene regulation is the Igf2/H19 locus, which is also the most commonly altered by loss of imprinting (LOI) in cancer. LOI is associated with numerous hereditary diseases and several childhood and adult cancers. Differential expression of reciprocal H19 and Igf2 alleles in somatic cells depends on the methylation status of the imprinting control region (ICR), which regulates binding of CTCF, a ubiquitously expressed 11-zinc-finger protein that binds specifically to the non-methylated maternal ICR and thereby attenuates expression of Igf2, while it does not bind to the methylated paternal ICR, which enables Igf2 expression. Initial ICR methylation occurs during gametogenesis by an as yet unknown mechanism. The accepted hypothesis is that differential maternal and paternal DNA methylation depends on germ-line-specific proteins. Our laboratory identified a novel 11-zinc-finger protein, CTCF-T (also known as CTCFL and BORIS), that is uniquely expressed in the male germ line and is highly homologous to CTCF within its zinc-finger region. The amino-acid sequences flanking the zinc-finger regions of CTCF and CTCF-T have widely diverged, suggesting that although they could bind to the same DNA targets (ICRs), they are likely to have different functions. Interestingly, expression of CTCF-T and CTCF is mutually exclusive; CTCF-T-positive (CTCF-negative) cells occur in the stage of spermatogenesis that coincides with epigenetic reprogramming, including de novo DNA methylation. In our study we demonstrate the role that CTCF-T plays in genomic imprinting. Here we show that CTCF-T binds in vivo to the ICRs of the Igf2/H19 and Dlk/Gtl2 imprinted genes.
In addition, we identified two novel proteins interacting with CTCF-T: a protein arginine methyltransferase, PRMT7, and an arginine-rich histone H2A variant that we named trH2A. These interactions were confirmed, and both proteins were shown to interact with the amino-terminal region of CTCF-T. Additionally, we show interaction of the amino-terminal region of CTCF-T with histones H1, H2A and H3. These results suggest that CTCF-T is a sequence-specific DNA (ICR) binding protein that associates with histones and recruits PRMT7. Interestingly, PRMT7 has histone-methyltransferase activity. It has been shown that histone methylation can mark chromatin regions, thereby directing DNA methylation; thus, our hypothesis is that the CTCF-T protein scaffold directs PRMT7 to methylate histone(s) assembled on ICRs, which marks chromatin for the recruitment of the de novo DNA methyltransferases to methylate DNA. To test this hypothesis, we developed an in vivo DNA-methylation assay using Xenopus laevis oocytes, in which the H19 ICR and different expression cDNAs, including CTCF-T, PRMT7 and the de novo DNA methyltransferases (Dnmt3a, Dnmt3b and Dnmt3L), are microinjected into the nucleus. The methylation status of CpGs within the H19 ICR was analysed 48 or 72 hours after injection. Here we demonstrate that CpGs in the ICR are methylated in the presence of both CTCF-T and PRMT7, while control oocytes injected only with ICR did not show any methylation. Additionally, we showed for the first time that Dnmt3L is crucial for the establishment of the imprinting marks on the H19 ICR. Moreover, we confirmed that Dnmt3a and Dnmt3b activities are complementary. Our data indicate that all three Dnmt3s are important for efficient de novo DNA methylation. In conclusion, we propose a mechanism for the establishment of de novo imprinting marks during spermatogenesis: the CTCF-T/PRMT7 protein complex directs histone methylation, leading to sequence-specific de novo DNA methylation of the H19 ICR.
SUMMARY (French version) Parental genomic imprinting is an epigenetic mechanism of transcriptional regulation that results in differential expression of the two alleles of certain genes according to their parental origin. The best characterized example of genes subject to parental genomic imprinting is the Igf2/H19 locus, which is also the one most frequently altered by loss of imprinting (LOI) in cancers. Loss of imprinting is also associated with numerous hereditary diseases and with many childhood and adult cancers. In somatic cells, the differential expression of the reciprocal H19 and Igf2 alleles is under the control of an imprinting control region (ICR). Methylation of this ICR regulates binding of the eleven-zinc-finger protein CTCF, which binds specifically to the unmethylated maternal ICR, thereby attenuating Igf2 expression, whereas it does not bind to the methylated paternal ICR. The mechanism underlying the initial methylation of the ICR during gametogenesis has not yet been elucidated. The current hypothesis proposes that the difference in methylation between maternal and paternal DNA results from the expression of germ-line-specific proteins. Our laboratory recently identified a novel eleven-zinc-finger protein, CTCF-T (also called CTCFL and BORIS), which is expressed only in male germ cells and whose zinc-finger region is highly homologous to that of CTCF. The amino-acid sequence on either side of this region is, by contrast, highly divergent, which implies that CTCF-T probably binds the same target DNA as CTCF but has different functions.
In addition, expression of CTCF-T and CTCF is mutually exclusive; expression of CTCF-T (CTCF-T-positive, CTCF-negative cells) during spermatogenesis coincides with epigenetic reprogramming, notably de novo DNA methylation. The present study demonstrates the essential role played by CTCF-T in the acquisition of parental genomic imprinting. We show here that CTCF-T associates in vivo with the ICRs of the Igf2/H19 and Dlk/Gtl2 loci. We also identified two novel proteins that interact with CTCF-T: the protein arginine methyltransferase PRMT7 and an arginine-rich variant of histone H2A, which we named trH2A. These interactions were analysed in more detail and confirm that both proteins associate with the N-terminal region of CTCF-T. We also report an interaction of the N-terminal region of CTCF-T with histones H1, H2A and H3. These results suggest that CTCF-T is a protein that binds specifically to ICRs, associates with different histones and recruits PRMT7. PRMT7 has methyltransferase activity towards histones. Histone methylation has been shown to mark certain regions of chromatin, thereby directing DNA methylation. Our hypothesis is therefore as follows: CTCF-T serves as a scaffold that directs histone methylation by PRMT7 at ICRs, which helps mark the chromatin for recruitment of the de novo methyltransferases that methylate the DNA. To validate this hypothesis, we developed an in vivo DNA-methylation system in Xenopus laevis oocytes, into the nuclei of which we microinjected the H19 ICR together with different expression vectors for CTCF-T, PRMT7 and the de novo methyltransferases (Dnmt3a, Dnmt3b and Dnmt3L).
Methylated CpGs in the H19 ICR were analysed 48 and 72 hours after injection. This technique allowed us to demonstrate that CpGs in the ICR are methylated in the presence of CTCF-T and PRMT7, whereas controls injected with the ICR alone show no sign of methylation. In addition, we demonstrate for the first time that Dnmt3L is decisive for the establishment of parental genomic imprinting at the H19 ICR. We also confirm that the methyltransferase activities of Dnmt3a and Dnmt3b are complementary. Our data indicate that all three Dnmt3 proteins are involved in DNA methylation. In conclusion, we propose a mechanism responsible for establishing new genomic imprints during spermatogenesis: the CTCF-T/PRMT7 protein complex directs histone methylation, leading to de novo DNA methylation at the H19 locus.
Abstract:
Plant-parasitic nematodes are major agricultural pests worldwide and novel approaches to control them are sorely needed. We report the draft genome sequence of the root-knot nematode Meloidogyne incognita, a biotrophic parasite of many crops, including tomato, cotton and coffee. Most of the assembled sequence of this asexually reproducing nematode, totaling 86 Mb, exists in pairs of homologous but divergent segments. This suggests that ancient allelic regions in M. incognita are evolving toward effective haploidy, permitting new mechanisms of adaptation. The number and diversity of plant cell wall-degrading enzymes in M. incognita is unprecedented in any animal for which a genome sequence is available, and may derive from multiple horizontal gene transfers from bacterial sources. Our results provide insights into the adaptations required by metazoans to successfully parasitize immunocompetent plants, and open the way for discovering new antiparasitic strategies.
Abstract:
The P126 protein, a parasitophorous vacuole antigen of Plasmodium falciparum, has been shown to induce protective immunity in Saimiri and Aotus monkeys. In the present work we investigated its immunogenicity. Our results suggest that the N-terminus of P126 is poorly immunogenic and that the antibody response against P126 could be under MHC-restricted control in C57BL/6 (H-2b) mice, which could be problematic for the use of P126 in a vaccine program. However, we observed that a synthetic peptide copying the 6 octapeptide repeat corresponding to the N-terminus of P126 induces an antibody response to the native molecule in C57BL/6 non-responder mice. Moreover, the vaccine-P126 recombinant induced antibodies against the N-terminus of the molecule in rabbits, while the unprocessed P126 did not.
Abstract:
BACKGROUND: The availability of the P. falciparum genome has led to novel ways to identify potential vaccine candidates. A new approach for antigen discovery based on the bioinformatic selection of heptad repeat motifs corresponding to alpha-helical coiled coil structures yielded promising results. To clarify the relationship between the coiled coil motifs and their sequence conservation, we assessed the extent of polymorphism in putative alpha-helical coiled coil domains in culture strains, in natural populations and in the single nucleotide polymorphism data available at PlasmoDB. METHODOLOGY/PRINCIPAL FINDINGS: 14 alpha-helical coiled coil domains were selected based on preclinical experimental evaluation. They were tested by PCR amplification and sequencing of different P. falciparum culture strains and field isolates. We found that only 3 out of 14 alpha-helical coiled coils showed point mutations and/or length polymorphisms. Based on promising immunological results, 5 of these peptides were selected for further analysis. Direct sequencing of field samples from Papua New Guinea and Tanzania showed that 3 out of these 5 peptides were completely conserved. An in silico analysis of polymorphism was performed for all 166 putative alpha-helical coiled coil domains originally identified in the P. falciparum genome. We found that 82% (137/166) of these peptides were conserved, and for only one peptide did the detected SNPs substantially decrease the probability score for alpha-helical coiled coil formation. More SNPs were found in arrays of almost perfect tandem repeats. In summary, the coiled coil structure prediction was rarely modified by SNPs. The analysis revealed a number of peptides with strictly conserved alpha-helical coiled coil motifs. CONCLUSION/SIGNIFICANCE: We conclude that the selection of alpha-helical coiled coil structural motifs is a valuable approach to identify potential vaccine targets showing a high degree of conservation.
Abstract:
The recent advances in sequencing technologies have given all microbiology laboratories access to whole genome sequencing. Provided that tools for the automated analysis of sequence data and databases for associated meta-data are developed, whole genome sequencing will become a routine tool for large clinical microbiology laboratories. Indeed, the continuing reduction in sequencing costs and the shortening of the 'time to result' make it an attractive strategy in both research and diagnostics. Here, we review how high-throughput sequencing is revolutionizing clinical microbiology and the promise that it still holds. We discuss major applications, which include: (i) identification of target DNA sequences and antigens to rapidly develop diagnostic tools; (ii) precise strain identification for epidemiological typing and pathogen monitoring during outbreaks; and (iii) investigation of strain properties, such as the presence of antibiotic resistance or virulence factors. In addition, recent developments in comparative metagenomics and single-cell sequencing offer the prospect of a better understanding of complex microbial communities at the global and individual levels, providing a new perspective for understanding host-pathogen interactions. Being a high-resolution tool, high-throughput sequencing will increasingly influence diagnostics, epidemiology, risk management, and patient care.
Abstract:
BACKGROUND: Superinfection with drug-resistant HIV strains could potentially contribute to compromised therapy in patients initially infected with drug-sensitive virus and receiving antiretroviral therapy. To investigate the importance of this potential route to drug resistance, we developed a bioinformatics pipeline to detect superinfection from routinely collected genotyping data, and assessed whether superinfection contributed to increased drug resistance in a large European cohort of viremic, drug-treated patients. METHODS: We used sequence data from routine genotypic tests spanning the protease and partial reverse transcriptase regions in the Virolab and EuResist databases, which collated data from five European countries. Superinfection was indicated when sequences of a patient failed to cluster together in phylogenetic trees constructed with selected sets of control sequences. A subset of the indicated cases was validated by re-sequencing pol and env regions from the original samples. RESULTS: 4425 patients had at least two sequences in the database, with a total of 13816 distinct sequence entries (of which 86% belonged to subtype B). We identified 107 patients with phylogenetic evidence for superinfection. In 14 of these cases, we analyzed newly amplified sequences from the original samples for validation purposes: only 2 cases were verified as superinfections in the repeated analyses; the other 12 cases turned out to involve sample or sequence misidentification. Resistance to drugs used at the time of strain replacement did not change in these two patients. A third case could not be validated by re-sequencing, but was supported as superinfection by an intermediate sequence with a high degenerate base pair count within the time frame of strain switching. Drug resistance increased in this single patient.
CONCLUSIONS: Routine genotyping data are informative for the detection of HIV superinfection; however, most cases of non-monophyletic clustering in patient phylogenies arise from sample or sequence mix-up rather than from superinfection, which emphasizes the importance of validation. Non-transient superinfection was rare in our mainly treatment-experienced cohort, and we found a single case of possible transmitted drug resistance by this route. We therefore conclude that in our large cohort, superinfection with drug-resistant HIV did not compromise the efficiency of antiretroviral treatment.
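The screening criterion described above (a patient's sequences failing to cluster together in a phylogeny) can be illustrated with a minimal sketch. This is not the authors' pipeline: the nested-tuple tree encoding, the function names and the sample labels are all assumptions made for illustration, and the tree is assumed to be rooted and already built by some external phylogenetics tool.

```python
from typing import Dict, FrozenSet, Iterable, List, Optional, Tuple, Union

# Hypothetical encoding: a leaf is a sequence label, an internal node is a tuple of subtrees.
Tree = Union[str, tuple]


def clades(tree: Tree) -> Tuple[FrozenSet[str], List[FrozenSet[str]]]:
    """Return the leaf set of `tree` and the leaf sets of all of its subtrees."""
    if isinstance(tree, str):
        leaf = frozenset([tree])
        return leaf, [leaf]
    leaves: FrozenSet[str] = frozenset()
    found: List[FrozenSet[str]] = []
    for child in tree:
        child_leaves, child_clades = clades(child)
        leaves |= child_leaves
        found.extend(child_clades)
    found.append(leaves)
    return leaves, found


def is_monophyletic(tree: Tree, labels: Iterable[str]) -> bool:
    """True if `labels` are exactly the leaves of one subtree of the rooted tree."""
    _, found = clades(tree)
    return frozenset(labels) in found


def flag_candidates(tree: Tree, patient_of: Dict[str, Optional[str]]) -> List[str]:
    """List patients with two or more sequences that do not form a single clade.

    `patient_of` maps each leaf label to a patient id; control sequences map to None.
    """
    by_patient: Dict[str, set] = {}
    for leaf, patient in patient_of.items():
        if patient is not None:
            by_patient.setdefault(patient, set()).add(leaf)
    return sorted(
        patient
        for patient, leaves in by_patient.items()
        if len(leaves) >= 2 and not is_monophyletic(tree, leaves)
    )
```

A flagged patient is only a candidate. As the abstract stresses, most non-monophyletic cases turned out to be sample or sequence mix-ups, so any such flag would still need validation by re-sequencing.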
Abstract:
We have initiated a gene discovery program in Schistosoma mansoni based on the technique of Expressed Sequence Tags (ESTs), i.e. partial sequences of cDNAs obtained from single passes in automatic DNA sequencers. ESTs can be used to identify genes on the basis of their homology with sequences from other species deposited in DNA or protein databases. Transcripts with sequences without matches in the databases may represent novel parasite-specific genes. This approach has proven to be very efficient, and in less than two years a broad range of novel genes has already been ascertained, more than doubling the number of known S. mansoni genes.
Abstract:
The Eukaryotic Promoter Database (EPD) is an annotated non-redundant collection of eukaryotic POL II promoters for which the transcription start site has been determined experimentally. Access to promoter sequences is provided by pointers to positions in nucleotide sequence entries. The annotation part of an entry includes a description of the initiation site mapping data, exhaustive cross-references to the EMBL nucleotide sequence database, SWISS-PROT, TRANSFAC and other databases, as well as bibliographic references. EPD is structured in a way that facilitates dynamic extraction of biologically meaningful promoter subsets for comparative sequence analysis. WWW-based interfaces have been developed that enable the user to view EPD entries in different formats, to select and extract promoter sequences according to a variety of criteria, and to navigate to related databases exploiting different cross-references. The EPD web site also features yearly updated base frequency matrices for major eukaryotic promoter elements. EPD can be accessed at http://www.epd.isb-sib.ch
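Two ideas in this abstract, promoters stored as pointers into nucleotide sequence entries and base frequency matrices for promoter elements, can be sketched as follows. This is an illustrative sketch only and does not reproduce EPD's actual entry format or software: the function names, the 1-based coordinate convention and the strand encoding are assumptions.

```python
from collections import Counter
from typing import Dict, List, Sequence

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def promoter_window(entry_seq: str, tss: int, strand: str,
                    upstream: int, downstream: int) -> str:
    """Extract a promoter window from a sequence entry.

    `tss` is the 1-based position of the transcription start site in the entry
    (assumed convention). The window holds `upstream` bases before the TSS and
    `downstream` bases starting at the TSS, read in the direction of transcription.
    """
    if strand == "+":
        start, end = tss - 1 - upstream, tss - 1 + downstream
    elif strand == "-":
        start, end = tss - downstream, tss + upstream
    else:
        raise ValueError("strand must be '+' or '-'")
    if start < 0 or end > len(entry_seq):
        raise ValueError("window extends beyond the sequence entry")
    window = entry_seq[start:end]
    if strand == "-":
        # Reverse-complement so the window reads 5' to 3' on the transcribed strand.
        window = window.translate(_COMPLEMENT)[::-1]
    return window


def base_frequency_matrix(windows: Sequence[str]) -> List[Dict[str, float]]:
    """Per-position relative frequencies of A, C, G, T over equal-length windows."""
    if not windows:
        raise ValueError("at least one window is required")
    length = len(windows[0])
    if any(len(w) != length for w in windows):
        raise ValueError("all windows must have the same length")
    matrix = []
    for i in range(length):
        counts = Counter(w[i] for w in windows)
        matrix.append({base: counts.get(base, 0) / len(windows) for base in "ACGT"})
    return matrix
```

Storing only a pointer (entry, position, strand) keeps the promoter collection small and lets the window size be chosen at extraction time, which fits the abstract's emphasis on dynamic extraction of promoter subsets.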
Abstract:
Signature databases are vital tools for identifying distant relationships in novel sequences and hence for inferring protein function. InterPro is an integrated documentation resource for protein families, domains and functional sites, which amalgamates the efforts of the PROSITE, PRINTS, Pfam and ProDom database projects. Each InterPro entry includes a functional description, annotation, literature references and links back to the relevant member database(s). Release 2.0 of InterPro (October 2000) contains over 3000 entries, representing families, domains, repeats and sites of post-translational modification encoded by a total of 6804 different regular expressions, profiles, fingerprints and Hidden Markov Models. Each InterPro entry lists all the matches against SWISS-PROT and TrEMBL (more than 1,000,000 hits from 462,500 proteins in SWISS-PROT and TrEMBL). The database is accessible for text- and sequence-based searches at http://www.ebi.ac.uk/interpro/. Questions can be emailed to interhelp@ebi.ac.uk.
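Of the signature types listed above, regular expressions are the simplest to illustrate. The sketch below converts a pattern written in PROSITE pattern syntax into a Python regular expression and scans a protein sequence with it. It is not InterPro's own software: the function names are hypothetical, only the common syntax elements are handled (x, [..], {..}, repeat counts, and the < and > anchors), and the test sequences are made up.

```python
import re
from typing import List, Tuple


def prosite_to_regex(pattern: str) -> str:
    """Translate a PROSITE-style pattern such as 'N-{P}-[ST]-{P}.' into a Python regex."""
    parts = []
    for element in pattern.strip().rstrip(".").split("-"):
        prefix = suffix = ""
        if element.startswith("<"):      # anchored at the N-terminus
            prefix, element = "^", element[1:]
        if element.endswith(">"):        # anchored at the C-terminus
            suffix, element = "$", element[:-1]
        match = re.fullmatch(r"(.+?)(?:\((\d+)(?:,(\d+))?\))?", element)
        if match is None:
            raise ValueError(f"empty pattern element in {pattern!r}")
        core, low, high = match.groups()
        if core == "x":                  # any residue
            core_re = "."
        elif core.startswith("{") and core.endswith("}"):   # excluded residues
            core_re = "[^" + core[1:-1] + "]"
        elif core.startswith("[") and core.endswith("]"):   # allowed residues
            core_re = core
        elif len(core) == 1 and core.isalpha():              # one fixed residue
            core_re = core
        else:
            raise ValueError(f"unsupported pattern element: {element!r}")
        if low:
            core_re += "{" + low + ("," + high if high else "") + "}"
        parts.append(prefix + core_re + suffix)
    return "".join(parts)


def find_sites(pattern: str, sequence: str) -> List[Tuple[int, str]]:
    """Return (1-based start, matched text) for every match, including overlapping ones."""
    regex = re.compile("(?=(" + prosite_to_regex(pattern) + "))")
    return [(m.start() + 1, m.group(1)) for m in regex.finditer(sequence)]
```

For example, `find_sites("N-{P}-[ST]-{P}.", "MKNGSAANPSL")` reports the site starting at residue 3 and skips the asparagine at residue 8, which is followed by a proline. Profiles, fingerprints and Hidden Markov Models need scoring models rather than exact matching and are outside the scope of this sketch.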
Update of the Gene Discovery Program in Schistosoma mansoni with the Expressed Sequence Tag Approach
Abstract:
Continuing the Schistosoma mansoni Genome Project, 363 new templates were sequenced, generating 205 more ESTs corresponding to 91 genes. Seventy-four of these genes (81%) had not previously been described in S. mansoni. Among the newly discovered genes there are several of significant biological interest, such as synaptophysin, NIFs-like and rho-GDP dissociation inhibitor.
Abstract:
Random single-pass sequencing of cDNA fragments, also known as generation of Expressed Sequence Tags (ESTs), has been highly successful in the study of the gene content of higher organisms, and forms an integral part of most genome projects, with the objective of identifying new genes and targets for disease control and prevention and of generating mapping probes. In the Trypanosoma cruzi genome project, EST sequencing has also been a starting point, and here we report data on the first 797 sequences obtained, partly from a CL Brener epimastigote non-normalized library and partly from a normalized library. Only around 30% of the sequences obtained showed similarity with entries in the GenBank and dbEST databases, half of which matched sequences already reported for T. cruzi.
Abstract:
The detection of latent fingermarks on thermal papers proves to be particularly challenging because the application of conventional detection techniques may turn the sample dark grey or black, thus preventing the observation of fingermarks. Various approaches aiming at avoiding or solving this problem have been suggested. However, given the many proposals available in the literature, it becomes difficult to choose the most advantageous method and to decide which processing sequence should be followed when dealing with a thermal paper. In this study, 19 detection techniques adapted to the processing of thermal papers were assessed individually and then compared with each other. An updated processing sequence, assessed through a pseudo-operational test, is suggested.
Abstract:
The numbat has been reduced to two populations in Western Australia. To better understand the effects of range reduction on gene flow and genetic variation, and to address questions crucial for the species' management, we analysed mitochondrial DNA (mtDNA) sequences of free-ranging individuals and museum specimens. The results suggest recent connectivity between the remnant populations, although one of those may have lost significant amounts of genetic diversity during the recent population size reduction. We propose that for management purposes the remnant populations should be treated as a single historical lineage and that, subject to certain caveats, consideration should be given to population augmentation by translocation.