983 resultados para Dna Identification
Resumo:
Epstein-Barr virus nuclear antigen (EBNA)-6 is essential for EBV-induced immortalization of primary human B-lymphocytes in vitro. Previous studies have shown that EBNA-6 acts as a transcriptional regulator of viral and cellular genes; however at present, few functional domains of the 140 kDa EBNA-6 protein have been completely characterized. There are five computer-predicted nuclear localization signals (NLS), four monopartite and one bipartite, present in the EBNA-6 amino acid sequence. To identify which of these NLS are functional, fusion proteins between green fluorescent protein and deletion constructs of EBNA-6 were expressed in HeLa cells, Each of the constructs containing at least one of the NLS was targeted to the nucleus of cells whereas a construct lacking all of the NLS was cytoplasmic. Site-directed mutation of these NLS demonstrated that only three of the NLS were functional, one at the N-terminal end (aa 72-80), one in the middle (aa 412-418) and one at the C-terminal end (aa 939-945) of the EBNA-6 protein.
Resumo:
The intestinal spirochaete Brachyspira pilosicoli causes colitis in a wide variety of host species. Little is known about the structure or protein constituents of the B. pilosicoli outer membrane (OM). To identify surface-exposed proteins in this species, membrane vesicles were isolated from B. pilosicoli strain 95-1000 cells by osmotic lysis in dH(2)O followed by isopycnic centrifugation in sucrose density gradients. The membrane vesicles were separated into a high-density fraction (HDMV; p = 1.18 g CM-3) and a low-density fraction (LDMV; rho=1.12 g cm(-3)). Both fractions were free of flagella and soluble protein contamination. LDMV contained predominantly OM markers (lipo-oligosaccharide and a 29 kDa B. pilosicoli OM protein) and was used as a source of antigens to produce mAbs. Five B. pilosicoli-specific mAbs reacting with proteins with molecular masses of 23, 24, 35, 61 and 79 kDa were characterized. The 23 kDa protein was only partially soluble in Triton X-114, whereas the 24 and 35 kDa proteins were enriched in the detergent phase, implying that they were integral membrane proteins or lipoproteins. All three proteins were localized to the B. pilosicoli OM by immunogold labelling using specific mAbs. The gene encoding the abundant, surface-exposed 23 kDa protein was identified by screening a B. pilosicoli 95-1000 genome library with the mAb and was expressed in Escherichia coli. Sequence analysis showed that it encoded a unique lipoprotein, designated BmpC. Recombinant BmpC partitioned predominantly in the OM fraction of E. coli strain SOLR. The mAb to BmpC was used to screen a collection of 13 genetically heterogeneous strains of B. pilosicoli isolated from five different host species. Interestingly, only strain 95-1000 was reactive with the mAb, indicating that either the surface-exposed epitope on BmpC is variable between strains or that the protein is restricted in its distribution within B. pilosicoli.
Resumo:
Cross-species comparative genomics is a powerful strategy for identifying functional regulatory elements within noncoding DNA. In this paper, comparative analysis of human and mouse intronic sequences in the breast cancer susceptibility gene (BRCA1) revealed two evolutionarily conserved noncoding sequences (CNS) in intron 2, 5 kb downstream of the core BRCA1 promoter. The functionality of these elements was examined using homologous-recombination-based mutagenesis of reporter gene-tagged cosmids incorporating these regions and flanking sequences from the BRCA1 locus. This showed that CNS-1 and CNS-2 have differential transcriptional regulatory activity in epithelial cell lines. Mutation of CNS-1 significantly reduced reporter gene expression to 30% of control levels. Conversely mutation of CNS-2 increased expression to 200% of control levels. Regulation is at the level of transcription and shows promoter specificity. Both elements also specifically bind nuclear proteins in vitro. These studies demonstrate that the combination of comparative genomics and functional analysis is a successful strategy to identify novel regulatory elements and provide the first direct evidence that conserved noncoding sequences in BRCA1 regulate gene expression. (c) 2005 Elsevier Inc. All rights reserved.
Resumo:
Genetic analysis in animals has been used for many applications, such as kinship analysis, for determining the sire of an offspring when a female has been exposed to multiple males, determining parentage when an animal switches offspring with another dam, extended lineage reconstruction, estimating inbreeding, identification in breed registries, and speciation. It now also is being used increasingly to characterize animal materials in forensic cases. As such, it is important to operate under a set of minimum guidelines that assures that all service providers have a template to follow for quality practices. None have been delineated for animal genetic identity testing. Based on the model for human DNA forensic analyses, a basic discussion of the issues and guidelines is provided for animal testing to include analytical practices, data evaluation, nomenclature, allele designation, statistics, validation, proficiency testing, lineage markers, casework files, and reporting. These should provide a basis for professional societies and/or working groups to establish more formalized recommendations.
Resumo:
Background: This paper describes SeqDoC, a simple, web-based tool to carry out direct comparison of ABI sequence chromatograms. This allows the rapid identification of single nucleotide polymorphisms (SNPs) and point mutations without the need to install or learn more complicated analysis software. Results: SeqDoC produces a subtracted trace showing differences between a reference and test chromatogram, and is optimised to emphasise those characteristic of single base changes. It automatically aligns sequences, and produces straightforward graphical output. The use of direct comparison of the sequence chromatograms means that artefacts introduced by automatic base-calling software are avoided. Homozygous and heterozygous substitutions and insertion/deletion events are all readily identified. SeqDoC successfully highlights nucleotide changes missed by the Staden package 'tracediff' program. Conclusion: SeqDoC is ideal for small-scale SNP identification, for identification of changes in random mutagenesis screens, and for verification of PCR amplification fidelity. Differences are highlighted, not interpreted, allowing the investigator to make the ultimate decision on the nature of the change.
Resumo:
In just over a decade, the use of molecular approaches for the recognition of parasites has become commonplace. For trematodes, the internal transcribed spacer region of ribosomal DNA (ITS rDNA) has become the default region of choice. Here, we review the findings of 63 studies that report ITS rDNA sequence data for about 155 digenean species from 19 families, and then review the levels of variation that have been reported and how the variation has been interpreted. Overall, complete ITS sequences (or ITS1 or ITS2 regions alone) usually distinguish trematode species clearly, including combinations for which morphology gives ambiguous results. Closely related species may have few base differences and in at least one convincing case the ITS2 sequences of two good species are identical. In some cases, the ITS1 region gives greater resolution than the ITS2 because of the presence of variable repeat units that are generally lacking in the ITS2. Intraspecific variation is usually low and frequently apparently absent. Information on geographical variation of digeneans is limited but at least some of the reported variation probably reflects the presence of multiple species. Despite the accepted dogma that concerted evolution makes the individual representative of the entire species, a significant number of studies have reported at least some intraspecific variation. The significance of such variation is difficult to assess a posteriori, but it seems likely that identification and sequencing errors account for some of it and failure to recognise separate species may also be significant. Some reported variation clearly requires further analysis. The use of a yardstick to determine when separate species should be recognised is flawed. Instead, we argue that consistent genetic differences that are associated with consistent morphological or biological traits should be considered the marker for separate species. We propose a generalised approach to the use of rDNA to distinguish trematode species.
Resumo:
A variety of morphological and molecular characters were compared for their ability to separate the three plant pathogenic species that comprise the genus Sclerotinia: Sclerotinia sclerotiorum, Sclerotinia minor and Sclerotinia trifoliorum. Restriction fragment length polymorphism ( RFLP) probes generated from cloned genomic DNA fragments of S. sclerotiorum were used for accurate species designation and to compare against other markers, before further use in population genetics and breeding studies. Other characters used for comparison included host species, sclerotial diameters, ascospore morphism and breeding type. Several RFLP probes, either singly or in combination, enabled clear separation of the Sclerotinia species. Sclerotial diameters remain a good criterion for separating S. minor from S. sclerotiorum and S. trifoliorum, but the host species criterion was inadequate for accurately differentiating the 3 species of Sclerotinia.
Resumo:
Craniofacial anomalies are a common feature of human congenital dysmorphology syndromes, suggesting that genes expressed in the developing face are likely to play a wider role in embryonic development. To facilitate the identification of genes involved in embryogenesis, we previously constructed an enriched cDNA library by subtracting adult mouse liver cDNA from that of embryonic day (E)10.5 mouse pharyngeal arch cDNA. From this library, 273 unique clones were sequenced and known proteins binned into functional categories in order to assess enrichment of the library (1). We have now selected 31 novel and poorly characterised genes from this library and present bioinformatic analysis to predict proteins encoded by these genes, and to detect evolutionary conservation. Of these genes 61% (19/31) showed restricted expression in the developing embryo, and a subset of these was chosen for further in silico characterisation as well as experimental determination of subcellular localisation based on transient transfection of predicted full-length coding sequences into mammalian cell lines. Where a human orthologue of these genes was detected, chromosomal localisation was determined relative to known loci for human congenital disease.
Resumo:
Gateway technology is a powerful system for converting a single entry vector into a wide variety of expression vectors. We expressed recombinant influenza matrix protein M1 (FMP), a potent antigen for cytotoxic T cells, using the Gateway vector pET-DEST42 containing the FMP cDNA, and purified the expressed FMP as a single 32 kDa recombinant protein. N-terminal and internal protein sequencing, however, showed that the recombinant FMP contained an extra 10 amino acids fused to the N-terminal of native FMP. Further investigation of the DNA sequence adjacent to the 5'-FMP cDNA indicated that the TTG in the attB1 site (30bp upstream of the ATG in the 5'-FMP cDNA) behaved as a dominant translation start site, resulting in a 10 amino acid extension of the recombinant FMP. Thus, it is possible that recombinant proteins produced by this Gateway vector contain unexpected vector-derived peptides, which may affect experimental outcomes. (c) 2006 Elsevier Inc. All rights reserved.
Resumo:
Little is known about the extent of allelic diversity of genes in the complex polyploid, sugarcane. Using sucrose phosphate synthase (SPS) Gene (SPS) Family III as an example, we have amplified and sequenced a 400 nt region from this gene from two sugarcane lines that are parents of a mapping population. Ten single nucleotide polymorphisms (SNPs) were identified within the 400 nt region of which seven were present in both lines. In the elite commercial cultivar Q165(A), 10 sequence haplotypes were identified, with four haplotypes recovered at 9% or greater frequency. Based on SNP presence, two clusters of haplotypes were observed. In IJ76-514, a Saccharum officinarum accession, 8 haplotypes were identified with 4 haplotypes recovered at 13% or greater frequency. Again, two clusters of haplotypes were observed. The results suggest that there may be two SPS Gene Family III genes per genome in sugarcane, each with different numbers of different alleles. This suggestion is supported by sequencing results in an elite parental sorghum line, 403463-2-1, in which 4 haplotypes, corresponding to two broad types, were also identified. Primers were designed to the sugarcane SNPs and screened over bulked DNA from high and low Sucrose-containing progeny from a cross between Q165(A) and IJ76-514. The SNP frequency did not vary in the two bulked DNA samples, suggesting that these SNPs from this SPS gene family are not associated with variation in sucrose content. Using an ecotilling approach, two of the SPS Gene Family III haplotypes were mapped to two different linkage groups in homology group 1 in Q165(A). Both haplotypes mapped near QTLs for increased sucrose content but were not themselves associated with any sugar-related trait.
Resumo:
We have successfully linked protein library screening directly with the identification of active proteins, without the need for individual purification, display technologies or physical linkage between the protein and its encoding sequence. By using 'MAX' randomization we have rapidly constructed 60 overlapping gene libraries that encode zinc finger proteins, randomized variously at the three principal DNA-contacting residues. Expression and screening of the libraries against five possible target DNA sequences generated data points covering a potential 40,000 individual interactions. Comparative analysis of the resulting data enabled direct identification of active proteins. Accuracy of this library analysis methodology was confirmed by both in vitro and in vivo analyses of identified proteins to yield novel zinc finger proteins that bind to their target sequences with high affinity, as indicated by low nanomolar apparent dissociation constants.
Resumo:
Background: DNA-binding proteins play a pivotal role in various intra- and extra-cellular activities ranging from DNA replication to gene expression control. Identification of DNA-binding proteins is one of the major challenges in the field of genome annotation. There have been several computational methods proposed in the literature to deal with the DNA-binding protein identification. However, most of them can't provide an invaluable knowledge base for our understanding of DNA-protein interactions. Results: We firstly presented a new protein sequence encoding method called PSSM Distance Transformation, and then constructed a DNA-binding protein identification method (SVM-PSSM-DT) by combining PSSM Distance Transformation with support vector machine (SVM). First, the PSSM profiles are generated by using the PSI-BLAST program to search the non-redundant (NR) database. Next, the PSSM profiles are transformed into uniform numeric representations appropriately by distance transformation scheme. Lastly, the resulting uniform numeric representations are inputted into a SVM classifier for prediction. Thus whether a sequence can bind to DNA or not can be determined. In benchmark test on 525 DNA-binding and 550 non DNA-binding proteins using jackknife validation, the present model achieved an ACC of 79.96%, MCC of 0.622 and AUC of 86.50%. This performance is considerably better than most of the existing state-of-the-art predictive methods. When tested on a recently constructed independent dataset PDB186, SVM-PSSM-DT also achieved the best performance with ACC of 80.00%, MCC of 0.647 and AUC of 87.40%, and outperformed some existing state-of-the-art methods. Conclusions: The experiment results demonstrate that PSSM Distance Transformation is an available protein sequence encoding method and SVM-PSSM-DT is a useful tool for identifying the DNA-binding proteins. A user-friendly web-server of SVM-PSSM-DT was constructed, which is freely accessible to the public at the web-site on http://bioinformatics.hitsz.edu.cn/PSSM-DT/.