14 resultados para methylated DNA protein cysteine methyltransferase
em Aston University Research Archive
Resumo:
Current approaches for purifying plasmids from bacterial production systems exploit the physiochemical properties of nucleic acids in non-specific capture systems. In this study, an affinity system for plasmid DNA (pDNA) purification has been developed utilizing the interaction between the lac operon (lacO) sequence contained in the pDNA and a 64mer synthetic peptide representing the DNA-binding domain of the lac repressor protein, LacI. Two plasmids were evaluated, the native pUC19 and pUC19 with dual lacO3/lacOs operators (pUC19lacO3/lacOs), where the lacOs operator is perfectly symmetrical. The DNA-protein affinity interaction was evaluated by surface plasmon resonance using a Biacore system. The affinity capture of DNA in a chromatography system was evaluated using LacI peptide that had been immobilized to Streamline™ adsorbent. The KD-values for double stranded DNA (dsDNA) fragments containing lacO1 and lacO3 and lacOs and lacO3 were 5.7 ± 0.3 × 10 -11 M and 4.1 ± 0.2 × 10-11 M respectively, which compare favorably with literature reports of 5 × 10-10 - 1 × 10-9 M for native laCO1 and 1-1.2 × 10-10 M for lacO1 in a saline buffer. Densitometric analysis of the gel bands from the affinity chromatography run clearly showed a significant preference for capture of the supercoiled fraction from the feed pDNA sample. The results indicate the feasibility of the affinity approach for pDNA capture and purification using native protein-DNA interaction. © 2006 Wiley Periodicals, Inc.
Resumo:
It has been recognised for some time that a full code of amino acid-based recognition of DNA sequences would be useful. Several approaches, which utilise small DNA binding motifs called zinc fingers, are presently employed. None of the current approaches successfully combine a combinatorial approach to the elucidation of a code with a single stage high throughput screening assay. The work outlined here describes the development of a model system for the study of DNA protein interactions and the development of a high throughput assay for detection of such interactions. A zinc finger protein was designed which will bind with high affinity and specificity to a known DNA sequence. For future work it is possible to mutate the region of the zinc finger responsible for the specificity of binding, in order to observe the effect on the DNA / protein interactions. The zinc finger protein was initially synthesised as a His tagged product. It was not possible however to develop a high throughput assay using the His tagged zinc finger protein. The gene encoding the zinc finger protein was altered and the protein synthesised as a Glutathione S-Transferase (GST) fusion product. A successful assay was developed using the GST protein and Scintillation Proximity Assay technology (Amersham Pharmacia Biotech). The scintillation proximity assay is a dynamic assay that allows the DNA protein interactions to be studied in "real time". This assay not only provides a high throughput method of screening zinc finger proteins for potential ligands but also allows the effect of addition of reagents or competitor ligands to be monitored.
Resumo:
Background: DNA-binding proteins play a pivotal role in various intra- and extra-cellular activities ranging from DNA replication to gene expression control. Identification of DNA-binding proteins is one of the major challenges in the field of genome annotation. There have been several computational methods proposed in the literature to deal with the DNA-binding protein identification. However, most of them can't provide an invaluable knowledge base for our understanding of DNA-protein interactions. Results: We firstly presented a new protein sequence encoding method called PSSM Distance Transformation, and then constructed a DNA-binding protein identification method (SVM-PSSM-DT) by combining PSSM Distance Transformation with support vector machine (SVM). First, the PSSM profiles are generated by using the PSI-BLAST program to search the non-redundant (NR) database. Next, the PSSM profiles are transformed into uniform numeric representations appropriately by distance transformation scheme. Lastly, the resulting uniform numeric representations are inputted into a SVM classifier for prediction. Thus whether a sequence can bind to DNA or not can be determined. In benchmark test on 525 DNA-binding and 550 non DNA-binding proteins using jackknife validation, the present model achieved an ACC of 79.96%, MCC of 0.622 and AUC of 86.50%. This performance is considerably better than most of the existing state-of-the-art predictive methods. When tested on a recently constructed independent dataset PDB186, SVM-PSSM-DT also achieved the best performance with ACC of 80.00%, MCC of 0.647 and AUC of 87.40%, and outperformed some existing state-of-the-art methods. Conclusions: The experiment results demonstrate that PSSM Distance Transformation is an available protein sequence encoding method and SVM-PSSM-DT is a useful tool for identifying the DNA-binding proteins. A user-friendly web-server of SVM-PSSM-DT was constructed, which is freely accessible to the public at the web-site on http://bioinformatics.hitsz.edu.cn/PSSM-DT/.
Resumo:
The imidazotetrazinones are clinically active antitumour agents, temozolomide currently proving successful in the treatment of melanomas and gliomas. The exact nature of the biological processes underlying response are as yet unclear.This thesis attempts to identify the cellular targets important to the cytotoxicity of imidazotetrazinones, to elucidate the pathways by which this damage leads to cell death, and to identify mechanisms by which tumour cells may circumvent this action. The levels of the DNA repair enzymes O6-alkylguanine-DNA-alkyltransferase (O6-AGAT) and 3-methyladenine-DNA-glycosylase (3MAG) have been examined in a range of murine and human cell lines with differential sensitivity to temozolomide. All the cell lines were proficient in 3MAG despite there being 40-fold difference in sensitivity to temozolomide. This suggests that while 3-methyladenine is a major product of temozolomide alkylation of DNA it is unlikely to be a cytotoxic lesion. In contrast, there was a 20-fold variation in O6-AGAT levels and the concentration of this repair enzyme correlated with variations in cytotoxicity. Furthermore, depletion of this enzyme in a resistant, O6-AGAT proficient cell line (Raji), by pre-treatment with the free base O6-methylguanine resulted in 54% sensitisation to the effects of temozolomide. These observations have been extended to 3 glioma cell lines; results that support the view that the cytotoxicity of temozolomide is related to alkylation at the O6-position of guanine and that resistance to this drug is determined by efficient repair of this lesion. It is clear, however, the other factors may influence tumour response since temozolomide showed little differential activity towards 3 established solid murine tumours in vivo, despite different tumour O6-AGAT levels. Unlike mitozolomide, temozolomide is incapable of cross-linking DNA and a mechanism by which O6-methylguanine may exert lethality is unclear. The cytotoxicity of the methyl group may be due to its disruption of DNA-protein interactions, or alternatively cell death may not be a direct result of the alkyl group itself, but manifested by DNA single-strand breaks. Enhanced alkaline elution rates were found for the DNA of Raji cells treated with temozolomide following alkyltransferase depletion, suggesting a relationship between O6-methylguanine and the induction single-strand breaks. Such breaks can activate poly(ADP-ribose) synthetase (ADPRT) an enzyme capable of rapid and lethal depletion of cellular NAD levels. However, at concentrations of temozolomlde relevant in vivo little change in adenine nucleotides was detected in cell lines, although this enzyme would appear important in modulating DNA repair since inhibition of ADPRT potentiated temozolomide cytotoxicity in Raji cells but not O6-AGAT deficient GM892A cells. Cell lines have been reported that are O6-AGAT deficient yet resistant to methylating agents. Thus, resistance to temozolomide may arise not only by removal of the methyl group from the O6-position of guanine, but also from another mechanism involving caffeine-sensitive post-replication repair or mismatch repair activity. A modification of the standard Maxam Gilbert sequencing technique was used to determine the sequence specificity of guanine-N7 alkylation. Temozolomide preferentially alkylated runs of guanines with the intensity of reaction increasing with the number of adjacent guanines in the DNA sequence. Comparable results were obtained with a polymerase-stop assay, although neither technique elucidates the sequence specificity of O6-guanine alkylation. The importance of such specificity to cytotoxicity is uncertain, although guanine-rich sequences are common to the promoter regions of oncogenes. Expression of a plasmid reporter gene under the control of the Ha-ras proto~oncogene promoter was inhibited by alkylation with temozolomide when transfected into cancer cell lines, However, this inhibition did not appear to be related to O6~guanine alkylation and therefore would seem unimportant to the chemotherapeutic activity of temozolomide.
Resumo:
S-glutathionylation occurs when reactive oxygen or nitrogen species react with protein-cysteine thiols. Glutaredoxin-1 (Glrx) is a cytosolic enzyme which enzymatically catalyses the reduction in S-glutathionylation, conferring reversible signalling function to proteins with redox-sensitive thiols. Glrx can regulate vascular hypertrophy and inflammation by regulating the activity of nuclear factor κB (NF-κB) and actin polymerization. Vascular endothelial growth factor (VEGF)-induced endothelial cell (EC) migration is inhibited by Glrx overexpression. In mice overexpressing Glrx, blood flow recovery, exercise function and capillary density were significantly attenuated after hindlimb ischaemia (HLI). Wnt5a and soluble Fms-like tyrosine kinase-1 (sFlt-1) were enhanced in the ischaemic-limb muscle and plasma respectively from Glrx transgenic (TG) mice. A Wnt5a/sFlt-1 pathway had been described in myeloid cells controlling retinal blood vessel development. Interestingly, a Wnt5a/sFlt-1 pathway was found also to play a role in EC to inhibit network formation. S-glutathionylation of NF-κB components inhibits its activation. Up-regulated Glrx stimulated the Wnt5a/sFlt-1 pathway through enhancing NF-κB signalling. These studies show a novel role for Glrx in post-ischaemic neovascularization, which could define a potential target for therapy of impaired angiogenesis in pathological conditions including diabetes.
Resumo:
DNA methylation appears to be involved in the regulation of gene expression. Transcriptionally inactive (silenced) genes normally contain a high proportion of 5-methyl-2'-deoxycytosine residues whereas transcriptionally active genes show much reduced levels. There appears good reason to believe that chemical agents capable of methylating 2'-deoxycytosine might affect gene expression and as a result of hypermethylating promoter regions of cytosine-guanine rich oncogenic sequences, cancer related genes may be silenced. This thesis describes the synthesis of a number of `electrophilic' S-methylsulphonium compounds and assesses their ability to act as molecules capable of methylating cytosine at position 5 and also considers their potential as cytotoxic agents. DNA is methylated in vivo by DNA methyltransferase utilising S-adenoxylmethionine as the methyl donor. This thesis addresses the theory that S-adenoxylmethionine may be replaced as the methyl donor for DNA methytransferase by other sulphonium compounds. S-[3H-methyl]methionine sulphonium iodide was synthesised and experiments to assess the ability of this compounds to transfer methyl groups to cytosine in the presence of DNA methyltransferase were unsuccessful. A proline residue adjacent to a cysteine residue has been identified to a highly conserved feature of the active site region of a large number of prokaryotic DNA methyltransferases. The thesis examines the possibility that short peptides containing the Pro-Cys fragment may be able to facilitate the alkylation of cytosine position 5 by sulphonium compounds. Peptides were synthesised up to 9 amino acids in length but none were shown to exhibit significant activity. Molecular modelling techniques, including Chem-X, Quanta, BIPED and protein structure prediction programs were used to assess any structural similarities that may exist between short peptides containing a Pro-Cys fragment and similar sequences present in proteins. A number of similar structural features were observed.
Resumo:
Proteins are susceptible to oxidation by reactive oxygen species, where the type of damage induced is characteristic of the denaturing species. The induction of protein carbonyls is a widely applied biomarker, arising from primary oxidative insult. However, when applied to complex biological and pathological conditions it can be subject to interference from lipid, carbohydrate and DNA oxidation products. More recently, interest has focused on the analysis of specific protein bound oxidised amino acids. Of the 22 amino acids, aromatic and sulphydryl containing residues have been regarded as being particularly susceptible to oxidative modification, with L-DOPA from tyrosine, ortho-tyrosine from phenylalanine; sulphoxides and disulphides from methionine and cysteine respectively; and kynurenines from tryptophan. Latterly, the identification of valine and leucine hydroxides, reduced from hydroperoxide intermediates, has been described and applied. In order to examine the nature of oxidative damage and protective efficacy of antioxidants the markers must be thoroughly evaluated for dosimetry in vitro following damage by specific radical species. Antioxidant protection against formation of the biomarker should be demonstrated in vitro. Quantification of biomarkers in proteins from normal subjects should be within the limits of detection of any analytical procedure. Further to this, the techniques for isolation and hydrolysis of specific proteins should demonstrate that in vitro oxidation is minimised. There is a need for the development of standards for quality assurance material to standardise procedures between laboratories. At present, antioxidant effects on protein oxidation in vivo are limited to animal studies, where dietary antioxidants have been reported to reduce dityrosine formation during rat exercise training. Two studies on humans have been reported last year. The further application of these methods to human studies is indicated, where the quality of the determinations will be enhanced through inter-laboratory validation.
Resumo:
A simple protein-DNA interaction analysis has been developed using a high-affinity/high-specificity zinc finger protein. In essence, purified protein samples are immobilized directly onto the surface of microplate wells, and fluorescently labeled DNA is added in solution. After incubation and washing, bound DNA is detected in a standard microplate reader. The minimum sensitivity of the assay is approximately 0.2 nM DNA. Since the detection of bound DNA is noninvasive and the protein-DNA interaction is not disrupted during detection, iterative readings may be taken from the same well, after successive alterations in interaction conditions, if required. In this respect, the assay may therefore be considered real time and permits appropriate interaction conditions to be determined quantitatively. The assay format is ideally suited to investigate the interactions of purified unlabeled DNA binding proteins in a high-throughput format.
Resumo:
A simple protein-DNA interaction analysis has been developed using both a high-affinity/high-specificity zinc finger protein and a low-specificity zinc finger protein with nonspecific DNA binding capability. The latter protein is designed to mimic background binding by proteins generated in randomized or shuffled gene libraries. In essence, DNA is immobilized onto the surface of microplate wells via streptavidin capture, and green fluorescent protein (GFP)-labeled protein is added in solution as part of a crude cell lysate or protein mixture. After incubation and washing, bound protein is detected in a standard microplate reader. The minimum sensitivity of the assay is approximately 0.4 nM protein. The assay format is ideally suited to investigate the interactions of DNA binding proteins from within crude cell extracts and/or mixtures of proteins that may be encountered in protein libraries generated by codon randomization or gene shuffling.
Resumo:
The affinity isolation of pre-purified plasmid DNA (pDNA) from model buffer solutions using native and poly(ethylene glycol) (PEG) derivatized zinc finger–GST (Glutathione-S-Transferase) fusion protein was examined in PEG–dextran (DEX) aqueous two-phase systems (ATPSs). In the absence of pDNA, partitioning of unbound PEGylated fusion protein into the PEG-rich phase was confirmed with 97.5% of the PEGylated fusion protein being detected in the PEG phase of a PEG 600–DEX 40 ATPS. This represents a 1322-fold increase in the protein partition coefficient in comparison to the non-PEGylated protein (Kc = 0.013). In the presence of pDNA containing a specific oligonucleotide recognition sequence, the zinc finger moiety of the PEGylated fusion protein bound to the plasmid and steered the complex to the PEG-rich phase. An increase in the proportion of pDNA that partitioned to the PEG-rich phase was observed as the concentration of PEGylated fusion protein was increased. Partitioning of the bound complex occurred to such an extent that no DNA was detected by the picogreen assay in the dextran phase. It was also possible to partition pDNA using a non-PEGylated (native) zinc finger–GST fusion protein in a PEG 1000–DEX 500 ATPS. In this case the native ligand accumulated mainly in the PEG phase. These results indicate good prospects for the design of new plasmid DNA purification methods using fusion proteins as affinity ligands.
Protein-mediated isolation of plasmid DNA by a zinc finger-glutathione S-transferase affinity linker
Resumo:
The sequence-specific affinity chromatographic isolation of plasmid DNA from crude lysates of E. coli DH5α fermentations is addressed. A zinc finger-GST fusion protein that binds a synthetic oligonucleotide cassette containing the appropriate DNA recognition sequence is described. This cassette was inserted into the Smal site of pUC19 to enable the affinity isolation of the plasmid. It is shown that zinc finger-GST fusion proteins can bind both their DNA recognition sequence and a glutathione-derivatized solid support simultaneously. Furthermore, a simple procedure for the isolation of such plasmids from clarified cell lysates is demonstrated. Cell lysates were clarified by cross-flow Dean vortex microfiltration, and the permeate was incubated with zinc finger-GST fusion protein. The resulting complex was adsorbed directly onto glutathione-Sepharose. Analysis of the glutathione-eluted complex showed that plasmid DNA had been recovered, largely free from contamination by genomic DNA or bacterial cell proteins. © 2002 Wiley Periodicals, Inc.
Resumo:
Aims: Previous data suggest heterogeneity in laminar distribution of the pathology in the molecular disorder frontotemporal lobar degeneration (FTLD) with transactive response (TAR) DNA-binding protein of 43kDa (TDP-43) proteinopathy (FTLD-TDP). To study this heterogeneity, we quantified the changes in density across the cortical laminae of neuronal cytoplasmic inclusions, glial inclusions, neuronal intranuclear inclusions, dystrophic neurites, surviving neurones, abnormally enlarged neurones, and vacuoles in regions of the frontal and temporal lobe. Methods: Changes in density of histological features across cortical gyri were studied in 10 sporadic cases of FTLD-TDP using quantitative methods and polynomial curve fitting. Results: Our data suggest that laminar neuropathology in sporadic FTLD-TDP is highly variable. Most commonly, neuronal cytoplasmic inclusions, dystrophic neurites and vacuolation were abundant in the upper laminae and glial inclusions, neuronal intranuclear inclusions, abnormally enlarged neurones, and glial cell nuclei in the lower laminae. TDP-43-immunoreactive inclusions affected more of the cortical profile in longer duration cases; their distribution varied with disease subtype, but was unrelated to Braak tangle score. Different TDP-43-immunoreactive inclusions were not spatially correlated. Conclusions: Laminar distribution of pathological features in 10 sporadic cases of FTLD-TDP is heterogeneous and may be accounted for, in part, by disease subtype and disease duration. In addition, the feedforward and feedback cortico-cortical connections may be compromised in FTLD-TDP. © 2012 The Authors. Neuropathology and Applied Neurobiology © 2012 British Neuropathological Society.
Resumo:
Full text: The idea of producing proteins from recombinant DNA hatched almost half a century ago. In his PhD thesis, Peter Lobban foresaw the prospect of inserting foreign DNA (from any source, including mammalian cells) into the genome of a λ phage in order to detect and recover protein products from Escherichia coli [ 1 and 2]. Only a few years later, in 1977, Herbert Boyer and his colleagues succeeded in the first ever expression of a peptide-coding gene in E. coli — they produced recombinant somatostatin [ 3] followed shortly after by human insulin. The field has advanced enormously since those early days and today recombinant proteins have become indispensable in advancing research and development in all fields of the life sciences. Structural biology, in particular, has benefitted tremendously from recombinant protein biotechnology, and an overwhelming proportion of the entries in the Protein Data Bank (PDB) are based on heterologously expressed proteins. Nonetheless, synthesizing, purifying and stabilizing recombinant proteins can still be thoroughly challenging. For example, the soluble proteome is organized to a large part into multicomponent complexes (in humans often comprising ten or more subunits), posing critical challenges for recombinant production. A third of all proteins in cells are located in the membrane, and pose special challenges that require a more bespoke approach. Recent advances may now mean that even these most recalcitrant of proteins could become tenable structural biology targets on a more routine basis. In this special issue, we examine progress in key areas that suggests this is indeed the case. Our first contribution examines the importance of understanding quality control in the host cell during recombinant protein production, and pays particular attention to the synthesis of recombinant membrane proteins. A major challenge faced by any host cell factory is the balance it must strike between its own requirements for growth and the fact that its cellular machinery has essentially been hijacked by an expression construct. In this context, Bill and von der Haar examine emerging insights into the role of the dependent pathways of translation and protein folding in defining high-yielding recombinant membrane protein production experiments for the common prokaryotic and eukaryotic expression hosts. Rather than acting as isolated entities, many membrane proteins form complexes to carry out their functions. To understand their biological mechanisms, it is essential to study the molecular structure of the intact membrane protein assemblies. Recombinant production of membrane protein complexes is still a formidable, at times insurmountable, challenge. In these cases, extraction from natural sources is the only option to prepare samples for structural and functional studies. Zorman and co-workers, in our second contribution, provide an overview of recent advances in the production of multi-subunit membrane protein complexes and highlight recent achievements in membrane protein structural research brought about by state-of-the-art near-atomic resolution cryo-electron microscopy techniques. E. coli has been the dominant host cell for recombinant protein production. Nonetheless, eukaryotic expression systems, including yeasts, insect cells and mammalian cells, are increasingly gaining prominence in the field. The yeast species Pichia pastoris, is a well-established recombinant expression system for a number of applications, including the production of a range of different membrane proteins. Byrne reviews high-resolution structures that have been determined using this methylotroph as an expression host. Although it is not yet clear why P. pastoris is suited to producing such a wide range of membrane proteins, its ease of use and the availability of diverse tools that can be readily implemented in standard bioscience laboratories mean that it is likely to become an increasingly popular option in structural biology pipelines. The contribution by Columbus concludes the membrane protein section of this volume. In her overview of post-expression strategies, Columbus surveys the four most common biochemical approaches for the structural investigation of membrane proteins. Limited proteolysis has successfully aided structure determination of membrane proteins in many cases. Deglycosylation of membrane proteins following production and purification analysis has also facilitated membrane protein structure analysis. Moreover, chemical modifications, such as lysine methylation and cysteine alkylation, have proven their worth to facilitate crystallization of membrane proteins, as well as NMR investigations of membrane protein conformational sampling. Together these approaches have greatly facilitated the structure determination of more than 40 membrane proteins to date. It may be an advantage to produce a target protein in mammalian cells, especially if authentic post-translational modifications such as glycosylation are required for proper activity. Chinese Hamster Ovary (CHO) cells and Human Embryonic Kidney (HEK) 293 cell lines have emerged as excellent hosts for heterologous production. The generation of stable cell-lines is often an aspiration for synthesizing proteins expressed in mammalian cells, in particular if high volumetric yields are to be achieved. In his report, Buessow surveys recent structures of proteins produced using stable mammalian cells and summarizes both well-established and novel approaches to facilitate stable cell-line generation for structural biology applications. The ambition of many biologists is to observe a protein's structure in the native environment of the cell itself. Until recently, this seemed to be more of a dream than a reality. Advances in nuclear magnetic resonance (NMR) spectroscopy techniques, however, have now made possible the observation of mechanistic events at the molecular level of protein structure. Smith and colleagues, in an exciting contribution, review emerging ‘in-cell NMR’ techniques that demonstrate the potential to monitor biological activities by NMR in real time in native physiological environments. A current drawback of NMR as a structure determination tool derives from size limitations of the molecule under investigation and the structures of large proteins and their complexes are therefore typically intractable by NMR. A solution to this challenge is the use of selective isotope labeling of the target protein, which results in a marked reduction of the complexity of NMR spectra and allows dynamic processes even in very large proteins and even ribosomes to be investigated. Kerfah and co-workers introduce methyl-specific isotopic labeling as a molecular tool-box, and review its applications to the solution NMR analysis of large proteins. Tyagi and Lemke next examine single-molecule FRET and crosslinking following the co-translational incorporation of non-canonical amino acids (ncAAs); the goal here is to move beyond static snap-shots of proteins and their complexes and to observe them as dynamic entities. The encoding of ncAAs through codon-suppression technology allows biomolecules to be investigated with diverse structural biology methods. In their article, Tyagi and Lemke discuss these approaches and speculate on the design of improved host organisms for ‘integrative structural biology research’. Our volume concludes with two contributions that resolve particular bottlenecks in the protein structure determination pipeline. The contribution by Crepin and co-workers introduces the concept of polyproteins in contemporary structural biology. Polyproteins are widespread in nature. They represent long polypeptide chains in which individual smaller proteins with different biological function are covalently linked together. Highly specific proteases then tailor the polyprotein into its constituent proteins. Many viruses use polyproteins as a means of organizing their proteome. The concept of polyproteins has now been exploited successfully to produce hitherto inaccessible recombinant protein complexes. For instance, by means of a self-processing synthetic polyprotein, the influenza polymerase, a high-value drug target that had remained elusive for decades, has been produced, and its high-resolution structure determined. In the contribution by Desmyter and co-workers, a further, often imposing, bottleneck in high-resolution protein structure determination is addressed: The requirement to form stable three-dimensional crystal lattices that diffract incident X-ray radiation to high resolution. Nanobodies have proven to be uniquely useful as crystallization chaperones, to coax challenging targets into suitable crystal lattices. Desmyter and co-workers review the generation of nanobodies by immunization, and highlight the application of this powerful technology to the crystallography of important protein specimens including G protein-coupled receptors (GPCRs). Recombinant protein production has come a long way since Peter Lobban's hypothesis in the late 1960s, with recombinant proteins now a dominant force in structural biology. The contributions in this volume showcase an impressive array of inventive approaches that are being developed and implemented, ever increasing the scope of recombinant technology to facilitate the determination of elusive protein structures. Powerful new methods from synthetic biology are further accelerating progress. Structure determination is now reaching into the living cell with the ultimate goal of observing functional molecular architectures in action in their native physiological environment. We anticipate that even the most challenging protein assemblies will be tackled by recombinant technology in the near future.
Resumo:
Protein-DNA interactions are involved in many fundamental biological processes essential for cellular function. Most of the existing computational approaches employed only the sequence context of the target residue for its prediction. In the present study, for each target residue, we applied both the spatial context and the sequence context to construct the feature space. Subsequently, Latent Semantic Analysis (LSA) was applied to remove the redundancies in the feature space. Finally, a predictor (PDNAsite) was developed through the integration of the support vector machines (SVM) classifier and ensemble learning. Results on the PDNA-62 and the PDNA-224 datasets demonstrate that features extracted from spatial context provide more information than those from sequence context and the combination of them gives more performance gain. An analysis of the number of binding sites in the spatial context of the target site indicates that the interactions between binding sites next to each other are important for protein-DNA recognition and their binding ability. The comparison between our proposed PDNAsite method and the existing methods indicate that PDNAsite outperforms most of the existing methods and is a useful tool for DNA-binding site identification. A web-server of our predictor (http://hlt.hitsz.edu.cn:8080/PDNAsite/) is made available for free public accessible to the biological research community.