971 results for DNA binding domains


Relevance: 100.00% 100.00%

Abstract:

DNA-binding proteins are crucial for various cellular processes and hence have become an important target for both basic research and drug development. With the avalanche of protein sequences generated in the postgenomic age, it is highly desirable to establish an automated method for rapidly and accurately identifying DNA-binding proteins from their sequence information alone. Because all biological species have evolved from a very limited number of ancestral species, it is important to take evolutionary information into account when developing such a high-throughput tool. In view of this, a new predictor was proposed that incorporates evolutionary information into the general form of pseudo amino acid composition via the top-n-gram approach. Comparison with existing methods in both the jackknife test and an independent data-set test showed that the new predictor outperformed its counterparts. It is anticipated that the new predictor may become a useful vehicle for identifying DNA-binding proteins. It has not escaped our notice that this novel approach of incorporating evolutionary information into the formulation of statistical samples can be used to identify many other protein attributes as well.

Relevance: 100.00% 100.00%

Abstract:

DNA-binding proteins are crucial for various cellular processes, such as recognition of specific nucleotides, regulation of transcription, and regulation of gene expression. Developing an effective model for identifying DNA-binding proteins is an urgent research problem. Many methods have been proposed so far, but most of them rely on a single classifier and cannot make full use of the large number of negative samples to improve predictive performance. This study proposed a predictor called enDNA-Prot for DNA-binding protein identification that employs ensemble learning. Experimental results showed that enDNA-Prot was comparable with DNA-Prot and outperformed DNAbinder and iDNA-Prot, with improvements in the range of 3.97-9.52% in ACC and 0.08-0.19 in MCC. Furthermore, when the benchmark dataset was expanded with negative samples, enDNA-Prot outperformed the three existing methods by 2.83-16.63% in terms of ACC and 0.02-0.16 in terms of MCC. These results indicate that enDNA-Prot is an effective method for DNA-binding protein identification and that expanding the training dataset with negative samples can improve its performance. For the convenience of the vast majority of experimental scientists, we developed a user-friendly web-server for enDNA-Prot which is freely accessible to the public. © 2014 Ruifeng Xu et al.
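The two metrics quoted above, accuracy (ACC) and the Matthews correlation coefficient (MCC), can both be computed from a binary confusion matrix. The sketch below uses the standard definitions; the counts are hypothetical values chosen only for illustration and are not taken from the enDNA-Prot study.

```python
import math


def accuracy(tp, tn, fp, fn):
    """Fraction of all predictions that are correct."""
    return (tp + tn) / (tp + tn + fp + fn)


def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient, in the range [-1, 1].

    Returns 0.0 when any marginal total is zero (a common convention).
    """
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom


# Hypothetical counts: DNA-binding proteins are the positive class.
tp, tn, fp, fn = 40, 45, 5, 10
print(f"ACC = {accuracy(tp, tn, fp, fn):.2%}")
print(f"MCC = {mcc(tp, tn, fp, fn):.3f}")
```

MCC is less sensitive to class imbalance than ACC, which is probably why the abstract reports both when the negative set is enlarged.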

Relevance: 100.00% 100.00%

Abstract:

Background: DNA-binding proteins play a pivotal role in various intra- and extra-cellular activities ranging from DNA replication to gene expression control. Identification of DNA-binding proteins is one of the major challenges in the field of genome annotation. Several computational methods have been proposed in the literature for DNA-binding protein identification. However, most of them cannot provide a valuable knowledge base for our understanding of DNA-protein interactions. Results: We first present a new protein sequence encoding method called PSSM Distance Transformation, and then construct a DNA-binding protein identification method (SVM-PSSM-DT) by combining PSSM Distance Transformation with a support vector machine (SVM). First, the PSSM profiles are generated by using the PSI-BLAST program to search the non-redundant (NR) database. Next, the PSSM profiles are transformed into uniform numeric representations by the distance transformation scheme. Lastly, the resulting uniform numeric representations are fed into an SVM classifier for prediction, which determines whether or not a sequence can bind to DNA. In a benchmark test on 525 DNA-binding and 550 non-DNA-binding proteins using jackknife validation, the present model achieved an ACC of 79.96%, MCC of 0.622 and AUC of 86.50%. This performance is considerably better than that of most existing state-of-the-art predictive methods. When tested on a recently constructed independent dataset, PDB186, SVM-PSSM-DT also achieved the best performance, with an ACC of 80.00%, MCC of 0.647 and AUC of 87.40%, and outperformed some existing state-of-the-art methods. Conclusions: The experimental results demonstrate that PSSM Distance Transformation is a viable protein sequence encoding method and that SVM-PSSM-DT is a useful tool for identifying DNA-binding proteins. 
A user-friendly web-server for SVM-PSSM-DT was constructed, which is freely accessible to the public at http://bioinformatics.hitsz.edu.cn/PSSM-DT/.
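The "distance transformation" step has to turn a PSSM whose row count depends on sequence length into a vector of fixed length before it can be passed to an SVM. The abstract does not give the formula, so the sketch below shows one plausible form, assumed here purely for illustration: for each PSSM column and each gap d, average the product of the scores at positions j and j + d. The function names and the toy matrix are hypothetical, not the authors' code.

```python
def distance_transform(pssm, max_gap):
    """Map an L x C score matrix to a fixed-length vector of C * max_gap features.

    Feature (gap, col) is the mean of pssm[j][col] * pssm[j + gap][col]
    over every position j for which j + gap is still inside the sequence.
    """
    length = len(pssm)
    if length <= max_gap:
        raise ValueError("sequence must be longer than max_gap")
    n_cols = len(pssm[0])
    features = []
    for gap in range(1, max_gap + 1):
        for col in range(n_cols):
            total = sum(pssm[j][col] * pssm[j + gap][col] for j in range(length - gap))
            features.append(total / (length - gap))
    return features


def toy_pssm(length, n_cols=20):
    """Deterministic stand-in for a PSI-BLAST profile (illustration only)."""
    return [[(7 * i + 3 * a) % 5 - 2 for a in range(n_cols)] for i in range(length)]


# Two "proteins" of different length yield vectors of the same size.
short_vec = distance_transform(toy_pssm(30), max_gap=3)
long_vec = distance_transform(toy_pssm(50), max_gap=3)
print(len(short_vec), len(long_vec))
```

Both vectors have 20 x 3 = 60 entries regardless of sequence length, which is the property the abstract calls a "uniform numeric representation".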

Relevance: 100.00% 100.00%

Abstract:

Protein-DNA interactions are involved in many fundamental biological processes essential for cellular function. Most existing computational approaches use only the sequence context of the target residue for prediction. In the present study, for each target residue, we used both the spatial context and the sequence context to construct the feature space. Subsequently, Latent Semantic Analysis (LSA) was applied to remove redundancies in the feature space. Finally, a predictor (PDNAsite) was developed by integrating support vector machine (SVM) classifiers with ensemble learning. Results on the PDNA-62 and PDNA-224 datasets demonstrate that features extracted from the spatial context provide more information than those from the sequence context, and that combining them yields a further performance gain. An analysis of the number of binding sites in the spatial context of the target site indicates that interactions between neighbouring binding sites are important for protein-DNA recognition and binding ability. The comparison between our proposed PDNAsite method and existing methods indicates that PDNAsite outperforms most of them and is a useful tool for DNA-binding site identification. A web-server for our predictor (http://hlt.hitsz.edu.cn:8080/PDNAsite/) is freely available to the biological research community.

Relevance: 100.00% 100.00%

Abstract:

The mammalian high mobility group protein AT-hook 2 (HMGA2) is a small transcription factor involved in cell development and oncogenesis. It contains three "AT-hook" DNA binding domains, which specifically recognize the minor groove of AT-rich DNA sequences. It also has an acidic C-terminal motif. Previous studies showed that HMGA2 mediates all its biological effects through interactions with AT-rich DNA sequences in promoter regions. In this dissertation, I used a variety of biochemical and biophysical methods to examine the physical properties of HMGA2 and to further investigate its interactions with AT-rich DNA sequences. Three avenues were pursued in this study: (1) Taking advantage of the asymmetrical charge distribution of HMGA2, I developed a rapid procedure to purify HMGA2 in the milligram range. Preparation of large amounts of HMGA2 makes biophysical studies possible. (2) Since HMGA2 binds to different AT-rich sequences in promoter regions, I used a combination of isothermal titration calorimetry (ITC) and DNA UV melting experiments to characterize the interactions of HMGA2 with poly(dA-dT)2 and poly(dA)poly(dT). My results demonstrated that (i) each HMGA2 molecule binds to 15 AT bp; (ii) HMGA2 binds to both AT DNAs with very high affinity, but the binding of HMGA2 to poly(dA-dT)2 is enthalpy-driven whereas its binding to poly(dA)poly(dT) is entropy-driven; (iii) the binding reactions depend strongly on salt concentration. (3) Previous studies showed that HMGA2 may have sequence specificity. In this study, I used a PCR-based SELEX procedure to examine the DNA binding specificity of HMGA2. Two consensus sequences for HMGA2 were identified: 5'-ATATTCGCGAWWATT-3' and 5'-ATATTGCGCAWWATT-3', where W represents A or T. These consensus sequences have a unique feature: the first five base pairs are AT-rich, the middle four to five base pairs are GC-rich, and the last five to six base pairs are AT-rich. 
All three segments are critical for high affinity binding. Replacing either one of the AT-rich sequences with a non-AT-rich sequence causes at least a 100-fold decrease in binding affinity. Intriguingly, if the GC-rich segment is replaced by an AT-rich segment, the binding affinity of HMGA2 is reduced approximately 5-fold. Identification of the consensus sequences for HMGA2 represents an important step towards finding its binding sites within the genome.
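As a small illustration of how the two consensus sequences could be used to look for candidate sites, the sketch below expands the IUPAC code W (A or T) into a regular expression and scans a sequence. The helper names and the test sequence are hypothetical, and exact matching on a single strand is a simplifying assumption; real binding-site searches normally tolerate mismatches and scan both strands.

```python
import re

# Subset of IUPAC nucleotide codes needed here; W stands for A or T.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "W": "[AT]"}

# The two consensus sequences reported in the abstract (5' to 3').
CONSENSUS = ["ATATTCGCGAWWATT", "ATATTGCGCAWWATT"]


def consensus_to_regex(consensus):
    """Translate an IUPAC consensus string into a compiled regular expression."""
    return re.compile("".join(IUPAC[base] for base in consensus))


PATTERNS = [consensus_to_regex(c) for c in CONSENSUS]


def find_sites(sequence):
    """Return sorted (start, matched_text) pairs for exact consensus matches."""
    hits = []
    for pattern in PATTERNS:
        for match in pattern.finditer(sequence.upper()):
            hits.append((match.start(), match.group()))
    return sorted(hits)


# Hypothetical sequence containing one instance of each consensus.
seq = "GGG" + "ATATTCGCGAATATT" + "CCC" + "ATATTGCGCATAATT" + "GG"
print(find_sites(seq))
```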

Relevance: 100.00% 100.00%

Abstract:

Schistosomiasis is a chronically debilitating helminth infection with a significant socio-economic and public health impact. Accurate diagnostics play a pivotal role in achieving current schistosomiasis control and elimination goals. However, many of the current diagnostic procedures, which rely on detection of schistosome eggs, have major limitations, including lack of accuracy and the inability to detect pre-patent infections. DNA-based detection methods provide a viable alternative to the tests commonly used for schistosomiasis diagnosis. Here we describe the optimisation of a novel droplet digital PCR (ddPCR) duplex assay for the diagnosis of Schistosoma japonicum infection, which provides improved detection sensitivity and specificity. The assay involves the amplification of two specific and abundant target gene sequences in S. japonicum: a retrotransposon (SjR2) and a portion of a mitochondrial gene (nad1). The assay detected target sequences in schistosome DNA isolated from adult worms, schistosomules and eggs, and exhibits a high level of specificity. It therefore represents an ideal tool for the detection of low levels of parasite DNA in different clinical samples, including parasite cell-free DNA in the host circulation and other bodily fluids. Moreover, being quantitative, the assay can be used to determine parasite infection intensity and could provide an important tool for the detection of low-intensity infections in low-prevalence schistosomiasis-endemic areas.
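The quantitative side of ddPCR rests on counting positive droplets and applying a Poisson correction, because a positive droplet may hold more than one target copy. The sketch below shows that general calculation, not the authors' analysis pipeline. The droplet counts are hypothetical and the default droplet volume is an assumed placeholder that depends on the instrument. In a duplex assay, each target (here SjR2 and nad1) would be quantified separately in the same way.

```python
import math


def copies_per_droplet(positive, total):
    """Poisson-corrected mean number of target copies per droplet."""
    if not 0 <= positive < total:
        raise ValueError("need 0 <= positive < total")
    return -math.log(1 - positive / total)


def copies_per_microlitre(positive, total, droplet_volume_nl=0.85):
    """Target concentration in the reaction mix.

    droplet_volume_nl is an assumed placeholder; 1 nL equals 1e-3 microlitres.
    """
    return copies_per_droplet(positive, total) / (droplet_volume_nl * 1e-3)


# Hypothetical droplet counts for one target channel.
positive_droplets, accepted_droplets = 120, 15000
print(f"{copies_per_droplet(positive_droplets, accepted_droplets):.5f} copies/droplet")
print(f"{copies_per_microlitre(positive_droplets, accepted_droplets):.1f} copies/uL")
```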

Relevance: 100.00% 100.00%

Abstract:

The Bacillus subtilis DnaI, DnaB and DnaD proteins load the replicative ring helicase DnaC onto DNA during priming of DNA replication. Here we show that DnaI consists of a C-terminal domain (Cd) with ATPase and DNA-binding activities and an N-terminal domain (Nd) that interacts with the replicative ring helicase. A Zn2+-binding module mediates the interaction with the helicase and C67, C70 and H84 are involved in the coordination of the Zn2+. DnaI binds ATP and exhibits ATPase activity that is not stimulated by ssDNA, because the DNA-binding site on Cd is masked by Nd. The ATPase activity resides on the Cd domain and when detached from the Nd domain, it becomes sensitive to stimulation by ssDNA because its cryptic DNA-binding site is exposed. Therefore, Nd acts as a molecular 'switch' regulating access to the ssDNA binding site on Cd, in response to binding of the helicase. DnaI is sufficient to load the replicative helicase from a complex with six DnaI molecules, so there is no requirement for a dual helicase loader system.

Relevance: 100.00% 100.00%

Abstract:

Nurr1, NGFI-B and Nor1 (NR4A2, NR4A1 and NR4A3, respectively) belong to the NR4A subfamily of nuclear receptors. The NR4A receptors are orphan nuclear receptors, which means that activating or repressing ligands for these receptors have not been found. NR4A expression is rapidly induced in response to various stimuli, including growth factors and parathyroid hormone (PTH). Studies of the NR4A receptors in the central nervous system have demonstrated that they have a major role in the development and function of the dopaminergic neurons of the midbrain and in regulating the hypothalamus-pituitary-adrenal axis. However, the peripheral functions of the NR4A family are largely unknown. Cultured mouse primary osteoblasts, a preosteoblastic cell line and several osteoblastic cell lines were used to investigate the role of NR4A receptors in osteoblasts. NR4A receptors were shown to directly bind to and activate the promoter of the osteopontin gene (OPN) in osteoblastic cells, thus regulating its expression. OPN is a major bone matrix protein expressed throughout the differentiation of preosteoblastic cells into osteoblasts. The activation of the OPN promoter was shown to be dependent on the activation function-1 located in the N-terminal part of Nurr1 and to occur in both monomeric and RXR heterodimeric forms of NR4A receptors. Furthermore, PTH was shown to upregulate OPN expression through the NR4A family. It was also demonstrated that fibroblast growth factor-8b (FGF-8b) induces the expression of NR4A receptors in osteoblasts as immediate early genes. This induction involved phosphatidylinositol-3 kinase, protein kinase C, and mitogen activated protein kinase, which are all major pathways of FGF signalling. Nurr1 and NGFI-B were shown to induce the proliferation of preosteoblastic cells and to reduce their apoptosis. FGF-8b was shown to stimulate the proliferation of osteoblastic cells through the NR4A receptors. 
These results suggest that NR4A receptors have a role both in the differentiation of osteoblasts and in the proliferation and apoptosis of preosteoblasts. The NR4A receptors were found to bind to the same response element on OPN as the members of the NR3B family of orphan receptors do. Mutual repression was observed between the NR4A receptors and the NR3B receptors. This repression was shown to be dependent on the DNA-binding domains of both receptor families, but to result neither from competition for DNA binding nor from competition for coactivators. As the repression was dependent on the relative expression levels of the NR4As and NR3Bs, it seems likely that the ratio of the receptors mediates their activity on their response elements. Rapid induction of the NR4As in response to various stimuli and differential expression of the NR3Bs can effectively control gene activation by the NR4A receptors. NR4A receptors can bind DNA as monomers, and Nurr1 and NGFI-B can form permissive heterodimers with the retinoid X receptor (RXR). Permissive heterodimers can be activated with RXR agonists, unlike non-permissive heterodimers, which are formed by RXR and retinoic acid receptor or thyroid hormone receptor (RAR and TR, respectively). Non-permissive heterodimers can only be activated by the agonists of the heterodimerizing partner. The mechanisms behind the differential response to RXR agonists have remained unresolved. As there are no known activating or repressing ligands for the NR4A receptors, it is important to find out how they are regulated. Permissiveness of Nurr1/RXR heterodimers was linked to the N-terminal part of the Nurr1 ligand-binding domain. This region has previously been shown to mediate the interaction between NRs and corepressors. 
Non-permissive RAR and TR, permissive Nurr1 and NGFI-B, and RXR were overexpressed with the corepressors silencing mediator for retinoic acid and thyroid hormone receptors (SMRT) and nuclear receptor corepressor in several cell lines. Nurr1 and NGFI-B were found to be repressed by SMRT. The interaction of RXR heterodimers with corepressors was weak in permissive heterodimers and much stronger in non-permissive heterodimers. Non-permissive heterodimers also released corepressors only in response to the agonist of the heterodimeric partner of RXR. In the permissive Nurr1/RXR heterodimer, however, SMRT was released following treatment with RXR agonists. Corepressor release in response to ligands was found to differentiate permissive heterodimers from non-permissive ones. Corepressors were thus connected to the regulation of NR4A functions. In summary, the studies presented here linked the NR4A family of orphan nuclear receptors to the regulation of osteoblasts. Nurr1 and NGFI-B were found to control the proliferation and apoptosis of preosteoblasts. The studies also demonstrated that cross-talk with the NR3B receptors controls the activity of these orphan receptors. The results clarified the mechanism of permissiveness of RXR heterodimers. New information was obtained on the regulation and functions of NR4A receptors, for which the ligands are unknown.

Relevance: 100.00% 100.00%

Abstract:

The double-stranded-RNA-dependent protein kinase (PKR) is an important component of an antiviral defence pathway that is mediated by interferon (IFN) in vertebrates. Previously, some important IFN system genes were identified from IFN-producing CAB (crucian carp Carassius auratus blastulae embryonic) cells after treatment with UV-inactivated GCHV (grass carp haemorrhage virus). Here, a fish PKR-like gene, named CaPKR-like, is cloned and sequenced from the same virally infected CAB cells. It is 2192 base pairs in length, with the largest open reading frame (ORF) encoding a protein of 513 amino acid residues. A BLAST search reveals that the putative CaPKR-like protein is most homologous to human PKR and also has a high level of homology with all members of the eIF2alpha kinase family. Structurally, CaPKR-like possesses the conserved C-terminal catalytic domain of the eIF2alpha kinase family and is most similar to mammalian PKRs. Its N-terminus lacks the dsRNA-binding domains conserved in mammalian PKRs and instead contains two putative Z-DNA binding domains (Zalpha). Like mammalian PKRs, CaPKR-like had a very low level of constitutive expression in normal CAB cells but was up-regulated in response to active GCHV, UV-inactivated GCHV and CAB IFN, implying that the transcriptional activation of CaPKR-like by viral infection is possibly mediated by newly produced CAB IFN; this was further supported by experiments using cycloheximide, a potent inhibitor of protein synthesis. Together, the results suggest that CaPKR-like is the first identified fish gene most similar to mammalian PKRs. © 2004 Elsevier Ltd. All rights reserved.

Relevance: 100.00% 100.00%

Abstract:

NR2E3, also called photoreceptor-specific nuclear receptor (PNR), is a transcription factor of the nuclear hormone receptor superfamily whose expression is uniquely restricted to photoreceptors. There, its physiological activity is essential for proper rod and cone photoreceptor development and maintenance. Thirty-two different mutations in NR2E3 have been identified in either the homozygous or compound heterozygous state in the recessively inherited enhanced S-cone sensitivity syndrome (ESCS), Goldmann-Favre syndrome (GFS), and clumped pigmentary retinal degeneration (CPRD). The clinical phenotype common to all these patients is night blindness, rudimentary or absent rod function, and hyperfunction of the "blue" S-cones. A single p.G56R mutation is inherited in a dominant manner and causes retinitis pigmentosa (RP). We have established a new locus-specific database for NR2E3 (www.LOVD.nl/eye), containing all reported mutations, polymorphisms, and unclassified sequence variants, including novel ones. A high proportion of mutations are located in the evolutionarily conserved DNA-binding domain (DBD) and ligand-binding domain (LBD) of NR2E3. Based on homology modeling of these NR2E3 domains, we propose a structural localization of mutated residues. The high variability of clinical phenotypes observed in patients affected by NR2E3-linked retinal degenerations may be caused by different disease mechanisms, including absence of DNA binding, altered interactions with transcriptional coregulators, and differential activity of modifier genes.

Relevance: 100.00% 100.00%

Abstract:

Dinoflagellates play a very important role in ocean ecology: they account for a large share of primary production, form a symbiotic association with corals, and can produce algal blooms that are potentially toxic to coastal human and animal communities. Nevertheless, the molecular biology of dinoflagellates has received very little study in recent years, and knowledge of basic processes such as the regulation of transcription remains very limited. An attempt to elucidate this mechanism was made in the photosynthetic dinoflagellates Lingulodinium polyedrum and Amphidinium carterae. Transcription of the gene for the peridinin chlorophyll-a binding protein (PCP), the major light-harvesting complex, was induced by lowering the light intensity, which produced only a small increase (less than 2-fold) in transcript levels in both the short and the long term. One-hybrid and electrophoretic mobility shift assay (EMSA) experiments were performed to identify potential protein-DNA interactions in the intergenic region of the tandemly arranged PCP gene. These assays failed to identify any such proteins. An analysis of the L. polyedrum transcriptome showed a marked under-representation of classical DNA-binding domains (such as heat-shock factor, bZIP or Myb) and an over-representation of the cold shock domain, which is of bacterial origin, compared with other unicellular eukaryotes. This work suggests that the mechanisms of transcriptional regulation in dinoflagellates may differ substantially from those of other eukaryotes.

Relevance: 100.00% 100.00%

Abstract:

Members of the SMC (Structural Maintenance of Chromosomes) family, present in all domains of life, are involved in processes ranging from sister-chromatid cohesion to DNA repair. Each member of this family, which comprises six members (Smc1 to Smc6), associates with another member and with non-SMC subunits to form three complexes: cohesin, condensin and Smc5-6. The involvement of the Smc5-6 complex in several aspects of the maintenance of genome integrity is well established. Nevertheless, a fundamental question about this complex remains unanswered: how can it be involved in so many aspects of the life of a cell? To date, this question is difficult to answer because little information is available on the biochemical activities of the complex. The objective of this work is therefore the biochemical characterization of the Smc5-6 complex. The biochemistry of cohesin and condensin suggests several possibilities for the biochemical activities of the Smc5-6 complex. The first step of my project was therefore to develop a procedure for purifying Smc5 and Smc6 after overexpression in yeast. After several experiments, it became clear that both proteins bind single-stranded DNA (ssDNA) as well as double-stranded DNA (dsDNA) and that, although they can bind both types of DNA, they have a higher affinity for ssDNA. These experiments also showed that the interaction between Smc5 or Smc6 and ssDNA is very stable, whereas the interaction with dsDNA is not. Following these results, the second step was to determine which part or parts of Smc5 and Smc6 mediate DNA binding. To answer this question, a molecular dissection was carried out, followed by characterization of the individual domains that make up Smc5 and Smc6. 
In this way, it was possible to show that there are two DNA-binding sites on Smc5 and Smc6: the first lies in the hinge domain and the adjacent region of the coiled-coil domain, and the second in the ATPase head of both proteins. Although both domains can bind ssDNA, they differ markedly in their affinity for this type of DNA: the hinge domain has a higher affinity for ssDNA than the ATPase head. Moreover, the ATPase head is unable to bind dsDNA, whereas the hinge domain can. Identifying the DNA-binding sites on Smc5 and Smc6 will make it possible to create new mutants defective in DNA binding, which will facilitate the study of the Smc5-6 complex during DNA repair in vivo.

Relevance: 100.00% 100.00%

Abstract:

CopY of Enterococcus hirae is a well characterized copper-responsive repressor involved in copper homeostasis. In the absence of copper, it binds to the promoter. In high copper, the CopZ copper chaperone donates copper to CopY, thereby releasing it from the promoter and allowing transcription of the downstream copper homeostatic genes of the cop operon. We show here that the CopY-like repressors from E. hirae, Lactococcus lactis, and Streptococcus mutans have similar affinities not only for their native promoters, but also for heterologous cop promoters. CopZ of L. lactis accelerated the release of CopY from the promoter, suggesting that CopZ of L. lactis acts as a copper chaperone, similar to CopZ in E. hirae. The consensus binding motif of the CopY-like repressors was shown to be TACAxxTGTA. The same binding motif is present in promoters controlled by BlaI of Bacillus licheniformis, MecI of Staphylococcus aureus and related repressors. BlaI and MecI have known structures and belong to the family of 'winged helix' proteins. In the N-terminal domain, they share significant sequence similarity with CopY of E. hirae. Moreover, they bind to the same TACAxxTGTA motif. NMR analysis of the N-terminal DNA binding domain of CopY of L. lactis showed that it has the same alpha-helical content as the corresponding regions of BlaI and MecI. These findings suggest that the DNA binding domains of CopY-like repressors are also of the 'winged helix' type.
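The TACAxxTGTA motif is an inverted repeat: the half-site TGTA is the reverse complement of TACA, so the site reads the same on both strands apart from the two-base spacer. The sketch below checks that property and scans a sequence for the motif. The promoter fragment is a made-up example, and treating "x" as any of A, C, G or T is an assumption about the notation.

```python
import re

# Lookahead so that overlapping occurrences would also be reported.
MOTIF = re.compile(r"(?=(TACA[ACGT]{2}TGTA))")

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def find_motif(sequence):
    """Return (start, site) pairs for every TACAxxTGTA occurrence."""
    return [(m.start(), m.group(1)) for m in MOTIF.finditer(sequence.upper())]


# The two half-sites are reverse complements of each other.
print(reverse_complement("TACA") == "TGTA")

# Hypothetical promoter fragment containing one site.
fragment = "GG" + "TACAAATGTA" + "GAT"
print(find_motif(fragment))
```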

Relevance: 100.00% 100.00%

Abstract:

USF, Upstream Stimulatory Factor, is a family of ubiquitous transcription factors that contain highly conserved basic helix-loop-helix leucine zipper DNA binding domains and recognize the core DNA sequence CACGTG. In human and mouse, two members of the USF family, USF1 and USF2, encoded by two different genes, contribute to the USF activity. In order to gain insights into the mechanisms by which USFs function as transcriptional activators, different approaches were used to map the domains of USF2 responsible for nuclear localization and transcriptional activation. Two stretches of amino acids, one in the basic region of the DNA binding domain, the other in a highly conserved N-terminal region, were found to direct nuclear localization independently of one another. Two distinct activation domains were also identified. The first one, located in the conserved N-terminal region that overlaps the C-terminal nuclear localization signal, functioned only in the presence of an initiator element in the promoter of the reporter. The second, in a nonconserved region, activated transcription in the absence of an initiator element or when fused to a heterologous DNA binding domain. These results suggest that USF2 functions in different promoter contexts by selectively utilizing different activation domains. The deletion analysis of USF2 also identified two dominant negative mutants of USF, one lacking the activation domain, the other lacking the basic domain. The latter proved useful for testing the direct involvement of USFs in the transcriptional activation mediated by the viral protein IE62. To investigate the biological function of USFs, foci and colony formation assays were used to study growth regulation by USFs. It was found that USFs had a strong antagonistic effect on cellular transformation mediated by the bHLH/LZ protein Myc. This effect required the DNA binding activity of either USF1 or USF2. 
Moreover, USF2, but not USF1 or other mutants of USFs, was also found to have a strong inhibitory effect on cellular transformation by E1a and on the growth of HeLa cells. These results demonstrate that USFs could potentially regulate growth through two mechanisms, one by antagonizing the function of Myc in cellular transformation, the other by mediating a more general growth inhibitory effect.