16 resultados para Protein structures
em Chinese Academy of Sciences Institutional Repositories Grid Portal
Resumo:
CLEMAPS is a tool for multiple alignment of protein structures. It distinguishes itself from other existing algorithms for multiple structure alignment by the use of conformational letters, which are discretized states of 3D segmental structural states. A letter corresponds to a cluster of combinations of three angles formed by C-alpha pseudobonds of four contiguous residues. A substitution matrix called CLESUM is available to measure the similarity between any two such letters. The input 3D structures are first converted to sequences of conformational letters. Each string of a fixed length is then taken as the center seed to search other sequences for neighbors of the seed, which are strings similar to the seed. A seed and its neighbors form a center-star, which corresponds to a fragment set of local structural similarity shared by many proteins. The detection of center-stars using CLESUM is extremely efficient. Local similarity is a necessary, but insufficient, condition for structural alignment. Once center-stars are found, the spatial consistency between any two stars are examined to find consistent star duads using atomic coordinates. Consistent duads are later joined to create a core for multiple alignment, which is further polished to produce the final alignment. The utility of CLEMAPS is tested on various protein structure ensembles.
Resumo:
This paper reports the availability of a database of protein structural domains (DDBASE), an alignment database of homologous proteins (HOMSTRAD) and a database of structurally aligned superfamilies (CAMPASS) on the World Wide Web (WWW). DDBASE contains information on the organization of structural domains and their boundaries; it includes only one representative domain from each of the homologous families. This database has been derived by identifying the presence of structural domains in proteins on the basis of inter-secondary structural distances using the program DIAL [Sowdhamini & Blundell (1995), Protein Sci. 4, 506-520]. The alignment of proteins in superfamilies has been performed on the basis of the structural features and relationships of individual residues using the program COMPARER [Sali & Blundell (1990), J. Mol. Biol. 212, 403-428]. The alignment databases contain information on the conserved structural features in homologous proteins and those belonging to superfamilies. Available data include the sequence alignments in structure-annotated formats and the provision for viewing superposed structures of proteins using a graphical interface. Such information, which is freely accessible on the WWW, should be of value to crystallographers in the comparison of newly determined protein structures with previously identified protein domains or existing families.
Resumo:
A comparative study on the structures of some mRNAs and their encoded proteins shows an intriguing correlation between the two foldings. Non-random distribution of codons in the secondary structures of mRNAs is also shown, which appears to be in accordance with the conformational properties of amino acids in protein structures to some extent. These results seem to suggest that there may be a kind of genetic relationship between mRNA and protein at three-dimensional level.
Resumo:
In our studies, 88 human mRNA samples were collected from the Integrated Sequence-Structure database and then the dynamic process in co-transcriptional mRNA folding was simulated using the RNAstructure version 4.1 program. Through statistical analyses of the frequencies of occurrence of hairpins, a group of special folding structures-the 'common hairpins'-were identified. These 'common hairpins' have lower energies and occur in all the subsequent folding units that formed in the dynamic folding process. By applying the formulas (1)-(4) of the 'common hairpins' statistical model, 163 'common hairpins' were found, to make up about 7% of the total of 2286 hairpins. Classified studies further show that the 'common hairpins' that were studied may oscillate in the dynamic folding process. However, the hairpin loops of the 'common hairpins' and stems proximal to those 'common hairpins' loops maintain topologically stable structures, while other loops and stems distal to the 'common hairpins' loops are shown to be alterable structures. Strikingly, further studies indicate that the stable structures of these 'common hairpins' may have unbeknown effects on controlling the formation of protein structures in the translation process (unpublished results). (c) 2005 Elsevier B.V. All rights reserved.
Resumo:
The identification of near native protein-protein complexes among a set of decoys remains highly challenging. A stategy for improving the success rate of near native detection is to enrich near native docking decoys in a small number of top ranked decoys. Recently, we found that a combination of three scoring functions (energy, conservation, and interface propensity) can predict the location of binding interface regions with reasonable accuracy. Here, these three scoring functions are modified and combined into a consensus scoring function called ENDES for enriching near native docking decoys. We found that all individual scores result in enrichment for the majority of 28 targets in ZDOCK2.3 decoy set and the 22 targets in Benchmark 2.0. Among the three scores, the interface propensity score yields the highest enrichment in both sets of protein complexes. When these scores are combined into the ENDES consensus score, a significant increase in enrichment of near-native structures is found. For example, when 2000 dock decoys are reduced to 200 decoys by ENDES, the fraction of near-native structures in docking decoys increases by a factor of about six in average. ENDES was implemented into a computer program that is available for download at http://sparks.informatics.iupui.edu.
Resumo:
The aminoacyl-tRNA synthetases (AARS) are very important during the protein biosynthesis, which can make the gene sequence be accurately translated into the protein sequence by the specific recognition between AARS and tRNA/amino acids. However, the recog
Resumo:
Protein tyrosine phosphatases (PTPs) are comprised of two superfamilies, the phosphatase I superfamily containing a single low-molecular-weight PTP (lmwPTP) family and the phosphatase II superfamily including both the higher-molecular-weight PTP (hmwPTP) and the dual-specificity phosphatase (DSP) families. The phosphatase I and H superfamilies are often considered to be the result of convergent evolution. The PTP sequence and structure analyses indicate that lmwPTPs, hmwPTPs, and DSPs share similar structures, functions, and a common signature motif, although they have low sequence identities and a different order of active sites in sequence or a circular permutation. The results of this work suggest that lmwPTPs and hmwPTPs/DSPs are remotely related in evolution. The earliest ancestral gene of PTPs could be from a short fragment containing about 90similar to120 nucleotides or 30similar to40 residues; however, a probable full PTP ancestral gene contained one transcript unit with two lmwPTP genes. All three PTP families may have resulted from a common ancestral gene by a series of duplications, fusions, and circular permutations. The circular permutation in PTPs is caused by a reading frame difference, which is similar to that in DNA methyltransferases. Nevertheless, the evolutionary mechanism of circular permutation in PTP genes seems to be more complicated than that in DNA methyltransferase genes. Both mechanisms in PTPs and DNA methyltransferases can be used to explain how some protein families and superfamilies came to be formed by circular permutations during molecular evolution.
Resumo:
The origin of new structures and functions is an important process in evolution. In the past decades, we have obtained some preliminary knowledge of the origin and evolution of new genes. However, as the basic unit of genes, the origin and evolution of exons remain unclear. Because young exons retain the footprints of origination, they can be good materials for studying origin and evolution of new exons. In this paper, we report two young exons in a zinc finger protein gene of rodents. Since they are unique sequences in mouse and rat genome and no homologous sequences were found in the orthologous genes of human and pig, the young exons might originate after the divergence of primates and rodents through exonization of intronic sequences. Strong positive selection was detected in the new exons between mouse and rat, suggesting that these exons have undergone significant functional divergence after the separation of the two species. On the other hand, population genetics data of mouse demonstrate that the new exons have been subject to functional constraint, indicating an important function of the new exons in mouse. Functional analyses suggest that these new exons encode a nuclear localization signal peptide, which may mediate new ways of nuclear protein transport. To our knowledge, this is the first example of the origin and evolution of young exons.
Resumo:
Two three-dimensional structure models of the 21nt oligodeoxyribonucleotides, CPI (G3TG-2TGT2G5TG2TGT) and CP3 (TGTG2TGST2GTG2TG3), were constructed by InsightII (MSI) software in IRIS Indigo2 (SGI) workstation using the crystal structure of TAT tripler formation as the template. The initial structures subsequently were minimized by molecular mechanics. The final structures were believed as the dominant conformation. The results showed that the energy of CP1 is lower than that of CP3, and the former is more stable than the latter. Moreover, the results further proved that the 21nt oligodeoxyribo-nucleotide CP1 stably combines with the core promoter (Cp) fragment of hepatitis B virus (HBV) to form a tripler DNA, and CP1 specifically inhibits a specific cellular factor (DNA binding protein) binding to Cp fragment. These results indicated that specific repression of gene transcription of HBV DNA might be possible by tripler-formation DNA.
Resumo:
Lysozyme monolayer-protected gold nanoparticles (Au NPs) which are hydrophilic and biocompatible and show excellent colloidal stability at low temperature, ca. 4 degrees C, were synthesized in aqueous medium by chemical reduction of HAuCl4 with NaBH4 in the presence of a familiar small enzyme, lysozyme. UV-vis spectra, transmission electron microscopy (TEM), atomic force microscopy, and X-ray photoelectron spectroscopy characterization of the as prepared nanoparticles revealed the formation of well-dispersed An NPs of ca. 2 nm diameter. Moreover, the color change of the An NP solution as well as UV-vis spectroscopy and TEM measurements have also demonstrated the occurrence of Ostwald ripening of the nanoparticles at low temperature. Further characterization with Fourier transform infrared spectroscopy (FTIR) and dynamic light scattering indicated the formation of a monolayer of lysozyme molecules on the particle surface. FTIR data also indicated the intactness of the protein molecules coated on An NPs. All the characterization results showed that the monodisperse An NPs are well-coated directly with lysozyme. Driven by the dipole-dipole attraction, the protein-stabilized Au NPs self-assembled into network structures and nanowires upon aging under ambient temperature.
Resumo:
The study of associations between two biomolecules is the key to understanding molecular function and recognition. Molecular function is often thought to be determined by underlying structures. Here, combining a single-molecule study of protein binding with an energy-landscape-inspired microscopic model, we found strong evidence that biomolecular recognition is determined by flexibilities in addition to structures. Our model is based on coarse-grained molecular dynamics on the residue level with the energy function biased toward the native binding structure ( the Go model). With our model, the underlying free-energy landscape of the binding can be explored. There are two distinct conformational states at the free-energy minimum, one with partial folding of CBD itself and significant interface binding of CBD to Cdc42, and the other with native folding of CBD itself and native interface binding of CBD to Cdc42. This shows that the binding process proceeds with a significant interface binding of CBD with Cdc42 first, without a complete folding of CBD itself, and that binding and folding are then coupled to reach the native binding state.
Resumo:
The structure characterization of proteins or enzymes by STM on electrochemically prepared HOPG surface studied in this laboratory is reviewed. The serial structures of Hb were observed. The differences between the denaturation and inactivation of HRP were investigated by in situ and ex situ STM. The structural variation of Hb in an organic solvent was imaged while protein denaturation was easily observed in a polar solvent.
Resumo:
Background: Serine/threonine kinases (STKs) have been found in an increasing number of prokaryotes, showing important roles in signal transduction that supplement the well known role of two-component system. Cyanobacteria are photoautotrophic prokaryotes able to grow in a wide range of ecological environments, and their signal transduction systems are important in adaptation to the environment. Sequence information from several cyanobacterial genomes offers a unique opportunity to conduct a comprehensive comparative analysis of this kinase family. In this study, we extracted information regarding Ser/Thr kinases from 21 species of sequenced cyanobacteria and investigated their diversity, conservation, domain structure, and evolution. Results: 286 putative STK homologues were identified. STKs are absent in four Prochlorococcus strains and one marine Synechococcus strain and abundant in filamentous nitrogen-fixing cyanobacteria. Motifs and invariant amino acids typical in eukaryotic STKs were conserved well in these proteins, and six more cyanobacteria- or bacteria-specific conserved residues were found. These STK proteins were classified into three major families according to their domain structures. Fourteen types and a total of 131 additional domains were identified, some of which are reported to participate in the recognition of signals or substrates. Cyanobacterial STKs show rather complicated phylogenetic relationships that correspond poorly with phylogenies based on 16S rRNA and those based on additional domains. Conclusion: The number of STK genes in different cyanobacteria is the result of the genome size, ecophysiology, and physiological properties of the organism. Similar conserved motifs and amino acids indicate that cyanobacterial STKs make use of a similar catalytic mechanism as eukaryotic STKs. Gene gain-and-loss is significant during STK evolution, along with domain shuffling and insertion. This study has established an overall framework of sequence-structure-function interactions for the STK gene family, which may facilitate further studies of the role of STKs in various organisms.
Resumo:
Background: Serine/threonine kinases (STKs) have been found in an increasing number of prokaryotes, showing important roles in signal transduction that supplement the well known role of two-component system. Cyanobacteria are photoautotrophic prokaryotes able to grow in a wide range of ecological environments, and their signal transduction systems are important in adaptation to the environment. Sequence information from several cyanobacterial genomes offers a unique opportunity to conduct a comprehensive comparative analysis of this kinase family. In this study, we extracted information regarding Ser/Thr kinases from 21 species of sequenced cyanobacteria and investigated their diversity, conservation, domain structure, and evolution. Results: 286 putative STK homologues were identified. STKs are absent in four Prochlorococcus strains and one marine Synechococcus strain and abundant in filamentous nitrogen-fixing cyanobacteria. Motifs and invariant amino acids typical in eukaryotic STKs were conserved well in these proteins, and six more cyanobacteria- or bacteria-specific conserved residues were found. These STK proteins were classified into three major families according to their domain structures. Fourteen types and a total of 131 additional domains were identified, some of which are reported to participate in the recognition of signals or substrates. Cyanobacterial STKs show rather complicated phylogenetic relationships that correspond poorly with phylogenies based on 16S rRNA and those based on additional domains. Conclusion: The number of STK genes in different cyanobacteria is the result of the genome size, ecophysiology, and physiological properties of the organism. Similar conserved motifs and amino acids indicate that cyanobacterial STKs make use of a similar catalytic mechanism as eukaryotic STKs. Gene gain-and-loss is significant during STK evolution, along with domain shuffling and insertion. This study has established an overall framework of sequence-structure-function interactions for the STK gene family, which may facilitate further studies of the role of STKs in various organisms.
Resumo:
How to refine a near-native structure to make it closer to its native conformation is an unsolved problem in protein-structure and protein-protein complex-structure prediction. In this article, we first test several scoring functions for selecting locally resampled near-native protein-protein docking conformations and then propose a computationally efficient protocol for structure refinement via local resampling and energy minimization. The proposed method employs a statistical energy function based on a Distance-scaled Ideal-gas REference state (DFIRE) as an initial filter and an empirical energy function EMPIRE (EMpirical Protein-InteRaction Energy) for optimization and re-ranking. Significant improvement of final top-1 ranked structures over initial near-native structures is observed in the ZDOCK 2.3 decoy set for Benchmark 1.0 (74% whose global rmsd reduced by 0.5 angstrom or more and only 7% increased by 0.5 angstrom or more). Less significant improvement is observed for Benchmark 2.0 (38% versus 33%). Possible reasons are discussed.