952 resultados para PROTEIN STRUCTURE
Resumo:
The increasing number of available protein structures requires efficient tools for multiple structure comparison. Indeed, multiple structural alignments are essential for the analysis of function, evolution and architecture of protein structures. For this purpose, we proposed a new web server called multiple Protein Block Alignment (mulPBA). This server implements a method based on a structural alphabet to describe the backbone conformation of a protein chain in terms of dihedral angles. This sequence-like' representation enables the use of powerful sequence alignment methods for primary structure comparison, followed by an iterative refinement of the structural superposition. This approach yields alignments superior to most of the rigid-body alignment methods and highly comparable with the flexible structure comparison approaches. We implement this method in a web server designed to do multiple structure superimpositions from a set of structures given by the user. Outputs are given as both sequence alignment and superposed 3D structures visualized directly by static images generated by PyMol or through a Jmol applet allowing dynamic interaction. Multiple global quality measures are given. Relatedness between structures is indicated by a distance dendogram. Superimposed structures in PDB format can be also downloaded, and the results are quickly obtained. mulPBA server can be accessed at www.dsimb.inserm.fr/dsimb_tools/mulpba/.
Resumo:
An online computing server, Online_DPI (where DPI denotes the diffraction precision index), has been created to calculate the `Cruickshank DPI' value for a given three-dimensional protein or macromolecular structure. It also estimates the atomic coordinate error for all the atoms available in the structure. It is an easy-to-use web server that enables users to visualize the computed values dynamically on the client machine. Users can provide the Protein Data Bank (PDB) identification code or upload the three-dimensional atomic coordinates from the client machine. The computed DPI value for the structure and the atomic coordinate errors for all the atoms are included in the revised PDB file. Further, users can graphically view the atomic coordinate error along with `temperature factors' (i.e. atomic displacement parameters). In addition, the computing engine is interfaced with an up-to-date local copy of the Protein Data Bank. New entries are updated every week, and thus users can access all the structures available in the Protein Data Bank. The computing engine is freely accessible online at http://cluster.physics.iisc.ernet.in/dpi/.
Resumo:
Network theory has become an excellent method of choice through which biological data are smoothly integrated to gain insights into complex biological problems. Understanding protein structure, folding, and function has been an important problem, which is being extensively investigated by the network approach. Since the sequence uniquely determines the structure, this review focuses on the networks of non-covalently connected amino acid side chains in proteins. Questions in structural biology are addressed within the framework of such a formalism. While general applications are mentioned in this review, challenging problems which have demanded the attention of scientific community for a long time, such as allostery and protein folding, are considered in greater detail. Our aim has been to explore these important problems through the eyes of networks. Various methods of constructing protein structure networks (PSN) are consolidated. They include the methods based on geometry, edges weighted by different schemes, and also bipartite network of protein-nucleic acid complexes. A number of network metrics that elegantly capture the general features as well as specific features related to phenomena, such as allostery and protein model validation, are described. Additionally, an integration of network theory with ensembles of equilibrium structures of a single protein or that of a large number of structures from the data bank has been presented to perceive complex phenomena from network perspective. Finally, we discuss briefly the capabilities, limitations, and the scope for further explorations of protein structure networks.
Resumo:
Protein structure prediction has remained a major challenge in structural biology for more than half a century. Accelerated and cost efficient sequencing technologies have allowed researchers to sequence new organisms and discover new protein sequences. Novel protein structure prediction technologies will allow researchers to study the structure of proteins and to determine their roles in the underlying biology processes and develop novel therapeutics.
Difficulty of the problem stems from two folds: (a) describing the energy landscape that corresponds to the protein structure, commonly referred to as force field problem; and (b) sampling of the energy landscape, trying to find the lowest energy configuration that is hypothesized to be the native state of the structure in solution. The two problems are interweaved and they have to be solved simultaneously. This thesis is composed of three major contributions. In the first chapter we describe a novel high-resolution protein structure refinement algorithm called GRID. In the second chapter we present REMCGRID, an algorithm for generation of low energy decoy sets. In the third chapter, we present a machine learning approach to ranking decoys by incorporating coarse-grain features of protein structures.
Resumo:
A new metalloproteinase-disintegrin, named Jerdonitin, was purified from Trimeresurus jerdonii venom with a molecular weight of 36 kDa on SDS-PAGE. It dose-dependently inhibited ADP-induced human platelet aggregation with IC50 of 120 nM. cDNA cloning and sequencing revealed that Jerdonitin belonged to the class II of snake venom metalloproteinases (SVMPs) (P-II class). Different from other P-II class SVMPs, metalloproteinase and disintegrin domains of its natural protein were not separated, confirmed by internal peptide sequencing. Compared to other P-II class SVMPs, Jerdonitin has two additional cysteines (Cys219 and Cys238) located in the spacer domain and disintegrin domain, respectively. They probably form a disulfide bond and therefore the metalloproteinase and disintegrin domains cannot be separated by posttranslationally processing. In summary, comparison of the amino acid sequences of Jerdonitin with those of other P-II class SVMPs by sequence alignment and phylogenetic analysis, in conjunction with natural protein structure data, suggested that it was a new type of P-II class SVMPs. (C) 2003 Elsevier Inc. All rights reserved.
Resumo:
We propose a new characterization of protein structure based on the natural tetrahedral geometry of the β carbon and a new geometric measure of structural similarity, called visible volume. In our model, the side-chains are replaced by an ideal tetrahedron, the orientation of which is fixed with respect to the backbone and corresponds to the preferred rotamer directions. Visible volume is a measure of the non-occluded empty space surrounding each residue position after the side-chains have been removed. It is a robust, parameter-free, locally-computed quantity that accounts for many of the spatial constraints that are of relevance to the corresponding position in the native structure. When computing visible volume, we ignore the nature of both the residue observed at each site and the ones surrounding it. We focus instead on the space that, together, these residues could occupy. By doing so, we are able to quantify a new kind of invariance beyond the apparent variations in protein families, namely, the conservation of the physical space available at structurally equivalent positions for side-chain packing. Corresponding positions in native structures are likely to be of interest in protein structure prediction, protein design, and homology modeling. Visible volume is related to the degree of exposure of a residue position and to the actual rotamers in native proteins. In this article, we discuss the properties of this new measure, namely, its robustness with respect to both crystallographic uncertainties and naturally occurring variations in atomic coordinates, and the remarkable fact that it is essentially independent of the choice of the parameters used in calculating it. We also show how visible volume can be used to align protein structures, to identify structurally equivalent positions that are conserved in a family of proteins, and to single out positions in a protein that are likely to be of biological interest. These properties qualify visible volume as a powerful tool in a variety of applications, from the detailed analysis of protein structure to homology modeling, protein structural alignment, and the definition of better scoring functions for threading purposes.
Resumo:
MOTIVATION: The accurate prediction of the quality of 3D models is a key component of successful protein tertiary structure prediction methods. Currently, clustering or consensus based Model Quality Assessment Programs (MQAPs) are the most accurate methods for predicting 3D model quality; however they are often CPU intensive as they carry out multiple structural alignments in order to compare numerous models. In this study, we describe ModFOLDclustQ - a novel MQAP that compares 3D models of proteins without the need for CPU intensive structural alignments by utilising the Q measure for model comparisons. The ModFOLDclustQ method is benchmarked against the top established methods in terms of both accuracy and speed. In addition, the ModFOLDclustQ scores are combined with those from our older ModFOLDclust method to form a new method, ModFOLDclust2, that aims to provide increased prediction accuracy with negligible computational overhead. RESULTS: The ModFOLDclustQ method is competitive with leading clustering based MQAPs for the prediction of global model quality, yet it is up to 150 times faster than the previous version of the ModFOLDclust method at comparing models of small proteins (<60 residues) and over 5 times faster at comparing models of large proteins (>800 residues). Furthermore, a significant improvement in accuracy can be gained over the previous clustering based MQAPs by combining the scores from ModFOLDclustQ and ModFOLDclust to form the new ModFOLDclust2 method, with little impact on the overall time taken for each prediction. AVAILABILITY: The ModFOLDclustQ and ModFOLDclust2 methods are available to download from: http://www.reading.ac.uk/bioinf/downloads/ CONTACT: l.j.mcguffin@reading.ac.uk.
Resumo:
A number of state-of-the-art protein structure prediction servers have been developed by researchers working in the Bioinformatics Unit at University College London. The popular PSIPRED server allows users to perform secondary structure prediction, transmembrane topology prediction and protein fold recognition. More recent servers include DISOPRED for the prediction of protein dynamic disorder and DomPred for domain boundary prediction.
Resumo:
The PSIPRED protein structure prediction server allows users to submit a protein sequence, perform a prediction of their choice and receive the results of the prediction both textually via e-mail and graphically via the web. The user may select one of three prediction methods to apply to their sequence: PSIPRED, a highly accurate secondary structure prediction method; MEMSAT 2, a new version of a widely used transmembrane topology prediction method; or GenTHREADER, a sequence profile based fold recognition method.
Resumo:
The PilZ protein was originally identified as necessary for type IV pilus (T4P) biogenesis. Since then, a large and diverse family of bacterial PilZ homology domains have been identified, some of which have been implicated in signaling pathways that control important processes, including motility, virulence and biofilm formation. Furthermore, many PilZ homology domains, though not PilZ itself, have been shown to bind the important bacterial second messenger bis(3`-> 5`)cyclic diGMP (c-diGMP). The crystal structures of the PilZ orthologs from Xanthomonas axonopodis pv Citri (PilZ(XAC1133), this work) and from Xanthomonas campestris pv campestris (XC1028) present significant structural differences to other PilZ homologs that explain its failure to bind c-diGMP. NMR analysis of PilZ(XAC1133) shows that these structural differences are maintained in solution. In spite of their emerging importance in bacterial signaling, the means by which NZ proteins regulate specific processes is not clear. In this study, we show that PilZ(XAC1133) binds to PilB, an ATPase required for TV polymerization, and to the EAL domain of FiMX(XAC2398), which regulates TV biogenesis and localization in other bacterial species. These interactions were confirmed in NMR, two-hybrid and far-Western blot assays and are the first interactions observed between any PilZ domain and a target protein. While we were unable to detect phosphodiesterase activity for FimXX(AC2398) in vitro, we show that it binds c-diGMP both in the presence and in the absence of PilZ(XAC1133). Site-directed mutagenesis studies for conserved and exposed residues suggest that PilZ(XAC1133) interactions with FimX(XAC2398) and PilB(XAC3239) are mediated through a hydrophobic surface and an unstructured C-terminal extension conserved only in PilZ orthologs. The FimX-PilZ-PilB interactions involve a full set of ""degenerate"" GGDEF, EAL and PilZ domains and provide the first evidence of the means by which PilZ orthologs and FimX interact directly with the TP4 machinery. (C) 2009 Elsevier Ltd. All rights reserved.
Resumo:
The vast majority of known proteins have not yet been experimentally characterized and little is known about their function. The design and implementation of computational tools can provide insight into the function of proteins based on their sequence, their structure, their evolutionary history and their association with other proteins. Knowledge of the three-dimensional (3D) structure of a protein can lead to a deep understanding of its mode of action and interaction, but currently the structures of <1% of sequences have been experimentally solved. For this reason, it became urgent to develop new methods that are able to computationally extract relevant information from protein sequence and structure. The starting point of my work has been the study of the properties of contacts between protein residues, since they constrain protein folding and characterize different protein structures. Prediction of residue contacts in proteins is an interesting problem whose solution may be useful in protein folding recognition and de novo design. The prediction of these contacts requires the study of the protein inter-residue distances related to the specific type of amino acid pair that are encoded in the so-called contact map. An interesting new way of analyzing those structures came out when network studies were introduced, with pivotal papers demonstrating that protein contact networks also exhibit small-world behavior. In order to highlight constraints for the prediction of protein contact maps and for applications in the field of protein structure prediction and/or reconstruction from experimentally determined contact maps, I studied to which extent the characteristic path length and clustering coefficient of the protein contacts network are values that reveal characteristic features of protein contact maps. Provided that residue contacts are known for a protein sequence, the major features of its 3D structure could be deduced by combining this knowledge with correctly predicted motifs of secondary structure. In the second part of my work I focused on a particular protein structural motif, the coiled-coil, known to mediate a variety of fundamental biological interactions. Coiled-coils are found in a variety of structural forms and in a wide range of proteins including, for example, small units such as leucine zippers that drive the dimerization of many transcription factors or more complex structures such as the family of viral proteins responsible for virus-host membrane fusion. The coiled-coil structural motif is estimated to account for 5-10% of the protein sequences in the various genomes. Given their biological importance, in my work I introduced a Hidden Markov Model (HMM) that exploits the evolutionary information derived from multiple sequence alignments, to predict coiled-coil regions and to discriminate coiled-coil sequences. The results indicate that the new HMM outperforms all the existing programs and can be adopted for the coiled-coil prediction and for large-scale genome annotation. Genome annotation is a key issue in modern computational biology, being the starting point towards the understanding of the complex processes involved in biological networks. The rapid growth in the number of protein sequences and structures available poses new fundamental problems that still deserve an interpretation. Nevertheless, these data are at the basis of the design of new strategies for tackling problems such as the prediction of protein structure and function. Experimental determination of the functions of all these proteins would be a hugely time-consuming and costly task and, in most instances, has not been carried out. As an example, currently, approximately only 20% of annotated proteins in the Homo sapiens genome have been experimentally characterized. A commonly adopted procedure for annotating protein sequences relies on the "inheritance through homology" based on the notion that similar sequences share similar functions and structures. This procedure consists in the assignment of sequences to a specific group of functionally related sequences which had been grouped through clustering techniques. The clustering procedure is based on suitable similarity rules, since predicting protein structure and function from sequence largely depends on the value of sequence identity. However, additional levels of complexity are due to multi-domain proteins, to proteins that share common domains but that do not necessarily share the same function, to the finding that different combinations of shared domains can lead to different biological roles. In the last part of this study I developed and validate a system that contributes to sequence annotation by taking advantage of a validated transfer through inheritance procedure of the molecular functions and of the structural templates. After a cross-genome comparison with the BLAST program, clusters were built on the basis of two stringent constraints on sequence identity and coverage of the alignment. The adopted measure explicity answers to the problem of multi-domain proteins annotation and allows a fine grain division of the whole set of proteomes used, that ensures cluster homogeneity in terms of sequence length. A high level of coverage of structure templates on the length of protein sequences within clusters ensures that multi-domain proteins when present can be templates for sequences of similar length. This annotation procedure includes the possibility of reliably transferring statistically validated functions and structures to sequences considering information available in the present data bases of molecular functions and structures.
Resumo:
Structure and folding of membrane proteins are important issues in molecular and cell biology. In this work new approaches are developed to characterize the structure of folded, unfolded and partially folded membrane proteins. These approaches combine site-directed spin labeling and pulse EPR techniques. The major plant light harvesting complex LHCIIb was used as a model system. Measurements of longitudinal and transversal relaxation times of electron spins and of hyperfine couplings to neighboring nuclei by electron spin echo envelope modulation(ESEEM) provide complementary information about the local environment of a single spin label. By double electron electron resonance (DEER) distances in the nanometer range between two spin labels can be determined. The results are analyzed in terms of relative water accessibilities of different sites in LHCIIb and its geometry. They reveal conformational changes as a function of micelle composition. This arsenal of methods is used to study protein folding during the LHCIIb self assembly and a spatially and temporally resolved folding model is proposed. The approaches developed here are potentially applicable for studying structure and folding of any protein or other self-assembling structure if site-directed spin labeling is feasible and the time scale of folding is accessible to freeze-quench techniques.
Resumo:
We have quantitated the degree of structural preservation in cryo-sections of a vitrified biological specimen. Previous studies have used sections of periodic specimens to assess the resolution present, but preservation before sectioning was not assessed and so the damage due particularly to cutting was not clear. In this study large single crystals of lysozyme were vitrified and from these X-ray diffraction patterns extending to better than 2.1A were obtained. The crystals were high pressure frozen in 30% dextran, and cryo-sectioned using a diamond knife. In the best case, preservation to a resolution of 7.9A was shown by electron diffraction, the first observation of sub-nanometre structural preservation in a vitreous section.
Resumo:
We present evidence that the size of an active site side chain may modulate the degree of hydrogen tunneling in an enzyme-catalyzed reaction. Primary and secondary kH/kT and kD/kT kinetic isotope effects have been measured for the oxidation of benzyl alcohol catalyzed by horse liver alcohol dehydrogenase at 25°C. As reported in earlier studies, the relationship between secondary kH/kT and kD/kT isotope effects provides a sensitive probe for deviations from classical behavior. In the present work, catalytic efficiency and the extent of hydrogen tunneling have been correlated for the alcohol dehydrogenase-catalyzed hydride transfer among a group of site-directed mutants at position 203. Val-203 interacts with the opposite face of the cofactor NAD+ from the alcohol substrate. The reduction in size of this residue is correlated with diminished tunneling and a two orders of magnitude decrease in catalytic efficiency. Comparison of the x-ray crystal structures of a ternary complex of a high-tunneling (Phe-93 → Trp) and a low-tunneling (Val-203 → Ala) mutant provides a structural basis for the observed effects, demonstrating an increase in the hydrogen transfer distance for the low-tunneling mutant. The Val-203 → Ala ternary complex crystal structure also shows a hyperclosed interdomain geometry relative to the wild-type and the Phe-93 → Trp mutant ternary complex structures. This demonstrates a flexibility in interdomain movement that could potentially narrow the distance between the donor and acceptor carbons in the native enzyme and may enhance the role of tunneling in the hydride transfer reaction.