18 resultados para Fold
Resumo:
New experiments underpin the interpretation of the basic division in crystallization behaviour of polyethylene in terms of whether or not there is time for the fold surface to order before the next molecular layer is added at the growth front. For typical growth rates, in Regime 11, polyethylene lamellae form with disordered {001} fold surfaces then transform, with lamellar thickening and twisting, towards the more-ordered condition found for slower crystallization in Regime 1, in which lamellae form with and retain {201} fold surfaces. Several linear and linear-low-density polyethylenes have been used to show that, for the same polymer crystallized alone or in a blend, the growth rate at which the change in initial lamellar condition occurs is reasonably constant thereby supporting the concept of a specific time for surfaces to attain the ordered {201}) state. This specific time, in the range from milliseconds to seconds, increases with molecular length, and in linear-low-density polymer, for higher branch contents. (c) 2006 Elsevier Ltd. All rights reserved.
Resumo:
The nuclear magnetic resonance (NMR) structure of a central segment of the previously annotated severe acute respiratory syndrome (SARS)-unique domain (SUD-M, for "middle of the SARS-unique domain") in SARS coronavirus (SARS-CoV) nonstructural protein 3 (nsp3) has been determined. SUD-M(513-651) exhibits a macrodomain fold containing the nsp3 residues 528 to 648, and there is a flexibly extended N-terminal tail with the residues 513 to 527 and a C-terminal flexible tail of residues 649 to 651. As a follow-up to this initial result, we also solved the structure of a construct representing only the globular domain of residues 527 to 651 [SUD-M(527-651)]. NMR chemical shift perturbation experiments showed that SUD-M(527-651) binds single-stranded poly(A) and identified the contact area with this RNA on the protein surface, and electrophoretic mobility shift assays then confirmed that SUD-M has higher affinity for purine bases than for pyrimidine bases. In a further search for clues to the function, we found that SUD-M(527-651) has the closest three-dimensional structure homology with another domain of nsp3, the ADP-ribose-1 ''-phosphatase nsp3b, although the two proteins share only 5% sequence identity in the homologous sequence regions. SUD-M(527-651) also shows three-dimensional structure homology with several helicases and nucleoside triphosphate-binding proteins, but it does not contain the motifs of catalytic residues found in these structural homologues. The combined results from NMR screening of potential substrates and the structure-based homology studies now form a basis for more focused investigations on the role of the SARS-unique domain in viral infection.
Resumo:
The NMR structure of a central segment of the previously annotated "SARS-unique domain" (SUD-M; "middle of the SARS-unique domain") in the SARS coronavirus (SARS-CoV) non-structural protein 3 (nsp3) has been determined. SUD-M(513-651) exhibits a macrodomain fold containing the nsp3-residues 528-648, and there is a flexibly extended N-terminal tail with the residues 513-527 and a C-terminal flexible tail of residues 649-651. As a follow-up to this initial result, we also solved the structure of a construct representing only the globular domain of residues 527-651 [SUD-M(527-651)]. NMR chemical shift perturbation experiments showed that SUD-M(527-651) binds single-stranded poly-A and identified the contact area with this RNA on the protein surface, and electrophoretic mobility shift assays then confirmed that SUD-M has higher affinity for purine bases than for pyrimidine bases. In further search for clues to the function, we found that SUD-M(527-651) has the closest three-dimensional structure homology with another domain of nsp3, the ADP-ribose-1''-phosphatase nsp3b, although the two proteins share only 5% sequence identity in the homologous sequence regions. SUD-M(527-651) also shows 3D structure homology with several helicases and NTP-binding proteins, but it does not contain the motifs of catalytic residues found in these structural homologues. The combined results from NMR screening of potential substrates and the structure-based homology studies now form a basis for more focused investigations on the role of the SARS-unique domain in viral infection.
Resumo:
Motivation: Intrinsic protein disorder is functionally implicated in numerous biological roles and is, therefore, ubiquitous in proteins from all three kingdoms of life. Determining the disordered regions in proteins presents a challenge for experimental methods and so recently there has been much focus on the development of improved predictive methods. In this article, a novel technique for disorder prediction, called DISOclust, is described, which is based on the analysis of multiple protein fold recognition models. The DISOclust method is rigorously benchmarked against the top.ve methods from the CASP7 experiment. In addition, the optimal consensus of the tested methods is determined and the added value from each method is quantified. Results: The DISOclust method is shown to add the most value to a simple consensus of methods, even in the absence of target sequence homology to known structures. A simple consensus of methods that includes DISOclust can significantly outperform all of the previous individual methods tested.
Resumo:
Background: Selecting the highest quality 3D model of a protein structure from a number of alternatives remains an important challenge in the field of structural bioinformatics. Many Model Quality Assessment Programs (MQAPs) have been developed which adopt various strategies in order to tackle this problem, ranging from the so called "true" MQAPs capable of producing a single energy score based on a single model, to methods which rely on structural comparisons of multiple models or additional information from meta-servers. However, it is clear that no current method can separate the highest accuracy models from the lowest consistently. In this paper, a number of the top performing MQAP methods are benchmarked in the context of the potential value that they add to protein fold recognition. Two novel methods are also described: ModSSEA, which based on the alignment of predicted secondary structure elements and ModFOLD which combines several true MQAP methods using an artificial neural network. Results: The ModSSEA method is found to be an effective model quality assessment program for ranking multiple models from many servers, however further accuracy can be gained by using the consensus approach of ModFOLD. The ModFOLD method is shown to significantly outperform the true MQAPs tested and is competitive with methods which make use of clustering or additional information from multiple servers. Several of the true MQAPs are also shown to add value to most individual fold recognition servers by improving model selection, when applied as a post filter in order to re-rank models. Conclusion: MQAPs should be benchmarked appropriately for the practical context in which they are intended to be used. Clustering based methods are the top performing MQAPs where many models are available from many servers; however, they often do not add value to individual fold recognition servers when limited models are available. Conversely, the true MQAP methods tested can often be used as effective post filters for re-ranking few models from individual fold recognition servers and further improvements can be achieved using a consensus of these methods.
Resumo:
BACKGROUND: In order to maintain the most comprehensive structural annotation databases we must carry out regular updates for each proteome using the latest profile-profile fold recognition methods. The ability to carry out these updates on demand is necessary to keep pace with the regular updates of sequence and structure databases. Providing the highest quality structural models requires the most intensive profile-profile fold recognition methods running with the very latest available sequence databases and fold libraries. However, running these methods on such a regular basis for every sequenced proteome requires large amounts of processing power.In this paper we describe and benchmark the JYDE (Job Yield Distribution Environment) system, which is a meta-scheduler designed to work above cluster schedulers, such as Sun Grid Engine (SGE) or Condor. We demonstrate the ability of JYDE to distribute the load of genomic-scale fold recognition across multiple independent Grid domains. We use the most recent profile-profile version of our mGenTHREADER software in order to annotate the latest version of the Human proteome against the latest sequence and structure databases in as short a time as possible. RESULTS: We show that our JYDE system is able to scale to large numbers of intensive fold recognition jobs running across several independent computer clusters. Using our JYDE system we have been able to annotate 99.9% of the protein sequences within the Human proteome in less than 24 hours, by harnessing over 500 CPUs from 3 independent Grid domains. CONCLUSION: This study clearly demonstrates the feasibility of carrying out on demand high quality structural annotations for the proteomes of major eukaryotic organisms. Specifically, we have shown that it is now possible to provide complete regular updates of profile-profile based fold recognition models for entire eukaryotic proteomes, through the use of Grid middleware such as JYDE.
Resumo:
The hydrothermal reactions of Ni(NO3)(2).6H(2)O, disodium fumarate (fum) and 1,2-bis(4-pyridyl)ethane (bpe)/1,3-bis(4-pyridyl) propane (bpp) in aqueous-methanol medium yield one 3-D and one 2-D metal-organic hybrid material, [Ni(fum)(bpe)] (1) and [Ni(fum)(bpp)(H2O)] (2), respectively. Complex 1 possesses a novel unprecedented structure, the first example of an "unusual mode" of a five-fold distorted interpenetrated network with metal-ligand linkages where the four six-membered windows in each adamantane-type cage are different. The structural characterization of complex 2 evidences a buckled sheet where nickel ions are in a distorted octahedral geometry, with two carboxylic groups, one acting as a bis-chelate, the other as a bis-monodentate ligand. The metal ion completes the coordination sphere through one water molecule and two bpp nitrogens in cis position. Variable-temperature magnetic measurements of complexes 1 and 2 reveal the existence of very weak antiferromagnetic intramolecular interactions and/or the presence of single-ion zero field splitting (D) of isolated Ni-II ions in both the compounds. Experimentally, both the J parameters are close, comparable and very small. Considering zero-field splitting of Ni-II, the calculated D values are in agreement with values reported in the literature for Ni-II ions. Complex 3, [{Co(phen)}(2)(fum)(2)] (phen=1,10-phenanthroline) is obtained by diffusing methanolic solution of 1,10-phenanthroline on an aqueous layer of disodium fumarate and Co(NO3)(2).6H(2)O. It consists of dimeric Co-II(phen) units, doubly bridged by carboxylate groups in a distorted syn-syn fashion. These fumarate anions act as bis-chelates to form corrugated sheets. The 2D layer has a (4,4) topology, with the nodes represented by the centres of the dimers. The magnetic data were fitted ignoring the very weak coupling through the fumarate pathway and using a dimer model.
Resumo:
Motivation: The ability of a simple method (MODCHECK) to determine the sequence–structure compatibility of a set of structural models generated by fold recognition is tested in a thorough benchmark analysis. Four Model Quality Assessment Programs (MQAPs) were tested on 188 targets from the latest LiveBench-9 automated structure evaluation experiment. We systematically test and evaluate whether the MQAP methods can successfully detect native-likemodels. Results: We show that compared with the other three methods tested MODCHECK is the most reliable method for consistently performing the best top model selection and for ranking the models. In addition, we show that the choice of model similarity score used to assess a model's similarity to the experimental structure can influence the overall performance of these tools. Although these MQAP methods fail to improve the model selection performance for methods that already incorporate protein three dimension (3D) structural information, an improvement is observed for methods that are purely sequence-based, including the best profile–profile methods. This suggests that even the best sequence-based fold recognition methods can still be improved by taking into account the 3D structural information.
Resumo:
A number of new and newly improved methods for predicting protein structure developed by the Jones–University College London group were used to make predictions for the CASP6 experiment. Structures were predicted with a combination of fold recognition methods (mGenTHREADER, nFOLD, and THREADER) and a substantially enhanced version of FRAGFOLD, our fragment assembly method. Attempts at automatic domain parsing were made using DomPred and DomSSEA, which are based on a secondary structure parsing algorithm and additionally for DomPred, a simple local sequence alignment scoring function. Disorder prediction was carried out using a new SVM-based version of DISOPRED. Attempts were also made at domain docking and “microdomain” folding in order to build complete chain models for some targets.
Resumo:
Motivation: In order to enhance genome annotation, the fully automatic fold recognition method GenTHREADER has been improved and benchmarked. The previous version of GenTHREADER consisted of a simple neural network which was trained to combine sequence alignment score, length information and energy potentials derived from threading into a single score representing the relationship between two proteins, as designated by CATH. The improved version incorporates PSI-BLAST searches, which have been jumpstarted with structural alignment profiles from FSSP, and now also makes use of PSIPRED predicted secondary structure and bi-directional scoring in order to calculate the final alignment score. Pairwise potentials and solvation potentials are calculated from the given sequence alignment which are then used as inputs to a multi-layer, feed-forward neural network, along with the alignment score, alignment length and sequence length. The neural network has also been expanded to accommodate the secondary structure element alignment (SSEA) score as an extra input and it is now trained to learn the FSSP Z-score as a measurement of similarity between two proteins. Results: The improvements made to GenTHREADER increase the number of remote homologues that can be detected with a low error rate, implying higher reliability of score, whilst also increasing the quality of the models produced. We find that up to five times as many true positives can be detected with low error rate per query. Total MaxSub score is doubled at low false positive rates using the improved method.
Resumo:
If secondary structure predictions are to be incorporated into fold recognition methods, an assessment of the effect of specific types of errors in predicted secondary structures on the sensitivity of fold recognition should be carried out. Here, we present a systematic comparison of different secondary structure prediction methods by measuring frequencies of specific types of error. We carry out an evaluation of the effect of specific types of error on secondary structure element alignment (SSEA), a baseline fold recognition method. The results of this evaluation indicate that missing out whole helix or strand elements, or predicting the wrong type of element, is more detrimental than predicting the wrong lengths of elements or overpredicting helix or strand. We also suggest that SSEA scoring is an effective method for assessing accuracy of secondary structure prediction and perhaps may also provide a more appropriate assessment of the “usefulness” and quality of predicted secondary structure, if secondary structure alignments are to be used in fold recognition.