907 resultados para secondary structure detection
Resumo:
The king cobra neuotoxin CM-11 is a small protein with 72 amino acid residues. After its complete assignments of H-1-NMR resonance's were obtained using various 2D-NMR technologies, including of DQF-COSY, clean-TOCSY AND NOESY, the secondary structure was analysed by studying the various NOEs extracted from the NOESY spectra and the distribution of chemical shifts. The secondary structure was finally determined by MCD as follows: a triple-strand antiparallel beta sheet with I20-W36, R37-A43 and V53--S59 as its beta strands, a short alpha helix formed by W30-G35 and four turns formed by P7-K10, C14-G17, K50-V53 and D61-N64.
Resumo:
The hybridization kinetics for a series of designed 25mer probe�target pairs having varying degrees of secondary structure have been measured by UV absorbance and surface plasmon resonance (SPR) spectroscopy in solution and on the surface, respectively. Kinetic rate constants derived from the resultant data decrease with increasing probe and target secondary structure similarly in both solution and surface environments. Specifically, addition of three intramolecular base pairs in the probe and target structure slow hybridization by a factor of two. For individual strands containing four or more intramolecular base pairs, hybridization cannot be described by a traditional two-state model in solution-phase nor on the surface. Surface hybridization rates are also 20- to 40-fold slower than solution-phase rates for identical sequences and conditions. These quantitative findings may have implications for the design of better biosensors, particularly those using probes with deliberate secondary structure.
Resumo:
The classification of protein structures is an important and still outstanding problem. The purpose of this paper is threefold. First, we utilize a relation between the Tutte and homfly polynomial to show that the Alexander-Conway polynomial can be algorithmically computed for a given planar graph. Second, as special cases of planar graphs, we use polymer graphs of protein structures. More precisely, we use three building blocks of the three-dimensional protein structure-alpha-helix, antiparallel beta-sheet, and parallel beta-sheet-and calculate, for their corresponding polymer graphs, the Tutte polynomials analytically by providing recurrence equations for all three secondary structure elements. Third, we present numerical results comparing the results from our analytical calculations with the numerical results of our algorithm-not only to test consistency, but also to demonstrate that all assigned polynomials are unique labels of the secondary structure elements. This paves the way for an automatic classification of protein structures.
Resumo:
A very efficient learning algorithm for model subset selection is introduced based on a new composite cost function that simultaneously optimizes the model approximation ability and model adequacy. The derived model parameters are estimated via forward orthogonal least squares, but the subset selection cost function includes an A-optimality design criterion to minimize the variance of the parameter estimates that ensures the adequacy and parsimony of the final model. An illustrative example is included to demonstrate the effectiveness of the new approach.
Resumo:
Motivation: A new method that uses support vector machines (SVMs) to predict protein secondary structure is described and evaluated. The study is designed to develop a reliable prediction method using an alternative technique and to investigate the applicability of SVMs to this type of bioinformatics problem. Methods: Binary SVMs are trained to discriminate between two structural classes. The binary classifiers are combined in several ways to predict multi-class secondary structure. Results: The average three-state prediction accuracy per protein (Q3) is estimated by cross-validation to be 77.07 ± 0.26% with a segment overlap (Sov) score of 73.32 ± 0.39%. The SVM performs similarly to the 'state-of-the-art' PSIPRED prediction method on a non-homologous test set of 121 proteins despite being trained on substantially fewer examples. A simple consensus of the SVM, PSIPRED and PROFsec achieves significantly higher prediction accuracy than the individual methods. Availability: The SVM classifier is available from the authors. Work is in progress to make the method available on-line and to integrate the SVM predictions into the PSIPRED server.
Resumo:
If secondary structure predictions are to be incorporated into fold recognition methods, an assessment of the effect of specific types of errors in predicted secondary structures on the sensitivity of fold recognition should be carried out. Here, we present a systematic comparison of different secondary structure prediction methods by measuring frequencies of specific types of error. We carry out an evaluation of the effect of specific types of error on secondary structure element alignment (SSEA), a baseline fold recognition method. The results of this evaluation indicate that missing out whole helix or strand elements, or predicting the wrong type of element, is more detrimental than predicting the wrong lengths of elements or overpredicting helix or strand. We also suggest that SSEA scoring is an effective method for assessing accuracy of secondary structure prediction and perhaps may also provide a more appropriate assessment of the “usefulness” and quality of predicted secondary structure, if secondary structure alignments are to be used in fold recognition.
Resumo:
The elucidation of the domain content of a given protein sequence in the absence of determined structure or significant sequence homology to known domains is an important problem in structural biology. Here we address how successfully the delineation of continuous domains can be accomplished in the absence of sequence homology using simple baseline methods, an existing prediction algorithm (Domain Guess by Size), and a newly developed method (DomSSEA). The study was undertaken with a view to measuring the usefulness of these prediction methods in terms of their application to fully automatic domain assignment. Thus, the sensitivity of each domain assignment method was measured by calculating the number of correctly assigned top scoring predictions. We have implemented a new continuous domain identification method using the alignment of predicted secondary structures of target sequences against observed secondary structures of chains with known domain boundaries as assigned by Class Architecture Topology Homology (CATH). Taking top predictions only, the success rate of the method in correctly assigning domain number to the representative chain set is 73.3%. The top prediction for domain number and location of domain boundaries was correct for 24% of the multidomain set (±20 residues). These results have been put into context in relation to the results obtained from the other prediction methods assessed
Resumo:
The complete sequences of the dsrA and dsrB genes coding for the α− and β−subunits, respectively, of the sulphite reductase enzyme in Desulfovibrio desulfuricans were determined. Analyses of the amino acid sequences indicated a number of serohaem/Fe4S4 binding consensus sequences whilst predictive secondary structure analysis revealed a similar pattern of α−helix and β−strand structures between the two subunits which was indicative of gene duplication.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
The proline-rich N-terminal domain of gamma-zein has been reported in relevant process, which include its ability to cross the cell membranes. Evidences indicate that synthetic hexapeptide (PPPVHL), naturally found in N-terminal portion of gamma-zein, can adopt the polyproline II (PPII) conformation in aqueous solution. The secondary structure of gamma-zein in maize protein bodies had been analyzed by solid state Fourier transform infrared and nuclear magnetic resonance spectroscopies. However, it was not possible to measure PPII content in physiological environment since the beta-sheet and PPII signals overlap in both solid state techniques. Here, the secondary structure of gamma-zein has been analyzed by circular dichroism in SDS aqueous solution with and without ditiothreitol (DTT), and in 60% of 2-propanol and water with DTT The results show that gamma-zein has high helical content in all solutions. The PPII conformation was present at about 7% only in water/DTT solution. (c) 2007 Wiley Periodicals, Inc.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Identification and classification of overlapping nodes in networks are important topics in data mining. In this paper, a network-based (graph-based) semi-supervised learning method is proposed. It is based on competition and cooperation among walking particles in a network to uncover overlapping nodes by generating continuous-valued outputs (soft labels), corresponding to the levels of membership from the nodes to each of the communities. Moreover, the proposed method can be applied to detect overlapping data items in a data set of general form, such as a vector-based data set, once it is transformed to a network. Usually, label propagation involves risks of error amplification. In order to avoid this problem, the proposed method offers a mechanism to identify outliers among the labeled data items, and consequently prevents error propagation from such outliers. Computer simulations carried out for synthetic and real-world data sets provide a numeric quantification of the performance of the method. © 2012 Springer-Verlag.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
A scheme is presented in which an organic solvent environment in combination with surfactants is used to confine a natively unfolded protein inside an inverse microemulsion droplet. This type of confinement allows a study that provides unique insight into the dynamic structure of an unfolded, flexible protein which is still solvated and thus under near-physiological conditions. In a model system, the protein osteopontin (OPN) is used. It is a highly phosphorylated glycoprotein that is expressed in a wide range of cells and tissues for which limited structural analysis exists due to the high degree of flexibility and large number of post-translational modifications. OPN is implicated in tissue functions, such as inflammation and mineralisation. It also has a key function in tumour metastasis and progression. Circular dichroism measurements show that confinement enhances the secondary structural features of the protein. Small-angle X-ray scattering and dynamic light scattering show that OPN changes from being a flexible protein in aqueous solution to adopting a less flexible and more compact structure inside the microemulsion droplets. This novel approach for confining proteins while they are still hydrated may aid in studying the structure of a wide range of natively unfolded proteins.