Protein contact prediction using patterns of correlation


Autoria(s): Hamilton, Nicholas; Burrage, Kevin; Ragan, Mark A.; Huber, Thomas
Contribuinte(s)

E.E. Lattman

et al.

Data(s)

01/01/2004

Resumo

We describe a new method for using neural networks to predict residue contact pairs in a protein. The main inputs to the neural network are a set of 25 measures of correlated mutation between all pairs of residues in two windows of size 5 centered on the residues of interest. While the individual pair-wise correlations are a relatively weak predictor of contact, by training the network on windows of correlation the accuracy of prediction is significantly improved. The neural network is trained on a set of 100 proteins and then tested on a disjoint set of 1033 proteins of known structure. An average predictive accuracy of 21.7% is obtained taking the best L/2 predictions for each protein, where L is the sequence length. Taking the best L/10 predictions gives an average accuracy of 30.7%. The predictor is also tested on a set of 59 proteins from the CASP5 experiment. The accuracy is found to be relatively consistent across different sequence lengths, but to vary widely according to the secondary structure. Predictive accuracy is also found to improve by using multiple sequence alignments containing many sequences to calculate the correlations. (C) 2004 Wiley-Liss, Inc.

Identificador

http://espace.library.uq.edu.au/view/UQ:73625

Idioma(s)

eng

Publicador

Wiley

Palavras-Chave #Protein Structure Prediction #Predicted Contact Map #Correlated Mutation #Neural Network #Casp5 #Biochemistry & Molecular Biology #Biophysics #Residue Contacts #Secondary Structure #Mutations #Templates #C1 #239901 Biological Mathematics #780105 Biological sciences #250503 Characterisation of Macromolecules
Tipo

Journal Article