Prediction of contact maps with neural networks and correlated mutations

被引:143
作者
Fariselli, P
Olmea, O
Valencia, A
Casadio, R
机构
[1] Univ Bologna, CIRB, Bologna, Italy
[2] Univ Bologna, Dept Biol, Bologna, Italy
[3] CSIC, CNB, Prot Design Grp, E-28049 Madrid, Spain
来源
PROTEIN ENGINEERING | 2001年 / 14卷 / 11期
关键词
contact maps; correlated mutations; neural networks; protein structure predictions; residue contacts;
D O I
10.1093/protein/14.11.835
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Contact maps of proteins are predicted with neural network-based methods, using as input codings of increasing complexity including evolutionary information, sequence conservation, correlated mutations and predicted secondary structures. Neural networks are trained on a data set comprising the contact maps of 173 inion-homologous proteins as computed from their well resolved three-dimensional structures. Proteins are selected from the Protein Data Bank database provided that they align with at least 15 similar sequences in the corresponding families. The predictors are trained to learn the association rules between the covalent structure of each protein and its contact map with a standard back propagation algorithm and tested on the same protein set with a cross-validation procedure. Our results indicate that the method can assign protein contacts with an average accuracy of 0.21 and with an improvement over a random predictor of a factor >6, which is higher than that previously obtained with methods only based either on neural networks or on correlated mutations. Furthermore, filtering the network outputs with a procedure based on the residue coordination numbers, the accuracy of predictions increases up to 0.25 for all the proteins, with an 8-fold deviation from a random predictor. These scores are the highest reported so far for predicting protein contact maps.
引用
收藏
页码:835 / 843
页数:9
相关论文
共 41 条
[1]   PRINCIPLES THAT GOVERN FOLDING OF PROTEIN CHAINS [J].
ANFINSEN, CB .
SCIENCE, 1973, 181 (4096) :223-230
[2]   A NOVEL-APPROACH TO PREDICTION OF THE 3-DIMENSIONAL STRUCTURES OF PROTEIN BACKBONES BY NEURAL NETWORKS [J].
BOHR, H ;
BOHR, J ;
BRUNAK, S ;
COTTERILL, RMJ ;
FREDHOLM, H ;
LAUTRUP, B ;
PETERSEN, SB .
FEBS LETTERS, 1990, 261 (01) :43-46
[3]   PROTEIN STRUCTURES FROM DISTANCE INEQUALITIES [J].
BOHR, J ;
BOHR, H ;
BRUNAK, S ;
COTTERILL, RMJ ;
FREDHOLM, H ;
LAUTRUP, B ;
PETERSEN, SB .
JOURNAL OF MOLECULAR BIOLOGY, 1993, 231 (03) :861-869
[4]   An entropy criterion to detect minimally frustrated intermediates in native proteins [J].
Compiani, M ;
Fariselli, P ;
Martelli, PL ;
Casadio, R .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (16) :9290-9294
[5]   PROTEIN-STRUCTURE PREDICTION - RECOGNITION OF PRIMARY, SECONDARY, AND TERTIARY STRUCTURAL FEATURES FROM AMINO-ACID-SEQUENCE [J].
EISENHABER, F ;
PERSSON, B ;
ARGOS, P .
CRITICAL REVIEWS IN BIOCHEMISTRY AND MOLECULAR BIOLOGY, 1995, 30 (01) :1-94
[6]  
FARISELLI P, 1993, EUR BIOPHYS J BIOPHY, V22, P41, DOI 10.1007/BF00205811
[7]  
Fariselli P, 1998, WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL 1, PROCEEDINGS, P527
[8]   A neural network based predictor of residue contacts in proteins [J].
Fariselli, P ;
Casadio, R .
PROTEIN ENGINEERING, 1999, 12 (01) :15-21
[9]   CORRELATED MUTATIONS AND RESIDUE CONTACTS IN PROTEINS [J].
GOBEL, U ;
SANDER, C ;
SCHNEIDER, R ;
VALENCIA, A .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1994, 18 (04) :309-317
[10]   TOPOLOGY FINGERPRINT APPROACH TO THE INVERSE PROTEIN FOLDING PROBLEM [J].
GODZIK, A ;
KOLINSKI, A ;
SKOLNICK, J .
JOURNAL OF MOLECULAR BIOLOGY, 1992, 227 (01) :227-238