Progress in predicting inter-residue contacts of proteins with neural networks and correlated mutations

被引:71
作者
Fariselli, P
Olmea, O
Valencia, A
Casadio, R
机构
[1] Univ Bologna, Dept Biol, I-40126 Bologna, Italy
[2] Univ Bologna, CIRB, Bologna, Italy
[3] CSIC Cantoblanco, CNB, Prot Design Grp, Madrid, Spain
关键词
protein structure predictions; contact maps; correlated mutations; neural networks; residue contacts;
D O I
10.1002/prot.1173
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
This article presents recent progress in predicting inter-residue contacts of proteins with a neural network-based method. Improvement over the results obtained at the previous CASP3 competition is attained by using as input to the network a complex code, which includes evolutionary information, sequence conservation, correlated mutations, and predicted secondary structures. The predictor was trained and cross-validated on a data set comprising the contact maps of 173 non-homologous proteins as computed from their well-resolved three-dimensional structures. The method could assign protein contacts with an average accuracy of 0.21 and with an improvement over a random predictor of a factor greater than 6, which is higher than that previously obtained with methods only based either on neural networks or on correlated mutations. Although far from being ideal, these scores are the highest reported so far for predicting protein contact maps. On 29 targets automatically predicted by the server (CORNET) the average accuracy is 0.14. The predictor is poorly performing on all-a proteins, not represented in the training set. On all-beta and mixed proteins (22 targets) the average accuracy is 0.16. This set comprises proteins of different complexity and different chain length, suggesting that the predictor is capable of generalization over a broad number of features. (C) 2002 Wiley-Liss, Inc.
引用
收藏
页码:157 / 162
页数:6
相关论文
共 15 条
  • [1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [2] Creighton T.E., 1997, PROTEINS STRUCTURES, VSecond
  • [3] The HSSP database of protein structure sequence alignments and family profiles
    Dodge, C
    Schneider, R
    Sander, C
    [J]. NUCLEIC ACIDS RESEARCH, 1998, 26 (01) : 313 - 315
  • [4] FARISELLI P, 1993, EUR BIOPHYS J BIOPHY, V22, P41, DOI 10.1007/BF00205811
  • [5] A neural network based predictor of residue contacts in proteins
    Fariselli, P
    Casadio, R
    [J]. PROTEIN ENGINEERING, 1999, 12 (01): : 15 - 21
  • [6] CORRELATED MUTATIONS AND RESIDUE CONTACTS IN PROTEINS
    GOBEL, U
    SANDER, C
    SCHNEIDER, R
    VALENCIA, A
    [J]. PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1994, 18 (04): : 309 - 317
  • [7] Jacoboni I, 2000, PROTEINS, V41, P535, DOI 10.1002/1097-0134(20001201)41:4<535::AID-PROT100>3.0.CO
  • [8] 2-C
  • [9] Protein distance constraints predicted by neural networks and probability density functions
    Lund, O
    Frimand, K
    Gorodkin, J
    Bohr, H
    Bohr, J
    Hansen, J
    Brunak, S
    [J]. PROTEIN ENGINEERING, 1997, 10 (11): : 1241 - 1248
  • [10] TESTS FOR COMPARING RELATED AMINO-ACID SEQUENCES CYTOCHROME-C AND CYTOCHROME-C551
    MCLACHLAN, AD
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1971, 61 (02) : 409 - +