Real value prediction of solvent accessibility from amino acid sequence

被引:163
作者
Ahmad, S
Gromiha, MM
Sarai, A
机构
[1] RIKEN, Tsukuba Inst, Tsukuba, Ibaraki 3050074, Japan
[2] AIST, CBRC, Tokyo, Japan
关键词
structure prediction; accessible surface area; neural network;
D O I
10.1002/prot.10328
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The solvent accessibility of amino acid residues has been predicted in the past by classifying them into exposure states with varying thresholds. This classification provides a wide range of values for the accessible surface area (ASA) within which a residue may fall. Thus far, no attempt has been made to predict real values of ASA from the sequence information without a priori classification into exposure states. Here, we present a new method with which to predict real value ASAs for residues, based on neighborhood information. Our real value prediction neural network could estimate the ASA for four different nonhomologous, nonredundant data sets of varying size, with 18.0-19.5% mean absolute error, defined as per residue absolute difference between the predicted and experimental values of relative ASA. Correlation between the predicted and experimental values ranged from 0.47 to 0.50. It was observed that the ASA of a residue could be predicted within a 23.7% mean absolute error, even when no information about its neighbors is included. Prediction of real values answers the issue of arbitrary choice of ASA state thresholds, and carries more information than category prediction. Prediction error for each residue type strongly correlates with the variability in its experimental ASA values. (C) 2003 Wiley-Liss, Inc.
引用
收藏
页码:629 / 635
页数:7
相关论文
共 24 条
  • [1] NETASA: neural network based prediction of solvent accessibility
    Ahmad, S
    Gromiha, MM
    [J]. BIOINFORMATICS, 2002, 18 (06) : 819 - 824
  • [2] Protein structure prediction
    Benner, SA
    Geroff, DL
    Rozzell, JD
    [J]. SCIENCE, 1996, 274 (5292) : 1448 - 1449
  • [3] Predicting residue solvent accessibility from protein sequence by considering the sequence environment
    Carugo, O
    [J]. PROTEIN ENGINEERING, 2000, 13 (09): : 607 - 609
  • [4] CHANDONIA JM, 1995, PROTEIN SCI, V4, P275
  • [5] Protein threading by learning
    Chang, I
    Cieplak, M
    Dima, RI
    Maritan, A
    Banavar, JR
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (25) : 14350 - 14355
  • [6] Cuff JA, 2000, PROTEINS, V40, P502, DOI 10.1002/1097-0134(20000815)40:3<502::AID-PROT170>3.0.CO
  • [7] 2-Q
  • [8] IMPROVED STRATEGY IN ANALYTIC SURFACE CALCULATION FOR MOLECULAR-SYSTEMS - HANDLING OF SINGULARITIES AND COMPUTATIONAL-EFFICIENCY
    EISENHABER, F
    ARGOS, P
    [J]. JOURNAL OF COMPUTATIONAL CHEMISTRY, 1993, 14 (11) : 1272 - 1280
  • [9] ProTherm, thermodynamic database for proteins and mutants: Developments in version 3.0
    Gromiha, MM
    Uedaira, H
    An, JH
    Selvaraj, S
    Prabakaran, P
    Sarai, A
    [J]. NUCLEIC ACIDS RESEARCH, 2002, 30 (01) : 301 - 302
  • [10] Role of structural and sequence information in the prediction of protein stability changes: comparison between buried and partially buried mutations
    Gromiha, MM
    Oobatake, M
    Kono, H
    Uedaira, H
    Sarai, A
    [J]. PROTEIN ENGINEERING, 1999, 12 (07): : 549 - 555