Prediction of solvent accessibility and sites of deleterious mutations from protein sequence

Cited by: 123
Authors
Chen, HL
Zhou, HX [1]
Affiliations
[1] Florida State Univ, Dept Phys, Tallahassee, FL 32306 USA
[2] Florida State Univ, Inst Mol Biophys, Tallahassee, FL 32306 USA
[3] Florida State Univ, Sch Computat Sci, Tallahassee, FL 32306 USA
[4] Drexel Univ, Dept Phys, Philadelphia, PA 19104 USA
DOI
10.1093/nar/gki633
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Residues that form the hydrophobic core of a protein are critical for its stability. A number of approaches have been developed to classify residues as buried or exposed. In order to optimize the classification, we have refined a suite of five methods over a large dataset and proposed a metamethod based on an ensemble average of the individual methods, leading to a two-state classification accuracy of 80%. Many studies have suggested that hydrophobic core residues are likely sites of deleterious mutations, so we wanted to see to what extent these sites can be predicted from the putative buried residues. Residues that were most confidently classified as buried were proposed as sites of deleterious mutations. This proposition was tested on six proteins for which sites of deleterious mutations have previously been identified by stability measurement or functional assay. Of the total of 130 residues predicted as sites of deleterious mutations, 104 (or 80%) were correct.
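The ensemble-average idea in the abstract can be illustrated with a minimal sketch. It assumes each individual method outputs a per-residue "buried" score between 0 and 1. The abstract does not name the five methods, the decision threshold, or how confidence was defined, so the function names, the 0.5 threshold, and the 0.8 confidence cutoff below are all hypothetical choices for illustration, not the authors' implementation.

```python
from statistics import mean


def ensemble_buried_scores(method_scores):
    """Average per-residue buried scores across several methods.

    method_scores: list of lists, one inner list per method, each holding
    one score in [0, 1] per residue (an assumed input format).
    """
    n_residues = len(method_scores[0])
    if any(len(scores) != n_residues for scores in method_scores):
        raise ValueError("all methods must score the same number of residues")
    # zip(*...) regroups the scores so each tuple holds one residue's scores.
    return [mean(per_residue) for per_residue in zip(*method_scores)]


def classify_two_state(scores, threshold=0.5):
    """Label each residue 'B' (buried) or 'E' (exposed); threshold is assumed."""
    return ["B" if s >= threshold else "E" for s in scores]


def most_confident_buried(scores, cutoff=0.8):
    """Return indices of residues whose averaged score meets the assumed cutoff.

    These would be the candidate sites of deleterious mutations.
    """
    return [i for i, s in enumerate(scores) if s >= cutoff]


def fraction_correct(n_correct, n_predicted):
    """Fraction of predicted sites that were confirmed."""
    return n_correct / n_predicted


if __name__ == "__main__":
    # Toy scores from three hypothetical methods for three residues.
    toy = [
        [1.0, 0.0, 0.5],
        [1.0, 0.0, 0.25],
        [1.0, 0.0, 0.75],
    ]
    avg = ensemble_buried_scores(toy)
    print(classify_two_state(avg))
    print(most_confident_buried(avg))
    # The abstract's reported figures: 104 correct out of 130 predicted sites.
    print(fraction_correct(104, 130))
```

Averaging scores before thresholding, rather than taking a majority vote over hard labels, keeps a graded confidence value per residue, which is what makes a "most confidently buried" subset selectable.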
Pages: 3193-3199
Page count: 7