Loss of protein structure stability as a major causative factor in monogenic disease

被引:388
作者
Yue, P
Li, ZL
Moult, J [1 ]
机构
[1] Univ Maryland, Maryland Biotechnol Inst, Ctr Adv Res Biotechnol, Rockville, MD 20850 USA
[2] Univ Maryland, Mol & Cellular Biol Program, College Pk, MD 20742 USA
关键词
protein structure; protein stability; monogenic disease; mis-sense mutations; single nucleotide polymorphisms;
D O I
10.1016/j.jmb.2005.08.020
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The most common cause of monogenic disease is a single base DNA variant resulting in an amino acid substitution. In a previous study, we observed that a high fraction of these substitutions appear to result in reduction of stability of the corresponding protein structure. We have now investigated this phenomenon more fully. A set of structural effects, such as reduction in hydrophobic area, overpacking, backbone strain, and loss of electrostatic interactions, is used to represent the impact of single residue mutations on protein stability. A support vector machine (SVM) was trained on a set of mutations causative of disease, and a control set of non-disease causing mutations. In jack-knifed testing, the method identifies 74% of disease mutations, with a false positive rate of 15%. Evaluation of a set of in vitro mutagenesis data with the SVM established that the majority of disease mutations affect protein stability by 1 to 3 kcal/mol. The method's effective distinction between disease and non-disease variants, strongly supports the hypothesis that loss of protein stability is a major factor contributing to monogenic disease. Mutant analysis is available (www.snps3d.org). (c) 2005 Elsevier Ltd. All rights reserved.
引用
收藏
页码:459 / 473
页数:15
相关论文
共 62 条
  • [1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [2] SCOP database in 2004: refinements integrate structure and sequence family data
    Andreeva, A
    Howorth, D
    Brenner, SE
    Hubbard, TJP
    Chothia, C
    Murzin, AG
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 : D226 - D229
  • [3] ProTherm, version 4.0: thermodynamic database for proteins and mutants
    Bava, KA
    Gromiha, MM
    Uedaira, H
    Kitajima, K
    Sarai, A
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 : D120 - D121
  • [4] FREE-ENERGY VIA MOLECULAR SIMULATION - APPLICATIONS TO CHEMICAL AND BIOMOLECULAR SYSTEMS
    BEVERIDGE, DL
    DICAPUA, FM
    [J]. ANNUAL REVIEW OF BIOPHYSICS AND BIOPHYSICAL CHEMISTRY, 1989, 18 : 431 - 492
  • [5] The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003
    Boeckmann, B
    Bairoch, A
    Apweiler, R
    Blatter, MC
    Estreicher, A
    Gasteiger, E
    Martin, MJ
    Michoud, K
    O'Donovan, C
    Phan, I
    Pilbout, S
    Schneider, M
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (01) : 365 - 370
  • [6] ENERGETIC CONTRIBUTION OF SIDE-CHAIN HYDROGEN-BONDING TO THE STABILITY OF STAPHYLOCOCCAL NUCLEASE
    BYRNE, MP
    MANUEL, RL
    LOWE, LG
    STITES, WE
    [J]. BIOCHEMISTRY, 1995, 34 (42) : 13949 - 13960
  • [7] A graph-theory algorithm for rapid protein side-chain prediction
    Canutescu, AA
    Shelenkov, AA
    Dunbrack, RL
    [J]. PROTEIN SCIENCE, 2003, 12 (09) : 2001 - 2014
  • [8] Predicting the functional consequences of non-synonymous single nucleotide polymorphisms: Structure-based assessment of amino acid variation
    Chasman, D
    Adams, RM
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2001, 307 (02) : 683 - 706
  • [9] Finding functional features in Saccharomyces genomes by phylogenetic footprinting
    Cliften, P
    Sudarsanam, P
    Desikan, A
    Fulton, L
    Fulton, B
    Majors, J
    Waterston, R
    Cohen, BA
    Johnston, M
    [J]. SCIENCE, 2003, 301 (5629) : 71 - 76
  • [10] Deshpande N, 2005, NUCLEIC ACIDS RES, V33, pD233