Physicochemical constraint violation by missense substitutions mediates impairment of protein function and disease severity

Cited by: 301
Authors
Stone, EA
Sidow, A [1]
Affiliations
[1] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
[2] Stanford Univ, Dept Pathol, Stanford, CA 94305 USA
[3] Stanford Univ, Dept Genet, Stanford, CA 94305 USA
DOI
10.1101/gr.3804205
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
We find that the degree of impairment of protein function by missense variants is predictable by comparative sequence analysis alone. The applicable range of impairment is not confined to binary predictions that distinguish normal from deleterious variants, but extends continuously from mild to severe effects. The accuracy of predictions is strongly dependent on sequence variation and is highest when diverse orthologs are available. High predictive accuracy is achieved by quantification of the physicochemical characteristics in each position of the protein, based on observed evolutionary variation. The strong relationship between physicochemical characteristics of a missense variant and impairment of protein function extends to human disease. By using four diverse proteins for which sufficient comparative sequence data are available, we show that grades of disease, or likelihood of developing cancer, correlate strongly with physicochemical constraint violation by causative amino acid variants.
Pages: 978-986
Page count: 9