Performance of Mutation Pathogenicity Prediction Methods on Missense Variants

被引:414
作者
Thusberg, Janita [1 ,2 ]
Olatubosun, Ayodeji [1 ]
Vihinen, Mauno [1 ,3 ]
机构
[1] Univ Tampere, Inst Biomed Technol, FI-33014 Tampere, Finland
[2] Buck Inst Age Res, Novato, CA USA
[3] Tampere Univ Hosp, Res Ctr, Tampere, Finland
基金
芬兰科学院;
关键词
method evaluation; bioinformatics; pathogenicity prediction; SNPs; PROTEIN SECONDARY STRUCTURE; NON-SYNONYMOUS SNPS; SEQUENCE; DISEASE; DATABASE; INFORMATION; ANNOTATION; PROFILE; CLASSIFICATION; IDENTIFICATION;
D O I
10.1002/humu.21445
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Single nucleotide polymorphisms (SNPs) are the most common form of genetic variation in humans. The number of SNPs identified in the human genome is growing rapidly, but attaining experimental knowledge about the possible disease association of variants is laborious and time-consuming. Several computational methods have been developed for the classification of SNPs according to their predicted pathogenicity. In this study, we have evaluated the performance of nine widely used pathogenicity prediction methods available on the Internet. The evaluated methods were MutPred, nsSNPAnalyzer, Panther, PhD-SNP, PolyPhen, PolyPhen2, SIFT, SNAP, and SNPs&GO. The methods were tested with a set of over 40,000 pathogenic and neutral variants. We also assessed whether the type of original or substituting amino acid residue, the structural class of the protein, or the structural environment of the amino acid substitution, had an effect on the prediction performance. The performances of the programs ranged from poor (MCC 0.19) to reasonably good (MCC 0.65), and the results from the programs correlated poorly. The overall best performing methods in this study were SNPs&GO and MutPred, with accuracies reaching 0.82 and 0.81, respectively. Hum Mutat 32: 358-368, 2011. (C) 2011 Wiley-Liss, Inc.
引用
收藏
页码:358 / 368
页数:11
相关论文
共 62 条
[1]   A method and server for predicting damaging missense mutations [J].
Adzhubei, Ivan A. ;
Schmidt, Steffen ;
Peshkin, Leonid ;
Ramensky, Vasily E. ;
Gerasimova, Anna ;
Bork, Peer ;
Kondrashov, Alexey S. ;
Sunyaev, Shamil R. .
NATURE METHODS, 2010, 7 (04) :248-249
[2]   Analysis and prediction of DNA-binding proteins and their binding residues based on composition, sequence and structural information [J].
Ahmad, S ;
Gromiha, MM ;
Sarai, A .
BIOINFORMATICS, 2004, 20 (04) :477-486
[3]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[4]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[5]   The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000 [J].
Bairoch, A ;
Apweiler, R .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :45-48
[6]   Assessing the accuracy of prediction algorithms for classification: an overview [J].
Baldi, P ;
Brunak, S ;
Chauvin, Y ;
Andersen, CAF ;
Nielsen, H .
BIOINFORMATICS, 2000, 16 (05) :412-424
[7]   nsSNPAnalyzer: identifying disease-associated nonsynonymous single nucleotide polymorphisms [J].
Bao, L ;
Zhou, M ;
Cui, Y .
NUCLEIC ACIDS RESEARCH, 2005, 33 :W480-W482
[8]   Statistical geometry based prediction of nonsynonymous SNP functional effects using random forest and neuro-fuzzy classifiers [J].
Barenboim, Maxim ;
Masso, Majid ;
Vaisman, Iosif I. ;
Jamison, D. Curtis .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2008, 71 (04) :1930-1939
[9]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[10]   SNAP: predict effect of non-synonymous polymorphisms on function [J].
Bromberg, Yana ;
Rost, Burkhard .
NUCLEIC ACIDS RESEARCH, 2007, 35 (11) :3823-3835