Interpreting missense variants:: Comparing computational methods in human disease genes CDKN2A, MLH1, MSH2, MECP2, and tyrosinase (TYR)

被引:103
作者
Chan, Philip A.
Duraisamy, Sekhar
Miller, Peter J.
Newell, Joan A.
McBride, Carole
Bond, Jeffrey P.
Raevaara, Tiina
Ollila, Saara
Nystrom, Minna
Grimm, Andrew J.
Christodoulou, John
Oetting, William S.
Greenblatt, Marc S.
机构
[1] Univ Vermont, Vermont Canc Ctr, Burlington, VT USA
[2] Univ Helsinki, Dept Biol & Environm Sci Genet, Helsinki, Finland
[3] Univ Sydney, Childrens Hosp Westmead, Sydney, NSW 2006, Australia
[4] Univ Sydney, Discipline Paediat & Child Hlth, Sydney, NSW 2006, Australia
[5] Univ Minnesota, Dept Med, Minneapolis, MN USA
关键词
SIFT; PolyPhen; BLOSUM62; Grantham; hereditary; cancer; nsSNP; germline; CDKN2A; MLH1; MLSH2; MECP2; TYR;
D O I
10.1002/humu.20492
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
The human genome contains frequent single-basepair variants that may or may not cause genetic disease. To characterize benign vs. pathogenic missense variants, numerous computational algorithms have been developed based on comparative sequence and/or protein structure analysis. We compared computational methods that use evolutionary conservation alone, amino acid (AA) change alone, and a combination of conservation and AA change in predicting the consequences of 254 missense variants in the CDKN2A (n = 92), MLH1 (n = 28), MSH2 (n = 14), MECP2 (n = 30), and tyrosinase (TYR) (n = 90) genes. Variants were validated as either neutral or deleterious by curated locus-specific mutation databases and published functional data. All methods that use evolutionary sequence analysis have comparable overall prediction accuracy (72.9-82.0%). Mutations at codons where the AA is absolutely conserved over a sufficient evolutionary distance (about one-third of variants) had a 91.6 to 96.8% likelihood of being deleterious. Three algorithms (SIFT, PolyPhen, and A-GVGD) that differentiate one variant from another at a given codon did not significantly improve predictive value over conservation score alone using the BLOSUM62 matrix. However, when all four methods were in agreement (62.7% of variants), predictive value improved to 88.1%. These results confirm a high predictive value for methods that use evolutionary sequence conservation, with or without considering protein structural change, to predict the clinical consequences of missense variants. The methods can be generalized across genes that cause different types of genetic disease. The results support the clinical use of computational methods as one tool to help interpret missense variants in genes associated with human genetic disease.
引用
收藏
页码:683 / 693
页数:11
相关论文
共 54 条
[51]   Many amino acid substitution variants identified in DNA repair genes during human population screenings are predicted to impact protein function [J].
Xi, T ;
Jones, IM ;
Mohrenweiser, HW .
GENOMICS, 2004, 83 (06) :970-979
[52]  
YANG GW, 1995, CHEM J CHINESE U, V16, P55
[53]   Biologic and biochemical analyses of p16INK4a mutations from primary tumors [J].
Yarbrough, WG ;
Buckmire, RA ;
Bessho, M ;
Liu, ET .
JOURNAL OF THE NATIONAL CANCER INSTITUTE, 1999, 91 (18) :1569-1574
[54]   Defective folding of mutant p16(INK4) proteins encoded by tumor-derived alleles [J].
Zhang, B ;
Peng, ZY .
JOURNAL OF BIOLOGICAL CHEMISTRY, 1996, 271 (46) :28734-28737