dbNSFP: A Lightweight Database of Human Nonsynonymous SNPs and Their Functional Predictions

被引:600
作者
Liu, Xiaoming [1 ]
Jian, Xueqiu [1 ]
Boerwinkle, Eric [1 ]
机构
[1] Univ Texas Hlth Sci Ctr Houston, Sch Publ Hlth, Ctr Human Genet, Houston, TX 77030 USA
基金
美国国家卫生研究院;
关键词
dbNSFP; functional prediction; database; SIFT; Polyphen2; LRT; MutationTaster; PhyloP; MISSENSE MUTATIONS; PROTEIN FUNCTION; DISEASE; ANNOTATION; GENERATION; VARIANTS; GENOMES; COMMON;
D O I
10.1002/humu.21517
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
With the advance of sequencing technologies, whole exome sequencing has increasingly been used to identify mutations that cause human diseases, especially rare Mendelian diseases. Among the analysis steps, functional prediction (of being deleterious) plays an important role in filtering or prioritizing nonsynonymous SNP (NS) for further analysis. Unfortunately, different prediction algorithms use different information and each has its own strength and weakness. It has been suggested that investigators should use predictions from multiple algorithms instead of relying on a single one. However, querying predictions from different databases/Web-servers for different algorithms is both tedious and time consuming, especially when dealing with a huge number of NSs identified by exome sequencing. To facilitate the process, we developed dbNSFP (database for nonsynonymous SNPs' functional predictions). It compiles prediction scores from four new and popular algorithms (SIFT, Polyphen2, LRT, and MutationTaster), along with a conservation score (PhyloP) and other related information, for every potential NS in the human genome (a total of 75,931,005). It is the first integrated database of functional predictions from multiple algorithms for the comprehensive collection of human NSs. dbNSFP is freely available for download at http://sites.google.com/site/jpopgen/dbNSFP. Hum Mutat 32:894-899, 2011. (C) 2011 Wiley-Liss, Inc.
引用
收藏
页码:894 / 899
页数:6
相关论文
共 29 条
[1]   A method and server for predicting damaging missense mutations [J].
Adzhubei, Ivan A. ;
Schmidt, Steffen ;
Peshkin, Leonid ;
Ramensky, Vasily E. ;
Gerasimova, Anna ;
Bork, Peer ;
Kondrashov, Alexey S. ;
Sunyaev, Shamil R. .
NATURE METHODS, 2010, 7 (04) :248-249
[2]   Dealing with missing values in large-scale studies: microarray data imputation and beyond [J].
Aittokallio, Tero .
BRIEFINGS IN BIOINFORMATICS, 2010, 11 (02) :253-264
[3]   Integrating common and rare genetic variation in diverse human populations [J].
Altshuler, David M. ;
Gibbs, Richard A. ;
Peltonen, Leena ;
Dermitzakis, Emmanouil ;
Schaffner, Stephen F. ;
Yu, Fuli ;
Bonnen, Penelope E. ;
de Bakker, Paul I. W. ;
Deloukas, Panos ;
Gabriel, Stacey B. ;
Gwilliam, Rhian ;
Hunt, Sarah ;
Inouye, Michael ;
Jia, Xiaoming ;
Palotie, Aarno ;
Parkin, Melissa ;
Whittaker, Pamela ;
Chang, Kyle ;
Hawes, Alicia ;
Lewis, Lora R. ;
Ren, Yanru ;
Wheeler, David ;
Muzny, Donna Marie ;
Barnes, Chris ;
Darvishi, Katayoon ;
Hurles, Matthew ;
Korn, Joshua M. ;
Kristiansson, Kati ;
Lee, Charles ;
McCarroll, Steven A. ;
Nemesh, James ;
Keinan, Alon ;
Montgomery, Stephen B. ;
Pollack, Samuela ;
Price, Alkes L. ;
Soranzo, Nicole ;
Gonzaga-Jauregui, Claudia ;
Anttila, Verneri ;
Brodeur, Wendy ;
Daly, Mark J. ;
Leslie, Stephen ;
McVean, Gil ;
Moutsianas, Loukas ;
Nguyen, Huy ;
Zhang, Qingrun ;
Ghori, Mohammed J. R. ;
McGinnis, Ralph ;
McLaren, William ;
Takeuchi, Fumihiko ;
Grossman, Sharon R. .
NATURE, 2010, 467 (7311) :52-58
[4]   Predicting the insurgence of human genetic diseases associated to single point protein mutations with support vector machines and evolutionary information [J].
Capriotti, E. ;
Calabrese, R. ;
Casadio, R. .
BIOINFORMATICS, 2006, 22 (22) :2729-2734
[5]   Identification of deleterious mutations within three human genomes [J].
Chun, Sung ;
Fay, Justin C. .
GENOME RESEARCH, 2009, 19 (09) :1553-1561
[6]   Single-nucleotide evolutionary constraint scores highlight disease-causing mutations [J].
Cooper, Gregory M. ;
Goode, David L. ;
Ng, Sarah B. ;
Sidow, Arend ;
Bamshad, Michael J. ;
Shendure, Jay ;
Nickerson, Deborah A. .
NATURE METHODS, 2010, 7 (04) :250-251
[7]   Bioinformatics approaches for genomics and post genomics applications of next-generation sequencing [J].
Horner, David Stephen ;
Pavesi, Giulio ;
Castrignano, Tiziana ;
De Meo, Paolo D'Onorio ;
Liuni, Sabino ;
Sammeth, Michael ;
Picardi, Ernesto ;
Pesole, Graziano .
BRIEFINGS IN BIOINFORMATICS, 2010, 11 (02) :181-197
[8]   Next generation tools for the annotation of human SNPs [J].
Karchin, Rachel .
BRIEFINGS IN BIOINFORMATICS, 2009, 10 (01) :35-52
[9]   Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm [J].
Kumar, Prateek ;
Henikoff, Steven ;
Ng, Pauline C. .
NATURE PROTOCOLS, 2009, 4 (07) :1073-1082
[10]   F-SNP: computationally predicted functional SNPs for disease association studies [J].
Lee, Phil Hyoun ;
Shatkay, Hagit .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D820-D824