dbNSFP v3.0: A One-Stop Database of Functional Predictions and Annotations for Human Nonsynonymous and Splice-Site SNVs

被引:713
作者
Liu, Xiaoming [1 ,2 ]
Wu, Chunlei [3 ]
Li, Chang [2 ]
Boerwinkle, Eric [1 ,2 ,4 ]
机构
[1] Univ Texas Hlth Sci Ctr Houston, Sch Publ Hlth, Ctr Human Genet, Houston, TX 77030 USA
[2] Univ Texas Hlth Sci Ctr Houston, Sch Publ Hlth, Dept Epidemiol Human Genet & Environm Sci, Houston, TX 77030 USA
[3] Scripps Res Inst, Dept Mol & Expt Med, 10666 N Torrey Pines Rd, La Jolla, CA 92037 USA
[4] Baylor Coll Med, Houston, TX 77030 USA
基金
美国国家卫生研究院;
关键词
dbNSFP; dbscSNV; nonsynonymous mutation; splice-site mutation; functional prediction; database; CONSEQUENCES; MUTATIONS; VARIANTS; GENOME; PATHOGENICITY; ELEMENTS; SNPS;
D O I
10.1002/humu.22932
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
The purpose of the dbNSFP is to provide a one-stop resource for functional predictions and annotations for human nonsynonymous single-nucleotide variants (nsSNVs) and splice-site variants (ssSNVs), and to facilitate the steps of filtering and prioritizing SNVs from a large list of SNVs discovered in an exome-sequencing study. A list of all potential nsSNVs and ssSNVs based on the human reference sequence were created and functional predictions and annotations were curated and compiled for each SNV. Here, we report a recent major update of the database to version 3.0. The SNV list has been rebuilt based on GENCODE 22 and currently the database includes 82,832,027 nsSNVs and ssSNVs. An attached database dbscSNV, which compiled all potential human SNVs within splicing consensus regions and their deleteriousness predictions, add another 15,030,459 potentially functional SNVs. Eleven prediction scores (MetaSVM, MetaLR, CADD, VEST3, PROVEAN, 4x fitCons, fathmm-MKL, and DANN) and allele frequencies from the UK10K cohorts and the Exome Aggregation Consortium (ExAC), among others, have been added. The original seven prediction scores in v2.0 (SIFT, 2x Polyphen2, LRT, MutationTaster, MutationAssessor, and FATHMM) as well as many SNV and gene functional annotations have been updated. dbNSFP v3.0 is freely available at http://sites.google.com/site/jpopgen/dbNSFP. (C) 2015 Wiley Periodicals, Inc.
引用
收藏
页码:235 / 241
页数:7
相关论文
共 38 条
[1]   A method and server for predicting damaging missense mutations [J].
Adzhubei, Ivan A. ;
Schmidt, Steffen ;
Peshkin, Leonid ;
Ramensky, Vasily E. ;
Gerasimova, Anna ;
Bork, Peer ;
Kondrashov, Alexey S. ;
Sunyaev, Shamil R. .
NATURE METHODS, 2010, 7 (04) :248-249
[2]   An integrated map of genetic variation from 1,092 human genomes [J].
Altshuler, David M. ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Donnelly, Peter ;
Eichler, Evan E. ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Green, Eric D. ;
Hurles, Matthew E. ;
Knoppers, Bartha M. ;
Korbel, Jan O. ;
Lander, Eric S. ;
Lee, Charles ;
Lehrach, Hans ;
Mardis, Elaine R. ;
Marth, Gabor T. ;
McVean, Gil A. ;
Nickerson, Deborah A. ;
Schmidt, Jeanette P. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Dinh, Huyen ;
Kovar, Christie ;
Lee, Sandra ;
Lewis, Lora ;
Muzny, Donna ;
Reid, Jeff ;
Wang, Min ;
Wang, Jun ;
Fang, Xiaodong ;
Guo, Xiaosen ;
Jian, Min ;
Jiang, Hui ;
Jin, Xin ;
Li, Guoqing ;
Li, Jingxiang ;
Li, Yingrui ;
Li, Zhuo ;
Liu, Xiao ;
Lu, Yao ;
Ma, Xuedi ;
Su, Zhe ;
Tai, Shuaishuai ;
Tang, Meifang .
NATURE, 2012, 491 (7422) :56-65
[3]   Identifying Mendelian disease genes with the Variant Effect Scoring Tool [J].
Carter, Hannah ;
Douville, Christopher ;
Stenson, Peter D. ;
Cooper, David N. ;
Karchin, Rachel .
BMC GENOMICS, 2013, 14
[4]  
Choi YH, 2012, PLOS ONE, V7, DOI [10.1371/journal.pone.0039927, 10.1371/journal.pone.0046688]
[5]   Identification of deleterious mutations within three human genomes [J].
Chun, Sung ;
Fay, Justin C. .
GENOME RESEARCH, 2009, 19 (09) :1553-1561
[6]   A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3 [J].
Cingolani, Pablo ;
Platts, Adrian ;
Wang, Le Lily ;
Coon, Melissa ;
Tung Nguyen ;
Wang, Luan ;
Land, Susan J. ;
Lu, Xiangyi ;
Ruden, Douglas M. .
FLY, 2012, 6 (02) :80-92
[7]   Identifying a High Fraction of the Human Genome to be under Selective Constraint Using GERP plus [J].
Davydov, Eugene V. ;
Goode, David L. ;
Sirota, Marina ;
Cooper, Gregory M. ;
Sidow, Arend ;
Batzoglou, Serafim .
PLOS COMPUTATIONAL BIOLOGY, 2010, 6 (12)
[8]   Comparison and integration of deleteriousness prediction methods for nonsynonymous SNVs in whole exome sequencing studies [J].
Dong, Chengliang ;
Wei, Peng ;
Jian, Xueqiu ;
Gibbs, Richard ;
Boerwinkle, Eric ;
Wang, Kai ;
Liu, Xiaoming .
HUMAN MOLECULAR GENETICS, 2015, 24 (08) :2125-2137
[9]   Analysis of 6,515 exomes reveals the recent origin of most human protein-coding variants [J].
Fu, Wenqing ;
O'Connor, Timothy D. ;
Jun, Goo ;
Kang, Hyun Min ;
Abecasis, Goncalo ;
Leal, Suzanne M. ;
Gabriel, Stacey ;
Altshuler, David ;
Shendure, Jay ;
Nickerson, Deborah A. ;
Bamshad, Michael J. ;
Akey, Joshua M. .
NATURE, 2013, 493 (7431) :216-220
[10]   Identifying novel constrained elements by exploiting biased substitution patterns [J].
Garber, Manuel ;
Guttman, Mitchell ;
Clamp, Michele ;
Zody, Michael C. ;
Friedman, Nir ;
Xie, Xiaohui .
BIOINFORMATICS, 2009, 25 (12) :I54-I62