A survey of tools for variant analysis of next-generation genome sequencing data

Cited by: 355
Authors
Pabinger, Stephan [1]
Dander, Andreas [2,3]
Fischer, Maria [1]
Snajder, Rene [1,2]
Sperk, Michael [3]
Efremova, Mirjana [3]
Krabichler, Birgit [4]
Speicher, Michael R. [5]
Zschocke, Johannes [4]
Trajanoski, Zlatko [1]
Affiliations
[1] Med Univ Innsbruck, Div Bioinformat, A-6020 Innsbruck, Austria
[2] Oncotyrol, Innsbruck, Austria
[3] Med Univ Innsbruck, A-6020 Innsbruck, Austria
[4] Med Univ Innsbruck, Div Human Genet, A-6020 Innsbruck, Austria
[5] Med Univ Graz, Inst Human Genet, Graz, Austria
Funding
Austrian Science Fund;
Keywords
Mendelian disorders; cancer; variants; bioinformatics tools; next-generation sequencing; COPY-NUMBER VARIATION; WHOLE-GENOME; QUALITY-CONTROL; READ ALIGNMENT; FUNCTIONAL-CHARACTERIZATION; STRUCTURAL VARIATION; MENDELIAN DISEASE; POINT MUTATIONS; EXOME; CANCER;
DOI
10.1093/bib/bbs086
Chinese Library Classification number
Q5 [Biochemistry];
Subject classification codes
071010; 081704;
Abstract
Recent advances in genome sequencing technologies provide unprecedented opportunities to characterize individual genomic landscapes and identify mutations relevant for diagnosis and therapy. Specifically, whole-exome sequencing using next-generation sequencing (NGS) technologies is gaining popularity in the human genetics community due to the moderate costs, manageable data amounts and straightforward interpretation of analysis results. While whole-exome and, in the near future, whole-genome sequencing are becoming commodities, data analysis still poses significant challenges and has led to the development of a plethora of tools supporting specific parts of the analysis workflow or providing a complete solution. Here, we surveyed 205 tools for whole-genome/whole-exome sequencing data analysis supporting five distinct analytical steps: quality assessment, alignment, variant identification, variant annotation and visualization. We report an overview of the functionality, features and specific requirements of the individual tools. We then selected 32 programs for variant identification, variant annotation and visualization, which were subjected to hands-on evaluation using four data sets: one set of exome data from two patients with a rare disease for testing identification of germline mutations, two cancer data sets for testing variant callers for somatic mutations, copy number variations and structural variations, and one semi-synthetic data set for testing identification of copy number variations. Our comprehensive survey and evaluation of NGS tools provides a valuable guideline for human geneticists working on Mendelian disorders, complex diseases and cancers.
Pages: 256-278
Page count: 23