ANGSD: Analysis of Next Generation Sequencing Data

Cited by: 1851
Authors
Korneliussen, Thorfinn Sand [1]
Albrechtsen, Anders [2]
Nielsen, Rasmus [1, 3]
Affiliations
[1] Nat Hist Museum Denmark, Ctr GeoGenet, Copenhagen, Denmark
[2] Univ Copenhagen, Bioinformat Ctr, Dept Biol, DK-2200 Copenhagen, Denmark
[3] Univ Calif Berkeley, Dept Integrat Biol & Stat, Berkeley, CA 94720 USA
Source
BMC BIOINFORMATICS | 2014 / Vol. 15
Funding
US National Institutes of Health (NIH); National Research Foundation, Singapore;
Keywords
Next-generation sequencing; Bioinformatics; Population genetics; Association studies; SHORT READ ALIGNMENT; GENOME SEQUENCE; NGS DATA; ASSOCIATION; GENOTYPE; ACCURATE; HISTORY; STATISTICS; DISCOVERY; FRAMEWORK;
DOI
10.1186/s12859-014-0356-4
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Background: High-throughput DNA sequencing technologies are generating vast amounts of data. Fast, flexible and memory-efficient implementations are needed in order to facilitate analyses of thousands of samples simultaneously. Results: We present a multithreaded program suite called ANGSD. This program can calculate various summary statistics, and perform association mapping and population genetic analyses utilizing the full information in next-generation sequencing data by working directly on the raw sequencing data or by using genotype likelihoods. Conclusions: The open source C/C++ program ANGSD is available at http://www.popgen.dk/angsd. The program is tested and validated on GNU/Linux systems. The program supports multiple input formats, including BAM and imputed Beagle genotype probability files. The program allows the user to choose between combinations of existing methods and can perform analyses that are not implemented elsewhere.
Pages: 13