Estimating inbreeding coefficients from NGS data: Impact on genotype calling and allele frequency estimation

被引:73
作者
Vieira, Filipe G. [1 ]
Fumagalli, Matteo [1 ]
Albrechtsen, Anders [2 ]
Nielsen, Rasmus [1 ,2 ]
机构
[1] Univ Calif Berkeley, Dept Integrat Biol, Berkeley, CA 94720 USA
[2] Univ Copenhagen, Dept Biol, DK-2200 Copenhagen, Denmark
关键词
WILD-RICE; LINKAGE DISEQUILIBRIUM; NUCLEOTIDE VARIATION; MULTILOCUS ANALYSIS; ORYZA-RUFIPOGON; POPULATION; GENOME; DOMESTICATION; ASSOCIATION; DISCOVERY;
D O I
10.1101/gr.157388.113
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Most methods for next-generation sequencing (NGS) data analyses incorporate information regarding allele frequencies using the assumption of Hardy-Weinberg equilibrium (HWE) as a prior. However, many organisms including those that are domesticated, partially selfing, or with asexual life cycles show strong deviations from HWE. For such species, and specially for low-coverage data, it is necessary to obtain estimates of inbreeding coefficients (F) for each individual before calling genotypes. Here, we present two methods for estimating inbreeding coefficients from NGS data based on an expectation-maximization (EM) algorithm. We assess the impact of taking inbreeding into account when calling genotypes or estimating the site frequency spectrum (SFS), and demonstrate a marked increase in accuracy on low-coverage highly inbred samples. We demonstrate the applicability and efficacy of these methods in both simulated and real data sets.
引用
收藏
页码:1852 / 1861
页数:10
相关论文
共 53 条
[1]   A map of human genome variation from population-scale sequencing [J].
Altshuler, David ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Collins, Francis S. ;
De la Vega, Francisco M. ;
Donnelly, Peter ;
Egholm, Michael ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Knoppers, Bartha M. ;
Lander, Eric S. ;
Lehrach, Hans ;
Mardis, Elaine R. ;
McVean, Gil A. ;
Nickerson, DebbieA. ;
Peltonen, Leena ;
Schafer, Alan J. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Deiros, David ;
Metzker, Mike ;
Muzny, Donna ;
Reid, Jeff ;
Wheeler, David ;
Wang, Jun ;
Li, Jingxiang ;
Jian, Min ;
Li, Guoqing ;
Li, Ruiqiang ;
Liang, Huiqing ;
Tian, Geng ;
Wang, Bo ;
Wang, Jian ;
Wang, Wei ;
Yang, Huanming ;
Zhang, Xiuqing ;
Zheng, Huisong ;
Lander, Eric S. ;
Altshuler, David L. ;
Ambrogio, Lauren ;
Bloom, Toby ;
Cibulskis, Kristian ;
Fennell, Tim J. ;
Gabriel, Stacey B. .
NATURE, 2010, 467 (7319) :1061-1073
[2]  
[Anonymous], J APPL PROBAB
[3]   Artificial selection for a green revolution gene during japonica rice domestication [J].
Asano, Kenji ;
Yamasaki, Masanori ;
Takuno, Shohei ;
Miura, Kotaro ;
Katagiri, Satoshi ;
Ito, Tomoko ;
Doi, Kazuyuki ;
Wu, Jianzhong ;
Ebana, Kaworu ;
Matsumoto, Takashi ;
Innan, Hideki ;
Kitano, Hidemi ;
Ashikari, Motoyuki ;
Matsuoka, Makoto .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2011, 108 (27) :11034-11039
[4]   Genome-wide patterns of nucleotide polymorphism in domesticated rice [J].
Caicedo, Ana L. ;
Williamson, Scott H. ;
Hernandez, Ryan D. ;
Boyko, Adam ;
Fledel-Alon, Adi ;
York, Thomas L. ;
Polato, Nicholas R. ;
Olsen, Kenneth M. ;
Nielsen, Rasmus ;
McCouch, Susan R. ;
Bustamante, Carlos D. ;
Purugganan, Michael D. .
PLOS GENETICS, 2007, 3 (09) :1745-1756
[5]   Estimating human inbreeding coefficients: Comparison of genealogical and marker heterozygosity approaches [J].
Carothers, A. D. ;
Rudan, I. ;
Kolcic, I. ;
Polasek, O. ;
Hayward, C. ;
Wright, A. F. ;
Campbell, H. ;
Teague, P. ;
Hastie, N. D. ;
Weber, J. L. .
ANNALS OF HUMAN GENETICS, 2006, 70 :666-676
[6]  
CEPPELLINI R, 1955, ANN HUM GENET, V20, P97
[7]   Extremely high levels of inbreeding in a natural population of the free-living wasp Ancistrocerus antilope (Hymenoptera: Vespidae: Eumeninae) [J].
Chapman, TW ;
Stewart, SC .
HEREDITY, 1996, 76 :65-69
[8]   A framework for variation discovery and genotyping using next-generation DNA sequencing data [J].
DePristo, Mark A. ;
Banks, Eric ;
Poplin, Ryan ;
Garimella, Kiran V. ;
Maguire, Jared R. ;
Hartl, Christopher ;
Philippakis, Anthony A. ;
del Angel, Guillermo ;
Rivas, Manuel A. ;
Hanna, Matt ;
McKenna, Aaron ;
Fennell, Tim J. ;
Kernytsky, Andrew M. ;
Sivachenko, Andrey Y. ;
Cibulskis, Kristian ;
Gabriel, Stacey B. ;
Altshuler, David ;
Daly, Mark J. .
NATURE GENETICS, 2011, 43 (05) :491-+
[9]   Assessment of population genetic structure in common wild rice Oryza rufipogon Griff. using microsatellite and allozyme markers [J].
Gao, LZ ;
Schaal, BA ;
Zhang, CH ;
Jia, JZ ;
Dong, YS .
THEORETICAL AND APPLIED GENETICS, 2002, 106 (01) :173-180
[10]   Field guide to next-generation DNA sequencers [J].
Glenn, Travis C. .
MOLECULAR ECOLOGY RESOURCES, 2011, 11 (05) :759-769