The power of single-nucleotide polymorphisms for large-scale parentage inference

被引:232
作者
Anderson, EC [1 ]
Garza, JC [1 ]
机构
[1] SW Fisheries Sci Ctr, Santa Cruz Lab, Fisheries Ecol Div, Santa Cruz, CA 95060 USA
关键词
D O I
10.1534/genetics.105.048074
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Likelihood-based parentage inference depends on the distribution of a likelihood-ratio statistic, which, in most cases of interest, cannot be exactly determined, but only approximated by Monte Carlo simulation. We provide importance-sampling algorithms for efficiently approximating very small tail probabilities in the distribution of the likelihood-ratio statistic. These importance-sampling methods allow the estimation of sin all false-positive rates and hence permit likelihood-based inference of parentage in large studies involving a great number of potential parents and many potential offspring. We investigate the performance of these importance-sampling algorithms in the context of parentage inference using single-nucleotide polymorphism (SNP) data and find that they may accelerate the computation of tail probabilities > 1 millionfold. We subsequently use the importance-sampling algorithms to calculate the power available with SNPs for largescale parentage studies, paying particular attention to the effect of genotyping errors and the occurrence of related individuals among the members of the putative mother-father-offspring trios. These simulations show that 60-100 SNPs may allow accurate pedigree reconstruction, even in situations involving thousands of potential mothers, fathers, and offspring. In addition, we compare the power of exclusion-based parentage inference to that of the likelihood-based method. Likelihood-based inference is much more powerful under many conditions; exclusion-based inference would require 40% more SNP loci to achieve the same accuracy as the likelihood-based approach in one common scenario. Our results demonstrate that SNPs are a powerful tool for parentage inference in large managed and/or natural populations.
引用
收藏
页码:2567 / 2582
页数:16
相关论文
共 48 条
  • [1] SPIP 1.0: a program for simulating pedigrees and genetic data in age-structured populations
    Anderson, EC
    Dunham, KK
    [J]. MOLECULAR ECOLOGY NOTES, 2005, 5 (02): : 459 - 461
  • [2] [Anonymous], 1979, Monte Carlo Methods, DOI DOI 10.1007/978-94-009-5819-7
  • [3] Association of protein kinase C alpha (PRKCA) gene with multiple sclerosis in a UK population
    Barton, A
    Woolmore, JA
    Ward, D
    Eyre, S
    Hinks, A
    Ollier, WER
    Strange, RC
    Fryer, AA
    John, S
    Hawkins, CP
    Worthington, J
    [J]. BRAIN, 2004, 127 : 1717 - 1722
  • [4] The utility of single nucleotide polymorphisms in inferences of population history
    Brumfield, RT
    Beerli, P
    Nickerson, DA
    Edwards, SV
    [J]. TRENDS IN ECOLOGY & EVOLUTION, 2003, 18 (05) : 249 - 256
  • [5] CHAKRABORTY R, 1988, GENETICS, V118, P327
  • [6] CHAKRABORTY R, 1983, HUM HERED, V33, P12
  • [7] Cotterman CW, 1940, THESIS OHIO STATE U
  • [8] Exclusion probabilities for single-locus paternity analysis when related males compete for matings
    Double, MC
    Cockburn, A
    Barry, SC
    Smouse, PE
    [J]. MOLECULAR ECOLOGY, 1997, 6 (12) : 1155 - 1166
  • [9] PAPA (package for the analysis of parental allocation): a computer program for simulated and real parental allocation
    Duchesne, P
    Godbout, MH
    Bernatchez, L
    [J]. MOLECULAR ECOLOGY NOTES, 2002, 2 (02): : 191 - 193
  • [10] Edwards A. W. F., 1967, B EUR SOC HUM GENET, V1, P42