Exact coalescent for the Wright-Fisher model

被引:42
作者
Fu, Yun-Xin [1 ]
机构
[1] Univ Texas, Ctr Human Genet, Sch Publ Hlth, Houston, TX 77030 USA
关键词
exact coalescent; Kingman coalescent; Wright-Fisher model;
D O I
10.1016/j.tpb.2005.11.005
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
The Kingman coalescent, which has become the foundation for a wide range of theoretical as well as empirical studies, was derived as an approximation of the Wright-Fisher (WF) model. The approximation heavily relies on the assumption that population size is large and sample size is much smaller than the population size. Whether the sample size is too large compared to the population size is rarely questioned in practice when applying statistical methods based on the Kingman coalescent. Since WF model is the most widely used population genetics model for reproduction, it is desirable to develop a coalescent framework for the WF model, which can be used whenever there are concerns about the accuracy of the Kingman coalescent as an approximation. This paper described the exact coalescent theory for the WF model and develops a simulation algorithm, which is then used, together with an analytical approach, to study the properties of the exact coalescent as well as its differences to the Kingman coalescent. We show that the Kingman coalescent differs from the exact coalescent by: (1) shorter waiting time between successive coalescent events; (2) different probability of observing a topological relationship among sequences in a sample; and (3) slightly smaller tree length in the genealogy of a large sample. On the other hand, there is little difference in the age of the most recent common ancestor (MRCA) of the sample. The exact coalescent makes up the longer waiting time between successive coalescent events by having multiple coalescence at the same time. The most significant difference among various summary statistics of a coalescent examined is the sum of lengths of external branches, which can be more than 10% larger for exact coalescent than that for the Kingman coalescent. As a whole, the Kingman coalescent is a remarkably accurate approximation to the exact coalescent for sample and population sizes failing considerably outside the region that was originally anticipated. (c) 2005 Elsevier Inc. All rights reserved.
引用
收藏
页码:385 / 394
页数:10
相关论文
共 17 条
[1]  
Abramowitz M., 1970, HDB MATH FUNCTIONS
[2]  
Ewens W.J., 2004, MATH POPULATION GENE, DOI DOI 10.1007/978-0-387-21822-9
[3]   STATISTICAL PROPERTIES OF SEGREGATING SITES [J].
FU, YX .
THEORETICAL POPULATION BIOLOGY, 1995, 48 (02) :172-197
[4]  
FU YX, 1993, GENETICS, V133, P693
[5]  
FU YX, 1994, GENETICS, V138, P1375
[6]   SIMULATING PROBABILITY-DISTRIBUTIONS IN THE COALESCENT [J].
GRIFFITHS, RC ;
TAVARE, S .
THEORETICAL POPULATION BIOLOGY, 1994, 46 (02) :131-159
[7]  
HUDSON RR, 1991, OXF SURV EVOL BIOL, V7, P1
[9]  
Kingman JFC., 1982, Journal of Applied Probability, V19, P27, DOI [10.2307/3213548, DOI 10.1017/S0021900200034446, DOI 10.2307/3213548]
[10]  
KINGMAN JFC, 1980, MATH GENETIC DIVERSI