ESTIMATING EFFECTIVE POPULATION-SIZE FROM SAMPLES OF SEQUENCES - A BOOTSTRAP MONTE-CARLO INTEGRATION METHOD

被引:117
作者
FELSENSTEIN, J
机构
[1] Department of Genetics SK-50, University of Washington, Seattle
关键词
D O I
10.1017/S0016672300030962
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
We would like to use maximum likelihood to estimate parameters such as the effective population size N(e), or, if we do not know mutation rates, the product 4N(e)mu of mutation rate per site and effective population size. To compute the likelihood for a sample of unrecombined nucleotide sequences taken from a random-mating population it is necessary to sum over all genealogies that could have led to the sequences, computing for each one the probability that it would have yielded the sequences, and weighting each one by its prior probability. The genealogies vary in tree topology and in branch lengths. Although the likelihood and the prior are straightforward to compute, the summation over all genealogies seems at first sight hopelessly difficult. This paper reports that it is possible to carry out a Monte Carlo integration to evaluate the likelihoods approximately. The method uses bootstrap sampling of sites to create data sets for each of which a maximum likelihood tree is estimated. The resulting trees are assumed to be sampled from a distribution whose height is proportional to the likelihood surface for the full data. That it will be so is dependent on a theorem which is not proven, but seems likely to be true if the sequences are not short. One can use the resulting estimated likelihood curve to make a maximum likelihood estimate of the parameter of interest, N(e) or of 4N(e)mu. The method requires at least 100 times the computational effort required for estimation of a phylogeny by maximum likelihood, but is practical on today's work stations. The method does not at present have any way of dealing with recombination.
引用
收藏
页码:209 / 220
页数:12
相关论文
共 24 条
[1]  
[Anonymous], 1979, MONTE CARLO METHODS
[2]   GENE TREES AND ORGANISMAL HISTORIES - A PHYLOGENETIC APPROACH TO POPULATION BIOLOGY [J].
AVISE, JC .
EVOLUTION, 1989, 43 (06) :1192-1208
[3]   MITOCHONDRIAL-DNA AND HUMAN-EVOLUTION [J].
CANN, RL ;
STONEKING, M ;
WILSON, AC .
NATURE, 1987, 325 (6099) :31-36
[4]  
EDWARDS AWF, 1970, J ROY STAT SOC B, V32, P155
[5]   1977 RIETZ LECTURE - BOOTSTRAP METHODS - ANOTHER LOOK AT THE JACKKNIFE [J].
EFRON, B .
ANNALS OF STATISTICS, 1979, 7 (01) :1-26
[6]  
Efron B., 1982, JACKKNIFE BOOTSTRAP, V38, DOI 10.1137/1.9781611970319
[7]   PHYLOGENIES FROM MOLECULAR SEQUENCES - INFERENCE AND RELIABILITY [J].
FELSENSTEIN, J .
ANNUAL REVIEW OF GENETICS, 1988, 22 :521-565
[8]  
FELSENSTEIN J, 1981, EVOLUTION, V35, P1229, DOI 10.1111/j.1558-5646.1981.tb04991.x
[9]   ESTIMATING EFFECTIVE POPULATION-SIZE FROM SAMPLES OF SEQUENCES - INEFFICIENCY OF PAIRWISE AND SEGREGATING SITES AS COMPARED TO PHYLOGENETIC ESTIMATES [J].
FELSENSTEIN, J .
GENETICS RESEARCH, 1992, 59 (02) :139-147
[10]   CONFIDENCE-LIMITS ON PHYLOGENIES WITH A MOLECULAR CLOCK [J].
FELSENSTEIN, J .
SYSTEMATIC ZOOLOGY, 1985, 34 (02) :152-161