Stochastic models and descriptive statistics for phylogenetic trees, from Yule to today

Cited by: 190
Authors
Aldous, DJ [1]
Affiliations
[1] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
Keywords
descriptive statistics; phylogenetic tree; stochastic model; tree balance; Yule process;
DOI
10.1214/ss/998929474
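Chinese Library Classification (CLC) number
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics];
Discipline classification codes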
020208 ; 070103 ; 0714 ;
Abstract
In 1924 Yule observed that distributions of number of species per genus were typically long-tailed, and proposed a stochastic model to fit these data. Modern taxonomists often prefer to represent relationships between species via phylogenetic trees; the counterpart to Yule's observation is that actual reconstructed trees look surprisingly unbalanced. The imbalance can readily be seen via a scatter diagram of the sizes of clades involved in the splits of published large phylogenetic trees. Attempting stochastic modeling leads to two puzzles. First, two somewhat opposite possible biological descriptions of what dominates the macroevolutionary process (adaptive radiation; "neutral" evolution) lead to exactly the same mathematical model (Markov or Yule or coalescent). Second, neither this nor any other simple stochastic model predicts the observed pattern of imbalance. This essay represents a probabilist's musings on these puzzles, complementing the more detailed survey of biological literature.
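The scatter diagram mentioned in the abstract plots, for each split in a tree, the size of the parent clade against the size of the smaller daughter clade. As a rough illustration only (this is not code from the paper), the Python sketch below generates such pairs for a simulated tree. It assumes the standard property of the Markov/Yule model that a clade of m species splits into daughters of sizes (i, m - i) with i uniform on 1, ..., m - 1. The function name `yule_splits` and the choice of 200 species are hypothetical.

```python
import random


def yule_splits(n, rng):
    """Return (clade_size, smaller_daughter_size) for every split of a random n-leaf tree.

    Assumption: each clade of size m divides into daughters of sizes
    (i, m - i) with i drawn uniformly from 1..m-1 (Markov/Yule split rule).
    """
    splits = []
    stack = [n]  # clades still waiting to be split
    while stack:
        m = stack.pop()
        if m < 2:
            continue  # a single species cannot split further
        i = rng.randint(1, m - 1)  # inclusive on both ends
        splits.append((m, min(i, m - i)))
        stack.extend((i, m - i))
    return splits


rng = random.Random(0)  # fixed seed so the run is repeatable
splits = yule_splits(200, rng)
```

Plotting the pairs in `splits` on log-log axes would give the model's counterpart of the scatter diagram; by the abstract's account, published trees look more unbalanced than this model predicts.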
Pages: 23-34
Number of pages: 12
Related papers
41 items in total
  • [1] DARWINS LOG - A TOY MODEL OF SPECIATION AND EXTINCTION
    ALDOUS, D
    [J]. JOURNAL OF APPLIED PROBABILITY, 1995, 32 (02) : 279 - 295
  • [2] Aldous D J., 1995, Random Discrete Structures, P1
  • [3] [Anonymous], 1980, PHYLOGENETIC PATTERN, DOI 10.1093/sysbio/syt033
  • [4] Asmussen S., 1983, Branching processes
  • [5] Athreya K.B., 1972, BRANCHING PROCESS
  • [6] Bininda-Emonds O.R.P., 1996, Bonner Zoologische Monographien, V41, P1
  • [7] THE 1991 CENSUS ADJUSTMENT - UNDERCOUNT OR BAD DATA
    BREIMAN, L
    [J]. STATISTICAL SCIENCE, 1994, 9 (04) : 458 - 475
  • [8] PHYLOGENETICS OF SEED PLANTS - AN ANALYSIS OF NUCLEOTIDE-SEQUENCES FROM THE PLASTID GENE RBCL
    CHASE, MW
    SOLTIS, DE
    OLMSTEAD, RG
    MORGAN, D
    LES, DH
    MISHLER, BD
    DUVALL, MR
    PRICE, RA
    HILLS, HG
    QIU, YL
    KRON, KA
    RETTIG, JH
    CONTI, E
    PALMER, JD
    MANHART, JR
    SYTSMA, KJ
    MICHAELS, HJ
    KRESS, WJ
    KAROL, KG
    CLARK, WD
    HEDREN, M
    GAUT, BS
    JANSEN, RK
    KIM, KJ
    WIMPEE, CF
    SMITH, JF
    FURNIER, GR
    STRAUSS, SH
    XIANG, QY
    PLUNKETT, GM
    SOLTIS, PS
    SWENSEN, SM
    WILLIAMS, SE
    GADEK, PA
    QUINN, CJ
    EGUIARTE, LE
    GOLENBERG, E
    LEARN, GH
    GRAHAM, SW
    BARRETT, SCH
    DAYANANDAN, S
    ALBERT, VA
    [J]. ANNALS OF THE MISSOURI BOTANICAL GARDEN, 1993, 80 (03) : 528 - 580
  • [9] Interpreting sister-group tests of key innovation hypotheses
    de Queiroz, A
    [J]. SYSTEMATIC BIOLOGY, 1998, 47 (04) : 710 - 718
  • [10] Ewens W. J., 1979, MATH POPULATION GENE