Estimation of evolutionary parameters with phylogenetic trees

被引:7
作者
Wang, Q
Salter, LA
Pearl, DK
机构
[1] Univ New Mexico, Dept Math & Stat, Albuquerque, NM 87131 USA
[2] Ohio State Univ, Dept Stat, Columbus, OH 43210 USA
关键词
maximum likelihood; phylogenetics; evolutionary model parameters; bootstrapping; Monte Carlo sampling; Fisher information;
D O I
10.1007/s00239-002-2364-7
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
An important issue in the phylogenetic analysis of nucleotide sequence data using the maximum likelihood (ML) method is the underlying evolutionary model employed. We consider the problem of simultaneously estimating the tree topology and the parameters in the underlying substitution model and of obtaining estimates of the standard errors of these parameter estimates. Given a fixed tree topology and corresponding set of branch lengths, the ML estimates of standard evolutionary model parameters are asymptotically efficient, in the sense that their joint distribution is asymptotically normal with the variance-covariance matrix given by the inverse of the Fisher information matrix. We propose a new estimate of this conditional variance based on estimation of the expected information using a Monte Carlo sampling (MCS) method. Simulations are used to compare this conditional variance estimate to the standard technique of using the observed information under a variety of experimental conditions. In the case in which one wishes to estimate simultaneously the tree and parameters, we provide a bootstrapping approach that can be used in conjunction with the MCS method to estimate the unconditional standard error. The methods developed are applied to a real data set consisting of 30 papillomavirus sequences. This overall method is easily incorporated into standard bootstrapping procedures to allow for proper variance estimation.
引用
收藏
页码:684 / 695
页数:12
相关论文
共 45 条
  • [1] [Anonymous], 2000, PHYLOGENETIC ANAL MA
  • [2] BIRCH MW, 1964, ANN MATH STAT, V35, P818
  • [3] Bishop M.M., 1975, DISCRETE MULTIVARIAT
  • [4] BURK R, 1999, HUMAN PAPILLOMAVIRUS
  • [5] PHYLOGENETIC ANALYSIS OF 48 PAPILLOMAVIRUS TYPES AND 28 SUBTYPES AND VARIANTS - A SHOWCASE FOR THE MOLECULAR EVOLUTION OF DNA VIRUSES
    CHAN, SY
    BERNARD, HU
    ONG, CK
    CHAN, SP
    HOFMANN, B
    DELIUS, H
    [J]. JOURNAL OF VIROLOGY, 1992, 66 (10) : 5714 - 5725
  • [6] ANALYSIS OF GENOMIC SEQUENCES OF 95 PAPILLOMAVIRUS TYPES - UNITING TYPING, PHYLOGENY, AND TAXONOMY
    CHAN, SY
    DELIUS, H
    HALPERN, AL
    BERNARD, HU
    [J]. JOURNAL OF VIROLOGY, 1995, 69 (05) : 3074 - 3083
  • [7] Full reconstruction of Markov models on evolutionary trees: Identifiability and consistency
    Chang, JT
    [J]. MATHEMATICAL BIOSCIENCES, 1996, 137 (01) : 51 - 73
  • [8] Davidson A. C., 1997, BOOTSTRAP METHODS TH
  • [9] Bootstrap confidence levels for phylogenetic trees (vol 93, pg 7085, 1996)
    Efron, B
    Halloran, E
    Holmes, S
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1996, 93 (23) : 13429 - 13434
  • [10] EFRON B, 1978, BIOMETRIKA, V65, P457, DOI 10.1093/biomet/65.3.457