Many-core algorithms for statistical phylogenetics

被引:313
作者
Suchard, Marc A. [1 ,2 ,3 ]
Rambaut, Andrew [4 ]
机构
[1] Univ Calif Los Angeles, Dept Biomath, Los Angeles, CA 90095 USA
[2] Univ Calif Los Angeles, Dept Biostat, Los Angeles, CA 90095 USA
[3] Univ Calif Los Angeles, Dept Human Genet, Los Angeles, CA 90095 USA
[4] Univ Edinburgh, Inst Evolutionary Biol, Edinburgh EH9 3JT, Midlothian, Scotland
基金
美国国家卫生研究院;
关键词
CODON-SUBSTITUTION MODELS; MAXIMUM-LIKELIHOOD; NUCLEOTIDE SUBSTITUTION; DNA-SEQUENCES; RECONSTRUCTION; INFERENCE; TREES; GENES; RATES;
D O I
10.1093/bioinformatics/btp244
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Statistical phylogenetics is computationally intensive, resulting in considerable attention meted on techniques for parallelization. Codon-based models allow for independent rates of synonymous and replacement substitutions and have the potential to more adequately model the process of protein-coding sequence evolution with a resulting increase in phylogenetic accuracy. Unfortunately, due to the high number of codon states, computational burden has largely thwarted phylogenetic reconstruction under codon models, particularly at the genomic-scale. Here, we describe novel algorithms and methods for evaluating phylogenies under arbitrary molecular evolutionary models on graphics processing units (GPUs), making use of the large number of processing cores to efficiently parallelize calculations even for large state-size models. Results: We implement the approach in an existing Bayesian framework and apply the algorithms to estimating the phylogeny of 62 complete mitochondrial genomes of carnivores under a 60-state codon model. We see a near 90-fold speed increase over an optimized CPU-based computation and a > 140-fold increase over the currently available implementation, making this the first practical use of codon models for phylogenetic inference over whole mitochondrial or microorganism genomes.
引用
收藏
页码:1370 / 1376
页数:7
相关论文
共 32 条
[11]   Whence the red panda? [J].
Flynn, JJ ;
Nedbal, MA ;
Dragoo, JW ;
Honeycutt, RL .
MOLECULAR PHYLOGENETICS AND EVOLUTION, 2000, 17 (02) :190-199
[12]  
GOLDMAN N, 1994, MOL BIOL EVOL, V11, P725
[13]   DATING OF THE HUMAN APE SPLITTING BY A MOLECULAR CLOCK OF MITOCHONDRIAL-DNA [J].
HASEGAWA, M ;
KISHINO, H ;
YANO, TA .
JOURNAL OF MOLECULAR EVOLUTION, 1985, 22 (02) :160-174
[14]   DPRml: distributed phylogeny reconstruction by maximum likelihood [J].
Keane, TM ;
Naughton, TJ ;
Travers, SAA ;
McInerney, JO ;
McCormack, GP .
BIOINFORMATICS, 2005, 21 (07) :969-974
[15]   A NEW METHOD FOR CALCULATING EVOLUTIONARY SUBSTITUTION RATES [J].
LANAVE, C ;
PREPARATA, G ;
SACCONE, C ;
SERIO, G .
JOURNAL OF MOLECULAR EVOLUTION, 1984, 20 (01) :86-93
[16]  
LANGE K, 1997, MATH STAT METHODS GE
[17]  
LEE HJ, 1997, ICS 1997, P44
[18]   CUDA compatible GPU cards as efficient hardware accelerators for Smith-Waterman sequence alignment [J].
Manavski, Svetlin A. ;
Valle, Giorgio .
BMC BIOINFORMATICS, 2008, 9 (Suppl 2)
[19]   pIQPNNI: parallel reconstruction of large maximum likelihood phylogenies [J].
Minh, BQ ;
Vinh, LS ;
von Haeseler, A ;
Schmidt, HA .
BIOINFORMATICS, 2005, 21 (19) :3794-3796