Parallelized phylogenetic post-analysis on multi-core architectures

被引:11
作者
Aberer, Andre J. [1 ]
Pattengale, Nicholas D. [2 ]
Stamatakis, Alexandros [1 ]
机构
[1] Tech Univ Munich, Dept Comp Sci, Exelixis Lab, D-85748 Garching, Germany
[2] Univ New Mexico, Dept Comp Sci, Albuquerque, NM 87131 USA
关键词
RAxML; Majority rule tree; Multi-core architecture; Phylogenetics; ALGORITHMS; CONFIDENCE;
D O I
10.1016/j.jocs.2010.03.006
中图分类号
TP39 [计算机的应用];
学科分类号
080201 [机械制造及其自动化];
摘要
Bioinformatics is experiencing a rapid and overwhelming accumulation of molecular sequence data, predominantly driven by novel wet-lab sequencing techniques. This trend poses scalability challenges for tool developers. In the field of phylogenetic inference (reconstruction of evolutionary trees from molecular sequence data), scalability is becoming an increasingly important issue for operations other than the tree reconstruction itself. In this paper we focus on post-analysis tasks in reconstructing very large trees, specifically the step of building (extended) majority rule consensus trees from a collection of equally plausible trees or a collection of bootstrap replicate trees. To this end we present non-parallel optimizations which establish our implementation as the fastest exact implementation in phylogenetics, and our novel parallelized routines are the first of their kind. Our non-parallel optimizations achieve a performance improvement of factor 50 compared to the previous version of our code and we achieve a maximum speedup of 5.5 on a 8-core Nehalem node for building consensus trees comprising up to 55,000 organisms. We also present a parallel approach for drawing bootstrap support values on a candidate tree, and experimentally assess our approach in order to better understand read-only versus read-write parallel hash table accesses on multi-core systems. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:107 / 114
页数:8
相关论文
共 27 条
[1]
[Anonymous], 2002, PAUP*. Phylogenetic Analysis Using Parsimony (*and other methods). Version 4
[2]
[Anonymous], 2006, GENETIC ALGORITHM AP
[3]
[Anonymous], 2005, PHYLIP (phylogeny inference package) version 3.6
[4]
Bryant D., 2003, Bioconsensus. DIMACS Working Group Meetings on Bioconsensus, P163
[5]
Buneman P., 1971, Mathematics in the Archeological and Historical Sciences: Proceedings of the Anglo-Romanian Conference, P387
[6]
Maximum likelihood of evolutionary trees: hardness and approximation [J].
Chor, B ;
Tuller, T .
BIOINFORMATICS, 2005, 21 :I97-I106
[7]
EVOLUTIONARY TREES FROM DNA-SEQUENCES - A MAXIMUM-LIKELIHOOD APPROACH [J].
FELSENSTEIN, J .
JOURNAL OF MOLECULAR EVOLUTION, 1981, 17 (06) :368-376
[8]
FELSENSTEIN J, 1985, EVOLUTION, V39, P783, DOI 10.1111/j.1558-5646.1985.tb00420.x
[9]
CONSTRUCTION OF PHYLOGENETIC TREES [J].
FITCH, WM ;
MARGOLIASH, E .
SCIENCE, 1967, 155 (3760) :279-+
[10]
Likelihood-based tests of topologies in phylogenetics [J].
Goldman, N ;
Anderson, JP ;
Rodrigo, AG .
SYSTEMATIC BIOLOGY, 2000, 49 (04) :652-670