Parallelized phylogenetic post-analysis on multi-core architectures

被引:11
作者
Aberer, Andre J. [1 ]
Pattengale, Nicholas D. [2 ]
Stamatakis, Alexandros [1 ]
机构
[1] Tech Univ Munich, Dept Comp Sci, Exelixis Lab, D-85748 Garching, Germany
[2] Univ New Mexico, Dept Comp Sci, Albuquerque, NM 87131 USA
关键词
RAxML; Majority rule tree; Multi-core architecture; Phylogenetics; ALGORITHMS; CONFIDENCE;
D O I
10.1016/j.jocs.2010.03.006
中图分类号
TP39 [计算机的应用];
学科分类号
080201 [机械制造及其自动化];
摘要
Bioinformatics is experiencing a rapid and overwhelming accumulation of molecular sequence data, predominantly driven by novel wet-lab sequencing techniques. This trend poses scalability challenges for tool developers. In the field of phylogenetic inference (reconstruction of evolutionary trees from molecular sequence data), scalability is becoming an increasingly important issue for operations other than the tree reconstruction itself. In this paper we focus on post-analysis tasks in reconstructing very large trees, specifically the step of building (extended) majority rule consensus trees from a collection of equally plausible trees or a collection of bootstrap replicate trees. To this end we present non-parallel optimizations which establish our implementation as the fastest exact implementation in phylogenetics, and our novel parallelized routines are the first of their kind. Our non-parallel optimizations achieve a performance improvement of factor 50 compared to the previous version of our code and we achieve a maximum speedup of 5.5 on a 8-core Nehalem node for building consensus trees comprising up to 55,000 organisms. We also present a parallel approach for drawing bootstrap support values on a candidate tree, and experimentally assess our approach in order to better understand read-only versus read-write parallel hash table accesses on multi-core systems. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:107 / 114
页数:8
相关论文
共 27 条
[21]
CONSEL: for assessing the confidence of phylogenetic tree selection [J].
Shimodaira, H ;
Hasegawa, M .
BIOINFORMATICS, 2001, 17 (12) :1246-1247
[22]
Rates of molecular evolution are linked to life history in flowering plants [J].
Smith, Stephen A. ;
Donoghue, Michael J. .
SCIENCE, 2008, 322 (5898) :86-89
[23]
Stamatakis A, 2008, LECT N BIOINFORMAT, V5265, P424, DOI 10.1007/978-3-540-88436-1_36
[24]
RAxML-VI-HPC: Maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models [J].
Stamatakis, Alexandros .
BIOINFORMATICS, 2006, 22 (21) :2688-2690
[25]
Many-core algorithms for statistical phylogenetics [J].
Suchard, Marc A. ;
Rambaut, Andrew .
BIOINFORMATICS, 2009, 25 (11) :1370-1376
[26]
Sul SJ, 2009, LECT N BIOINFORMAT, V5542, P100
[27]
Swofford D., 1993, PAUP: Phylogenetic Analysis Using Parsimony