Automated Reconstruction of Whole-Genome Phylogenies from Short-Sequence Reads

被引:359
作者
Bertels, Frederic [1 ,2 ,3 ,4 ]
Silander, Olin K. [1 ,2 ]
Pachkov, Mikhail [1 ,2 ]
Rainey, Paul B. [3 ,4 ,5 ]
van Nimwegen, Erik [1 ,2 ]
机构
[1] Univ Basel, Biozentrum, Basel, Switzerland
[2] Swiss Inst Bioinformat, Basel, Switzerland
[3] Massey Univ Albany, New Zealand Inst Adv Study, Auckland, New Zealand
[4] Massey Univ Albany, Allan Wilson Ctr Mol Ecol & Evolut, Auckland, New Zealand
[5] Max Planck Inst Evolutionary Biol, Plon, Germany
基金
瑞士国家科学基金会;
关键词
next-generation sequencing; Escherichia coli; Pseudomonas syringae; RESISTANT STAPHYLOCOCCUS-AUREUS; ESCHERICHIA-COLI; LIKELIHOOD APPROACH; EVOLUTION; TRANSMISSION; ADAPTATION; MECHANISM; ALIGNMENT; SPREAD; TREES;
D O I
10.1093/molbev/msu088
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
070307 [化学生物学]; 071010 [生物化学与分子生物学];
摘要
Studies of microbial evolutionary dynamics are being transformed by the availability of affordable high-throughput sequencing technologies, which allow whole-genome sequencing of hundreds of related taxa in a single study. Reconstructing a phylogenetic tree of these taxa is generally a crucial step in any evolutionary analysis. Instead of constructing genome assemblies for all taxa, annotating these assemblies, and aligning orthologous genes, many recent studies 1) directly map raw sequencing reads to a single reference sequence, 2) extract single nucleotide polymorphisms (SNPs), and 3) infer the phylogenetic tree using maximum likelihood methods from the aligned SNP positions. However, here we show that, when using such methods to reconstruct phylogenies from sets of simulated sequences, both the exclusion of nonpolymorphic positions and the alignment to a single reference genome, introduce systematic biases and errors in phylogeny reconstruction. To address these problems, we developed a new method that combines alignments from mappings to multiple reference sequences and show that this successfully removes biases from the reconstructed phylogenies. We implemented this method as a web server named REALPHY (Reference sequence Alignment-based Phylogeny builder), which fully automates phylogenetic reconstruction from raw sequencing reads.
引用
收藏
页码:1077 / 1088
页数:12
相关论文
共 36 条
[1]
Dynamic Evolution of Pathogenicity Revealed by Sequencing and Comparative Genomics of 19 Pseudomonas syringae Isolates [J].
Baltrus, David A. ;
Nishimura, Marc T. ;
Romanchuk, Artur ;
Chang, Jeff H. ;
Mukhtar, M. Shahid ;
Cherkis, Karen ;
Roach, Jeff ;
Grant, Sarah R. ;
Jones, Corbin D. ;
Dangl, Jeffery L. .
PLOS PATHOGENS, 2011, 7 (07)
[2]
Multiple sequence alignment with the Clustal series of programs [J].
Chenna, R ;
Sugawara, H ;
Koike, T ;
Lopez, R ;
Gibson, TJ ;
Higgins, DG ;
Thompson, JD .
NUCLEIC ACIDS RESEARCH, 2003, 31 (13) :3497-3500
[3]
Comparative genomics reveals mechanism for short-term and long-term clonal transitions in pandemic Vibrio cholerae [J].
Chun, Jongsik ;
Grim, Christopher J. ;
Hasan, Nur A. ;
Lee, Je Hee ;
Choi, Seon Young ;
Haley, Bradd J. ;
Taviani, Elisa ;
Jeon, Yoon-Seong ;
Kim, Dong Wook ;
Lee, Jae-Hak ;
Brettin, Thomas S. ;
Bruce, David C. ;
Challacombe, Jean F. ;
Detter, J. Chris ;
Han, Cliff S. ;
Munk, A. Christine ;
Chertkov, Olga ;
Meincke, Linda ;
Saunders, Elizabeth ;
Walters, Ronald A. ;
Huq, Anwar ;
Nair, G. Balakrish ;
Colwell, Rita R. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2009, 106 (36) :15442-15447
[4]
Rapid Pneumococcal Evolution in Response to Clinical Interventions [J].
Croucher, Nicholas J. ;
Harris, Simon R. ;
Fraser, Christophe ;
Quail, Michael A. ;
Burton, John ;
van der Linden, Mark ;
McGee, Lesley ;
von Gottberg, Anne ;
Song, Jae Hoon ;
Ko, Kwan Soo ;
Pichon, Bruno ;
Baker, Stephen ;
Parry, Christopher M. ;
Lambertsen, Lotte M. ;
Shahinas, Dea ;
Pillai, Dylan R. ;
Mitchell, Timothy J. ;
Dougan, Gordon ;
Tomasz, Alexander ;
Klugman, Keith P. ;
Parkhill, Julian ;
Hanage, William P. ;
Bentley, Stephen D. .
SCIENCE, 2011, 331 (6016) :430-434
[5]
Historical variations in mutation rate in an epidemic pathogen, Yersinia pestis [J].
Cui, Yujun ;
Yu, Chang ;
Yan, Yanfeng ;
Li, Dongfang ;
Li, Yanjun ;
Jombart, Thibaut ;
Weinert, Lucy A. ;
Wang, Zuyun ;
Guo, Zhaobiao ;
Xu, Lizhi ;
Zhang, Yujiang ;
Zheng, Hancheng ;
Qin, Nan ;
Xiao, Xiao ;
Wu, Mingshou ;
Wang, Xiaoyi ;
Zhou, Dongsheng ;
Qi, Zhizhen ;
Du, Zongmin ;
Wu, Honglong ;
Yang, Xianwei ;
Cao, Hongzhi ;
Wang, Hu ;
Wang, Jing ;
Yao, Shusen ;
Rakin, Alexander ;
Li, Yingrui ;
Falush, Daniel ;
Balloux, Francois ;
Achtman, Mark ;
Song, Yajun ;
Wang, Jun ;
Yang, Ruifu .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2013, 110 (02) :577-582
[6]
Impact of recombination on bacterial evolution [J].
Didelot, Xavier ;
Maiden, Martin C. J. .
TRENDS IN MICROBIOLOGY, 2010, 18 (07) :315-322
[7]
Population Genomics of the Facultatively Mutualistic Bacteria Sinorhizobium meliloti and S. medicae [J].
Epstein, Brendan ;
Branca, Antoine ;
Mudge, Joann ;
Bharti, Arvind K. ;
Briskine, Roman ;
Farmer, Andrew D. ;
Sugawara, Masayuki ;
Young, Nevin D. ;
Sadowsky, Michael J. ;
Tiffin, Peter .
PLOS GENETICS, 2012, 8 (08)
[8]
EVOLUTIONARY TREES FROM DNA-SEQUENCES - A MAXIMUM-LIKELIHOOD APPROACH [J].
FELSENSTEIN, J .
JOURNAL OF MOLECULAR EVOLUTION, 1981, 17 (06) :368-376
[9]
Felsenstein J., 2004, Phylip (phylogeny inference package) version 3.6
[10]
Insights on evolution of virulence and resistance from the complete genome analysis of an early methicillin-resistant Staphylococcus aureus strain and a biofilm-producing methicillin-resistant Staphylococcus epidermidis strain [J].
Gill, SR ;
Fouts, DE ;
Archer, GL ;
Mongodin, EF ;
DeBoy, RT ;
Ravel, J ;
Paulsen, IT ;
Kolonay, JF ;
Brinkac, L ;
Beanan, M ;
Dodson, RJ ;
Daugherty, SC ;
Madupu, R ;
Angiuoli, SV ;
Durkin, AS ;
Haft, DH ;
Vamathevan, J ;
Khouri, H ;
Utterback, T ;
Lee, C ;
Dimitrov, G ;
Jiang, LX ;
Qin, HY ;
Weidman, J ;
Tran, K ;
Kang, K ;
Hance, IR ;
Nelson, KE ;
Fraser, CM .
JOURNAL OF BACTERIOLOGY, 2005, 187 (07) :2426-2438