Precise phylogenetic analysis of microbial isolates and genomes from metagenomes using PhyloPhlAn 3.0

被引:526
作者
Asnicar, Francesco [1 ]
Thomas, Andrew Maltez [1 ]
Beghini, Francesco [1 ]
Mengoni, Claudia [1 ]
Manara, Serena [1 ]
Manghi, Paolo [1 ]
Zhu, Qiyun [2 ]
Bolzan, Mattia [1 ,9 ]
Cumbo, Fabio [1 ]
May, Uyen [3 ]
Sanders, Jon G. [2 ,12 ]
Zolfo, Moreno [1 ]
Kopylova, Evguenia [2 ,11 ]
Pasolli, Edoardo [1 ,10 ]
Knight, Rob [2 ,4 ,5 ,6 ]
Mirarab, Siavash [3 ]
Huttenhower, Curtis [7 ,8 ]
Segata, Nicola [1 ]
机构
[1] Univ Trento, Dept CIBIO, Trento, Italy
[2] Univ Calif San Diego, Dept Pediat, La Jolla, CA 92093 USA
[3] Univ Calif San Diego, Dept Elect & Comp Engn, La Jolla, CA 92093 USA
[4] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
[5] Univ Calif San Diego, Ctr Microbiome Innovat, La Jolla, CA 92093 USA
[6] Univ Calif San Diego, Dept Bioengn, La Jolla, CA 92093 USA
[7] Harvard TH Chan Sch Publ Hlth, Dept Biostat, Boston, MA USA
[8] Broad Inst MIT & Harvard, Cambridge, MA 02142 USA
[9] PreBiomics Srl, Trento, Italy
[10] Univ Naples Federico II, Dept Agr Sci, Portici, Italy
[11] Clar Genom BVBA, Sint Michielskaai 34, B-2000 Antwerp, Belgium
[12] Cornell Univ, Cornell Inst Host Microbe Interact & Dis, Ithaca, NY USA
基金
欧洲研究理事会; 美国国家卫生研究院;
关键词
MULTIPLE SEQUENCE ALIGNMENT; TREE; INFORMATION; ACCURACY; BACTERIA; BLOCKS; SEARCH; TOOL;
D O I
10.1038/s41467-020-16366-7
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
070301 [无机化学]; 070403 [天体物理学]; 070507 [自然资源与国土空间规划学]; 090105 [作物生产系统与生态工程];
摘要
Microbial genomes are available at an ever-increasing pace, as cultivation and sequencing become cheaper and obtaining metagenome-assembled genomes (MAGs) becomes more effective. Phylogenetic placement methods to contextualize hundreds of thousands of genomes must thus be efficiently scalable and sensitive from closely related strains to divergent phyla. We present PhyloPhlAn 3.0, an accurate, rapid, and easy-to-use method for large-scale microbial genome characterization and phylogenetic analysis at multiple levels of resolution. PhyloPhlAn 3.0 can assign genomes from isolate sequencing or MAGs to species-level genome bins built from >230,000 publically available sequences. For individual clades of interest, it reconstructs strain-level phylogenies from among the closest species using clade-specific maximally informative markers. At the other extreme of resolution, it scales to large phylogenies comprising >17,000 microbial species. Examples including Staphylococcus aureus isolates, gut metagenomes, and meta-analyses demonstrate the ability of PhyloPhlAn 3.0 to support genomic and metagenomic analyses.
引用
收藏
页数:10
相关论文
共 73 条
[11]
trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses [J].
Capella-Gutierrez, Salvador ;
Silla-Martinez, Jose M. ;
Gabaldon, Toni .
BIOINFORMATICS, 2009, 25 (15) :1972-1973
[12]
Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis [J].
Castresana, J .
MOLECULAR BIOLOGY AND EVOLUTION, 2000, 17 (04) :540-552
[13]
TCS: A New Multiple Sequence Alignment Reliability Measure to Estimate Alignment Accuracy and Improve Phylogenetic Tree Reconstruction [J].
Chang, Jia-Ming ;
Di Tommaso, Paolo ;
Notredame, Cedric .
MOLECULAR BIOLOGY AND EVOLUTION, 2014, 31 (06) :1625-1637
[14]
Dense genomic sampling identifies highways of pneumococcal recombination [J].
Chewapreecha, Claire ;
Harris, Simon R. ;
Croucher, Nicholas J. ;
Turner, Claudia ;
Marttinen, Pekka ;
Cheng, Lu ;
Pessia, Alberto ;
Aanensen, David M. ;
Mather, Alison E. ;
Page, Andrew J. ;
Salter, Susannah J. ;
Harris, David ;
Nosten, Francois ;
Goldblatt, David ;
Corander, Jukka ;
Parkhill, Julian ;
Turner, Paul ;
Bentley, Stephen D. .
NATURE GENETICS, 2014, 46 (03) :305-+
[15]
Toward automatic reconstruction of a highly resolved tree of life [J].
Ciccarelli, FD ;
Doerks, T ;
von Mering, C ;
Creevey, CJ ;
Snel, B ;
Bork, P .
SCIENCE, 2006, 311 (5765) :1283-1287
[16]
PhyloSift: phylogenetic analysis of genomes and metagenomes [J].
Darling, Aaron E. ;
Jospin, Guillaume ;
Lowe, Eric ;
Matsen, Frederick A., IV ;
Bik, Holly M. ;
Eisen, Jonathan A. .
PEERJ, 2014, 2
[17]
Gut Microbial Succession Follows Acute Secretory Diarrhea in Humans [J].
David, Lawrence A. ;
Weil, Ana ;
Ryan, Edward T. ;
Calderwood, Stephen B. ;
Harris, Jason B. ;
Chowdhury, Fahima ;
Begum, Yasmin ;
Qadri, Firdausi ;
LaRocque, Regina C. ;
Turnbaugh, Peter J. .
MBIO, 2015, 6 (03) :1-14
[18]
Noisy:: Identification of problematic columns in multiple sequence alignments [J].
Dress, Andreas W. M. ;
Flamm, Christoph ;
Fritzsch, Guido ;
Gruenewald, Stefan ;
Kruspe, Matthias ;
Prohaska, Sonja J. ;
Stadler, Peter F. .
ALGORITHMS FOR MOLECULAR BIOLOGY, 2008, 3 (1)
[19]
MUSCLE: multiple sequence alignment with high accuracy and high throughput [J].
Edgar, RC .
NUCLEIC ACIDS RESEARCH, 2004, 32 (05) :1792-1797
[20]
Search and clustering orders of magnitude faster than BLAST [J].
Edgar, Robert C. .
BIOINFORMATICS, 2010, 26 (19) :2460-2461