Precise phylogenetic analysis of microbial isolates and genomes from metagenomes using PhyloPhlAn 3.0

被引:526
作者
Asnicar, Francesco [1 ]
Thomas, Andrew Maltez [1 ]
Beghini, Francesco [1 ]
Mengoni, Claudia [1 ]
Manara, Serena [1 ]
Manghi, Paolo [1 ]
Zhu, Qiyun [2 ]
Bolzan, Mattia [1 ,9 ]
Cumbo, Fabio [1 ]
May, Uyen [3 ]
Sanders, Jon G. [2 ,12 ]
Zolfo, Moreno [1 ]
Kopylova, Evguenia [2 ,11 ]
Pasolli, Edoardo [1 ,10 ]
Knight, Rob [2 ,4 ,5 ,6 ]
Mirarab, Siavash [3 ]
Huttenhower, Curtis [7 ,8 ]
Segata, Nicola [1 ]
机构
[1] Univ Trento, Dept CIBIO, Trento, Italy
[2] Univ Calif San Diego, Dept Pediat, La Jolla, CA 92093 USA
[3] Univ Calif San Diego, Dept Elect & Comp Engn, La Jolla, CA 92093 USA
[4] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
[5] Univ Calif San Diego, Ctr Microbiome Innovat, La Jolla, CA 92093 USA
[6] Univ Calif San Diego, Dept Bioengn, La Jolla, CA 92093 USA
[7] Harvard TH Chan Sch Publ Hlth, Dept Biostat, Boston, MA USA
[8] Broad Inst MIT & Harvard, Cambridge, MA 02142 USA
[9] PreBiomics Srl, Trento, Italy
[10] Univ Naples Federico II, Dept Agr Sci, Portici, Italy
[11] Clar Genom BVBA, Sint Michielskaai 34, B-2000 Antwerp, Belgium
[12] Cornell Univ, Cornell Inst Host Microbe Interact & Dis, Ithaca, NY USA
基金
欧洲研究理事会; 美国国家卫生研究院;
关键词
MULTIPLE SEQUENCE ALIGNMENT; TREE; INFORMATION; ACCURACY; BACTERIA; BLOCKS; SEARCH; TOOL;
D O I
10.1038/s41467-020-16366-7
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
070301 [无机化学]; 070403 [天体物理学]; 070507 [自然资源与国土空间规划学]; 090105 [作物生产系统与生态工程];
摘要
Microbial genomes are available at an ever-increasing pace, as cultivation and sequencing become cheaper and obtaining metagenome-assembled genomes (MAGs) becomes more effective. Phylogenetic placement methods to contextualize hundreds of thousands of genomes must thus be efficiently scalable and sensitive from closely related strains to divergent phyla. We present PhyloPhlAn 3.0, an accurate, rapid, and easy-to-use method for large-scale microbial genome characterization and phylogenetic analysis at multiple levels of resolution. PhyloPhlAn 3.0 can assign genomes from isolate sequencing or MAGs to species-level genome bins built from >230,000 publically available sequences. For individual clades of interest, it reconstructs strain-level phylogenies from among the closest species using clade-specific maximally informative markers. At the other extreme of resolution, it scales to large phylogenies comprising >17,000 microbial species. Examples including Staphylococcus aureus isolates, gut metagenomes, and meta-analyses demonstrate the ability of PhyloPhlAn 3.0 to support genomic and metagenomic analyses.
引用
收藏
页数:10
相关论文
共 73 条
[1]
Database resources of the National Center for Biotechnology Information [J].
Acland, Abigail ;
Agarwala, Richa ;
Barrett, Tanya ;
Beck, Jeff ;
Benson, Dennis A. ;
Bollin, Colleen ;
Bolton, Evan ;
Bryant, Stephen H. ;
Canese, Kathi ;
Church, Deanna M. ;
Clark, Karen ;
DiCuccio, Michael ;
Dondoshansky, Ilya ;
Federhen, Scott ;
Feolo, Michael ;
Geer, Lewis Y. ;
Gorelenkov, Viatcheslav ;
Hoeppner, Marilu ;
Johnson, Mark ;
Kelly, Christopher ;
Khotomlianski, Viatcheslav ;
Kimchi, Avi ;
Kimelman, Michael ;
Kitts, Paul ;
Krasnov, Sergey ;
Kuznetsov, Anatoliy ;
Landsman, David ;
Lipman, David J. ;
Lu, Zhiyong ;
Madden, Thomas L. ;
Madej, Tom ;
Maglott, Donna R. ;
Marchler-Bauer, Aron ;
Karsch-Mizrachi, Ilene ;
Murphy, Terence ;
Ostell, James ;
O'Sullivan, Christopher ;
Panchenko, Anna ;
Phan, Lon ;
Pruitt, Don Preussm Kim D. ;
Rubinstein, Wendy ;
Sayers, Eric W. ;
Schneider, Valerie ;
Schuler, Gregory D. ;
Sequeira, Edwin ;
Sherry, Stephen T. ;
Shumway, Martin ;
Sirotkin, Karl ;
Siyan, Karanjit ;
Slotta, Douglas .
NUCLEIC ACIDS RESEARCH, 2014, 42 (D1) :D7-D17
[2]
A genomic overview of the population structure of Salmonella [J].
Alikhan, Nabil-Fareed ;
Zhou, Zhemin ;
Sergeant, Martin J. ;
Achtman, Mark .
PLOS GENETICS, 2018, 14 (04)
[3]
BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[4]
Activities at the Universal Protein Resource (UniProt) [J].
Apweiler, Rolf ;
Bateman, Alex ;
Martin, Maria Jesus ;
O'Donovan, Claire ;
Magrane, Michele ;
Alam-Faruque, Yasmin ;
Alpi, Emanuele ;
Antunes, Ricardo ;
Arganiska, Joanna ;
Casanova, Elisabet Barrera ;
Bely, Benoit ;
Bingley, Mark ;
Bonilla, Carlos ;
Britto, Ramona ;
Bursteinas, Borisas ;
Chan, Wei Mun ;
Chavali, Gayatri ;
Cibrian-Uhalte, Elena ;
Da Silva, Alan ;
De Giorgi, Maurizio ;
Dogan, Tunca ;
Fazzini, Francesco ;
Gane, Paul ;
Castro, Leyla Garcia ;
Garmiri, Penelope ;
Hatton-Ellis, Emma ;
Hieta, Reija ;
Huntley, Rachael ;
Legge, Duncan ;
Liu, Wudong ;
Luo, Jie ;
MacDougall, Alistair ;
Mutowo, Prudence ;
Nightingale, Andrew ;
Orchard, Sandra ;
Pichler, Klemens ;
Poggioli, Diego ;
Pundir, Sangya ;
Pureza, Luis ;
Qi, Guoying ;
Rosanoff, Steven ;
Saidi, Rabie ;
Sawford, Tony ;
Shypitsyna, Aleksandra ;
Turner, Edward ;
Volynkin, Vladimir ;
Wardell, Tony ;
Watkins, Xavier ;
Zellner, Hermann ;
Corbett, Matt .
NUCLEIC ACIDS RESEARCH, 2014, 42 (D1) :D191-D198
[5]
Compact graphical representation of phylogenetic data and metadata with GraPhlAn [J].
Asnicar, Francesco ;
Weingart, George ;
Tickle, Timothy L. ;
Huttenhower, Curtis ;
Segata, Nicola .
PEERJ, 2015, 3
[6]
Minimum information about a single amplified genome (MISAG) and a metagenome-assembled genome (MIMAG) of bacteria and archaea [J].
Bowers, Robert M. ;
Kyrpides, Nikos C. ;
Stepanauskas, Ramunas ;
Harmon-Smith, Miranda ;
Doud, Devin ;
Reddy, T. B. K. ;
Schulz, Frederik ;
Jarett, Jessica ;
Rivers, Adam R. ;
Eloe-Fadrosh, Emiley A. ;
Tringe, Susannah G. ;
Ivanova, Natalia N. ;
Copeland, Alex ;
Clum, Alicia ;
Becraft, Eric D. ;
Malmstrom, Rex R. ;
Birren, Bruce ;
Podar, Mircea ;
Bork, Peer ;
Weinstock, George M. ;
Garrity, George M. ;
Dodsworth, Jeremy A. ;
Yooseph, Shibu ;
Sutton, Granger ;
Gloeckner, Frank O. ;
Gilbert, Jack A. ;
Nelson, William C. ;
Hallam, Steven J. ;
Jungbluth, Sean P. ;
Ettema, Thijs J. G. ;
Tighe, Scott ;
Konstantinidis, Konstantinos T. ;
Liu, Wen-Tso ;
Baker, Brett J. ;
Rattei, Thomas ;
Eisen, Jonathan A. ;
Hedlund, Brian ;
McMahon, Katherine D. ;
Fierer, Noah ;
Knight, Rob ;
Finn, Rob ;
Cochrane, Guy ;
Karsch-Mizrachi, Ilene ;
Tyson, Gene W. ;
Rinke, Christian ;
Lapidus, Alla ;
Meyer, Folker ;
Yilmaz, Pelin ;
Parks, Donovan H. ;
Eren, A. M. .
NATURE BIOTECHNOLOGY, 2017, 35 (08) :725-731
[7]
A gene-by-gene population genomics platform: de novo assembly, annotation and genealogical analysis of 108 representative Neisseria meningitidis genomes [J].
Bratcher, Holly B. ;
Corton, Craig ;
Jolley, Keith A. ;
Parkhill, Julian ;
Maiden, Martin C. J. .
BMC GENOMICS, 2014, 15
[8]
Unusual biology across a group comprising more than 15% of domain Bacteria [J].
Brown, Christopher T. ;
Hug, Laura A. ;
Thomas, Brian C. ;
Sharon, Itai ;
Castelle, Cindy J. ;
Singh, Andrea ;
Wilkins, Michael J. ;
Wrighton, Kelly C. ;
Williams, Kenneth H. ;
Banfield, Jillian F. .
NATURE, 2015, 523 (7559) :208-U173
[9]
Fast and sensitive protein alignment using DIAMOND [J].
Buchfink, Benjamin ;
Xie, Chao ;
Huson, Daniel H. .
NATURE METHODS, 2015, 12 (01) :59-60
[10]
Minimizing proteome redundancy in the UniProt Knowledgebase [J].
Bursteinas, Borisas ;
Britto, Ramona ;
Bely, Benoit ;
Auchincloss, Andrea ;
Rivoire, Catherine ;
Redaschi, Nicole ;
O'Donovan, Claire ;
Martin, Maria Jesus .
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2016, :1-9