METAGENassist: a comprehensive web server for comparative metagenomics

被引:302
作者
Arndt, David [1 ]
Xia, Jianguo [2 ]
Liu, Yifeng [1 ]
Zhou, You [2 ]
Guo, An Chi [1 ]
Cruz, Joseph A. [1 ]
Sinelnikov, Igor [1 ]
Budwill, Karen [3 ]
Nesbo, Camilla L. [2 ,4 ]
Wishart, David S. [1 ,2 ,5 ]
机构
[1] Univ Alberta, Dept Comp Sci, Edmonton, AB, Canada
[2] Univ Alberta, Dept Biol Sci, Edmonton, AB, Canada
[3] Alberta Innovates Technol Futures, Environm & Carbon Management Div, Edmonton, AB, Canada
[4] Univ Oslo, Dept Biol, CEES, Oslo, Norway
[5] Natl Inst Nanotechnol NINT, Natl Res Council, Edmonton, AB T6G 2E8, Canada
基金
加拿大健康研究院;
关键词
RESOURCE; BACMAP; ATLAS;
D O I
10.1093/nar/gks497
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
With recent improvements in DNA sequencing and sample extraction techniques, the quantity and quality of metagenomic data are now growing exponentially. This abundance of richly annotated metagenomic data and bacterial census information has spawned a new branch of microbiology called comparative metagenomics. Comparative metagenomics involves the comparison of bacterial populations between different environmental samples, different culture conditions or different microbial hosts. However, in order to do comparative metagenomics, one typically requires a sophisticated knowledge of multivariate statistics and/or advanced software programming skills. To make comparative metagenomics more accessible to microbiologists, we have developed a freely accessible, easy-to-use web server for comparative metagenomic analysis called METAGENassist. Users can upload their bacterial census data from a wide variety of common formats, using either amplified 16S rRNA data or shotgun metagenomic data. Metadata concerning environmental, culture, or host conditions can also be uploaded. During the data upload process, METAGENassist also performs an automated taxonomic-to-phenotypic mapping. Phenotypic information covering nearly 20 functional categories such as GC content, genome size, oxygen requirements, energy sources and preferred temperature range is automatically generated from the taxonomic input data. Using this phenotypically enriched data, users can then perform a variety of multivariate and univariate data analyses including fold change analysis, t-tests, PCA, PLS-DA, clustering and classification. To facilitate data processing, users are guided through a step-by-step analysis workflow using a variety of menus, information hyperlinks and check boxes. METAGENassist also generates colorful, publication quality tables and graphs that can be downloaded and used directly in the preparation of scientific papers. METAGENassist is available at http://www.metagenassist.ca.
引用
收藏
页码:W88 / W95
页数:8
相关论文
共 22 条
[1]   Enterotypes of the human gut microbiome [J].
Arumugam, Manimozhiyan ;
Raes, Jeroen ;
Pelletier, Eric ;
Le Paslier, Denis ;
Yamada, Takuji ;
Mende, Daniel R. ;
Fernandes, Gabriel R. ;
Tap, Julien ;
Bruls, Thomas ;
Batto, Jean-Michel ;
Bertalan, Marcelo ;
Borruel, Natalia ;
Casellas, Francesc ;
Fernandez, Leyden ;
Gautier, Laurent ;
Hansen, Torben ;
Hattori, Masahira ;
Hayashi, Tetsuya ;
Kleerebezem, Michiel ;
Kurokawa, Ken ;
Leclerc, Marion ;
Levenez, Florence ;
Manichanh, Chaysavanh ;
Nielsen, H. Bjorn ;
Nielsen, Trine ;
Pons, Nicolas ;
Poulain, Julie ;
Qin, Junjie ;
Sicheritz-Ponten, Thomas ;
Tims, Sebastian ;
Torrents, David ;
Ugarte, Edgardo ;
Zoetendal, Erwin G. ;
Wang, Jun ;
Guarner, Francisco ;
Pedersen, Oluf ;
de Vos, Willem M. ;
Brunak, Soren ;
Dore, Joel ;
Weissenbach, Jean ;
Ehrlich, S. Dusko ;
Bork, Peer .
NATURE, 2011, 473 (7346) :174-180
[2]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[3]   QIIME allows analysis of high-throughput community sequencing data [J].
Caporaso, J. Gregory ;
Kuczynski, Justin ;
Stombaugh, Jesse ;
Bittinger, Kyle ;
Bushman, Frederic D. ;
Costello, Elizabeth K. ;
Fierer, Noah ;
Pena, Antonio Gonzalez ;
Goodrich, Julia K. ;
Gordon, Jeffrey I. ;
Huttley, Gavin A. ;
Kelley, Scott T. ;
Knights, Dan ;
Koenig, Jeremy E. ;
Ley, Ruth E. ;
Lozupone, Catherine A. ;
McDonald, Daniel ;
Muegge, Brian D. ;
Pirrung, Meg ;
Reeder, Jens ;
Sevinsky, Joel R. ;
Tumbaugh, Peter J. ;
Walters, William A. ;
Widmann, Jeremy ;
Yatsunenko, Tanya ;
Zaneveld, Jesse ;
Knight, Rob .
NATURE METHODS, 2010, 7 (05) :335-336
[4]   BacMap: an up-to-date electronic atlas of annotated bacterial genomes [J].
Cruz, Joseph ;
Liu, Yifeng ;
Liang, Yongjie ;
Zhou, You ;
Wilson, Michael ;
Dennis, Jonathan J. ;
Stothard, Paul ;
Van Domselaar, Gary ;
Wishart, David S. .
NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) :D599-D604
[5]   GEPAS:: a web-based resource for microarray gene expression data analysis [J].
Herrero, J ;
Al-Shahrour, F ;
Díaz-Uriarte, R ;
Mateos, A ;
Vaquerizas, JM ;
Santoyo, J ;
Dopazo, J .
NUCLEIC ACIDS RESEARCH, 2003, 31 (13) :3461-3467
[6]   Integrative analysis of environmental sequences using MEGAN4 [J].
Huson, Daniel H. ;
Mitra, Suparna ;
Ruscheweyh, Hans-Joachim ;
Weber, Nico ;
Schuster, Stephan C. .
GENOME RESEARCH, 2011, 21 (09) :1552-1560
[7]   Supervised classification of human microbiota [J].
Knights, Dan ;
Costello, Elizabeth K. ;
Knight, Rob .
FEMS MICROBIOLOGY REVIEWS, 2011, 35 (02) :343-359
[8]   APPLICATIONS OF NEXT-GENERATION SEQUENCING Sequencing technologies - the next generation [J].
Metzker, Michael L. .
NATURE REVIEWS GENETICS, 2010, 11 (01) :31-46
[9]   The metagenomics RAST server - a public resource for the automatic phylogenetic and functional analysis of metagenomes [J].
Meyer, F. ;
Paarmann, D. ;
D'Souza, M. ;
Olson, R. ;
Glass, E. M. ;
Kubal, M. ;
Paczian, T. ;
Rodriguez, A. ;
Stevens, R. ;
Wilke, A. ;
Wilkening, J. ;
Edwards, R. A. .
BMC BIOINFORMATICS, 2008, 9 (1)
[10]   Visual and statistical comparison of metagenomes [J].
Mitra, Suparna ;
Klar, Bernhard ;
Huson, Daniel H. .
BIOINFORMATICS, 2009, 25 (15) :1849-1855