MixMC: A Multivariate Statistical Framework to Gain Insight into Microbial Communities

被引:114
作者
Cao, Kim-Anh Le [1 ]
Costello, Mary-Ellen [1 ,5 ]
Lakis, Vanessa Anne [1 ]
Bartolo, Francois [2 ]
Chua, Xin-Yi [3 ]
Brazeilles, Remi [4 ]
Rondeau, Pascale [4 ]
机构
[1] Univ Queensland, Diamantina Inst, Translat Res Inst, Brisbane, Qld, Australia
[2] Univ Toulouse, Inst Math Toulouse, UMR CNRS INSA 5219, Toulouse, France
[3] Inst Mol Biosci, Queensland Facil Adv Bioinformat, Brisbane, Qld, Australia
[4] Danone Nutr Res, Palaiseau, France
[5] Queensland Univ Technol, Translat Res Inst, Brisbane, Qld 4102, Australia
基金
英国医学研究理事会;
关键词
RARE BIOSPHERE; GUT; SELECTION; WRINKLES; OBESITY; LEAD; DIET;
D O I
10.1371/journal.pone.0160169
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
070301 [无机化学]; 070403 [天体物理学]; 070507 [自然资源与国土空间规划学]; 090105 [作物生产系统与生态工程];
摘要
Culture independent techniques, such as shotgun metagenomics and 16S rRNA amplicon sequencing have dramatically changed the way we can examine microbial communities. Recently, changes in microbial community structure and dynamics have been associated with a growing list of human diseases. The identification and comparison of bacteria driving those changes requires the development of sound statistical tools, especially if microbial biomarkers are to be used in a clinical setting. We present mixMC, a novel multivariate data analysis framework for metagenomic biomarker discovery. mixMC accounts for the compositional nature of 16S data and enables detection of subtle differences when high inter-subject variability is present due to microbial sampling performed repeatedly on the same subjects, but in multiple habitats. Through data dimension reduction the multivariate methods provide insightful graphical visualisations to characterise each type of environment in a detailed manner. We applied mixMC to 16S microbiome studies focusing on multiple body sites in healthy individuals, compared our results with existing statistical tools and illustrated added value of using multivariate methodologies to fully characterise and compare microbial communities.
引用
收藏
页数:21
相关论文
共 52 条
[1]
AITCHISON J, 1982, J ROY STAT SOC B, V44, P139
[2]
Anders S., 2010, GENOME BIOL, V11, pR106, DOI [10.1186/gb-2010-11-10-r106, DOI 10.1186/gb-2010-11-10-r106]
[3]
[Anonymous], 2016, mixOmics: Omics Data Integration Project. R package version 6.1.1
[4]
[Anonymous], 2011, ROBCOMPOSITIONS R PA
[5]
Enterotypes of the human gut microbiome [J].
Arumugam, Manimozhiyan ;
Raes, Jeroen ;
Pelletier, Eric ;
Le Paslier, Denis ;
Yamada, Takuji ;
Mende, Daniel R. ;
Fernandes, Gabriel R. ;
Tap, Julien ;
Bruls, Thomas ;
Batto, Jean-Michel ;
Bertalan, Marcelo ;
Borruel, Natalia ;
Casellas, Francesc ;
Fernandez, Leyden ;
Gautier, Laurent ;
Hansen, Torben ;
Hattori, Masahira ;
Hayashi, Tetsuya ;
Kleerebezem, Michiel ;
Kurokawa, Ken ;
Leclerc, Marion ;
Levenez, Florence ;
Manichanh, Chaysavanh ;
Nielsen, H. Bjorn ;
Nielsen, Trine ;
Pons, Nicolas ;
Poulain, Julie ;
Qin, Junjie ;
Sicheritz-Ponten, Thomas ;
Tims, Sebastian ;
Torrents, David ;
Ugarte, Edgardo ;
Zoetendal, Erwin G. ;
Wang, Jun ;
Guarner, Francisco ;
Pedersen, Oluf ;
de Vos, Willem M. ;
Brunak, Soren ;
Dore, Joel ;
Weissenbach, Jean ;
Ehrlich, S. Dusko ;
Bork, Peer .
NATURE, 2011, 473 (7346) :174-180
[6]
Compact graphical representation of phylogenetic data and metadata with GraPhlAn [J].
Asnicar, Francesco ;
Weingart, George ;
Tickle, Timothy L. ;
Huttenhower, Curtis ;
Segata, Nicola .
PEERJ, 2015, 3
[7]
Investigating microbial co-occurrence patterns based on metagenomic compositional data [J].
Ban, Yuguang ;
An, Lingling ;
Jiang, Hongmei .
BIOINFORMATICS, 2015, 31 (20) :3322-3329
[8]
CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[9]
Bokulich NA, 2013, NAT METHODS, V10, P57, DOI [10.1038/NMETH.2276, 10.1038/nmeth.2276]
[10]
AN ORDINATION OF THE UPLAND FOREST COMMUNITIES OF SOUTHERN WISCONSIN [J].
BRAY, JR ;
CURTIS, JT .
ECOLOGICAL MONOGRAPHS, 1957, 27 (04) :326-349