A Generic Multivariate Framework for the Integration of Microbiome Longitudinal Studies With Other Data Types

被引:40
作者
Bodein, Antoine [1 ]
Chapleur, Olivier [2 ]
Droit, Arnaud [1 ]
Cao, Kim-Anh Le [3 ]
机构
[1] Univ Laval, CHU Quebec Res Ctr, Mol Med Dept, Quebec City, PQ, Canada
[2] Irstea, Hydrosyst & Biopresses Res Unit, Antony, France
[3] Univ Melbourne, Sch Math & Stat, Melbourne Integrat Genom, Melbourne, Vic, Australia
基金
英国医学研究理事会;
关键词
time course; data integration; splines; feature selection; dimension reduction; multi-omics; ANAEROBIC-DIGESTION; MASS-SPECTROMETRY; PRINCIPAL; PHENOL; METAGENOMICS; DEGRADATION; COMPONENTS; SELECTION;
D O I
10.3389/fgene.2019.00963
中图分类号
Q3 [遗传学];
学科分类号
071007 [遗传学];
摘要
Simultaneous profiling of biospecimens using different technological platforms enables the study of many data types, encompassing microbial communities, omics, and meta-omics as well as clinical or chemistry variables. Reduction in costs now enables longitudinal or time course studies on the same biological material or system. The overall aim of such studies is to investigate relationships between these longitudinal measures in a holistic manner to further decipher the link between molecular mechanisms and microbial community structures, or host-microbiota interactions. However, analytical frameworks enabling an integrated analysis between microbial communities and other types of biological, clinical, or phenotypic data are still in their infancy. The challenges include few time points that may be unevenly spaced and unmatched between different data types, a small number of unique individual biospecimens, and high individual variability. Those challenges are further exacerbated by the inherent characteristics of microbial communities-derived data (e.g., sparse, compositional). We propose a generic data-driven framework to integrate different types of longitudinal data measured on the same biological specimens with microbial community data and select key temporal features with strong associations within the same sample group. The framework ranges from filtering and modeling to integration using smoothing splines and multivariate dimension reduction methods to address some of the analytical challenges of microbiome-derived data. We illustrate our framework on different types of multi-omics case studies in bioreactor experiments as well as human studies.
引用
收藏
页数:18
相关论文
共 70 条
[1]
Temporal probabilistic modeling of bacterial compositions derived from 16S rRNA sequencing [J].
Aijo, Tarmo ;
Muller, Christian L. ;
Bonneau, Richard .
BIOINFORMATICS, 2018, 34 (03) :372-380
[2]
AITCHISON J, 1982, J ROY STAT SOC B, V44, P139
[3]
[Anonymous], 2005, PRINCIPAL COMPONENT, DOI [DOI 10.1002/0470013192.BSA501, 10.1002/0470013192.bsa501]
[4]
Compact graphical representation of phylogenetic data and metadata with GraPhlAn [J].
Asnicar, Francesco ;
Weingart, George ;
Tickle, Timothy L. ;
Huttenhower, Curtis ;
Segata, Nicola .
PEERJ, 2015, 3
[5]
Badri M., 2018, bioRxiv, P406264, DOI 10.1101/406264
[6]
'TIME': A Web Application for Obtaining Insights into Microbial Ecology Using Longitudinal Microbiome Data [J].
Baksi, Krishanu D. ;
Kuntal, Bhusan K. ;
Mande, Sharmila S. .
FRONTIERS IN MICROBIOLOGY, 2018, 9
[7]
Bing M., 2012, ANNU REV IMMUNOL, V66, P371
[8]
Lignin biosynthesis [J].
Boerjan, W ;
Ralph, J ;
Baucher, M .
ANNUAL REVIEW OF PLANT BIOLOGY, 2003, 54 :519-546
[9]
SIMPLE TEST FOR HETEROSCEDASTICITY AND RANDOM COEFFICIENT VARIATION [J].
BREUSCH, TS ;
PAGAN, AR .
ECONOMETRICA, 1979, 47 (05) :1287-1294
[10]
MDSINE: Microbial Dynamical Systems INference Engine for microbiome time-series analyses [J].
Bucci, Vanni ;
Tzen, Belinda ;
Li, Ning ;
Simmons, Matt ;
Tanoue, Takeshi ;
Bogart, Elijah ;
Deng, Luxue ;
Yeliseyev, Vladimir ;
Delaney, Mary L. ;
Liu, Qing ;
Olle, Bernat ;
Stein, Richard R. ;
Honda, Kenya ;
Bry, Lynn ;
Gerber, Georg K. .
GENOME BIOLOGY, 2016, 17