Identifying biologically relevant differences between metagenomic communities

Cited by: 756
Authors
Parks, Donovan H. [1]
Beiko, Robert G. [1]
Affiliations
[1] Dalhousie Univ, Fac Comp Sci, Halifax, NS B3H 1W5, Canada
Keywords
STATISTICAL SIGNIFICANCE; CONFIDENCE-INTERVALS; GUT MICROBIOME; ODDS RATIO; PROPORTIONS; RESOURCE; GENES; TOOLS;
DOI
10.1093/bioinformatics/btq041
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Motivation: Metagenomics is the study of genetic material recovered directly from environmental samples. Taxonomic and functional differences between metagenomic samples can highlight the influence of ecological factors on patterns of microbial life in a wide range of habitats. Statistical hypothesis tests can help us distinguish ecological influences from sampling artifacts, but knowledge of only the P-value from a statistical hypothesis test is insufficient to make inferences about biological relevance. Current reporting practices for pairwise comparative metagenomics are inadequate, and better tools are needed for comparative metagenomic analysis.

Results: We have developed a new software package, STAMP, for comparative metagenomics that supports best practices in analysis and reporting. Examination of a pair of iron mine metagenomes demonstrates that deeper biological insights can be gained using statistical techniques available in our software. An analysis of the functional potential of 'Candidatus Accumulibacter phosphatis' in two enhanced biological phosphorus removal metagenomes identified several subsystems that differ between the A. phosphatis strains in these related communities, including phosphate metabolism, secretion and metal transport.
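The point that a P-value alone says little about biological relevance can be illustrated with a small sketch. The function below is not STAMP's implementation; it is a hypothetical helper (`diff_proportions`) that, for one feature counted in two samples, reports the difference in proportions with an asymptotic (Wald) 95% confidence interval alongside a two-sided z-test P-value. The counts in the usage lines are invented for illustration.

```python
import math


def diff_proportions(x1, n1, x2, n2, z=1.96):
    """Return (difference, (ci_low, ci_high), p_value) for two proportions.

    x1, x2: reads assigned to the feature in sample 1 and sample 2.
    n1, n2: total reads in each sample.
    Illustrative sketch only; asymptotic formulas, not STAMP's code.
    """
    p1, p2 = x1 / n1, x2 / n2
    diff = p1 - p2

    # Unpooled standard error, used for the confidence interval.
    se_ci = math.sqrt(p1 * (1 - p1) / n1 + p2 * (1 - p2) / n2)
    ci = (diff - z * se_ci, diff + z * se_ci)

    # Pooled standard error, used to test H0: p1 == p2.
    pooled = (x1 + x2) / (n1 + n2)
    se_test = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    if se_test == 0:
        # Feature absent from (or saturating) both samples: no evidence.
        return diff, ci, 1.0

    z_stat = diff / se_test
    # Two-sided P-value from the standard normal distribution.
    p_value = math.erfc(abs(z_stat) / math.sqrt(2))
    return diff, ci, p_value


# Hypothetical counts: 1000 vs 1200 reads out of one million in each sample.
diff, ci, p_value = diff_proportions(1000, 1_000_000, 1200, 1_000_000)
print(f"difference = {diff:.6f}, 95% CI = ({ci[0]:.6f}, {ci[1]:.6f})")
print(f"P-value = {p_value:.2e}")
```

With counts like these the test is highly significant, yet the difference is a small fraction of a percentage point, which is the kind of case where an effect size and its confidence interval say more than the P-value. For small counts an exact test would be a safer choice than these asymptotic formulas.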
Pages: 715-721
Page count: 7