The GAAS Metagenomic Tool and Its Estimations of Viral and Microbial Average Genome Size in Four Major Biomes

Cited by: 149
Authors
Angly, Florent E. [1 ,2 ]
Willner, Dana [1 ]
Prieto-Davo, Alejandra [1 ]
Edwards, Robert A. [1 ,3 ,4 ]
Schmieder, Robert [2 ,3 ]
Vega-Thurber, Rebecca [6 ]
Antonopoulos, Dionysios A. [5 ]
Barott, Katie [1 ]
Cottrell, Matthew T. [7 ]
Desnues, Christelle [8 ]
Dinsdale, Elizabeth A. [1 ]
Furlan, Mike [1 ]
Haynes, Matthew [1 ,9 ]
Henn, Matthew R.
Hu, Yongfei [10 ]
Kirchman, David L. [7 ]
McDole, Tracey [1 ]
McPherson, John D. [11 ]
Meyer, Folker [4 ]
Miller, R. Michael [5 ]
Mundt, Egbert [12 ]
Naviaux, Robert K. [13 ]
Rodriguez-Mueller, Beltran [1 ,2 ]
Stevens, Rick [4 ]
Wegley, Linda [1 ]
Zhang, Lixin [10 ]
Zhu, Baoli [10 ]
Rohwer, Forest [1 ]
Affiliations
[1] San Diego State Univ, Dept Biol, San Diego, CA 92182 USA
[2] San Diego State Univ, Computat Sci Res Ctr, San Diego, CA 92182 USA
[3] San Diego State Univ, Dept Comp Sci, San Diego, CA 92182 USA
[4] Argonne Natl Lab, Div Math & Comp Sci, Argonne, IL 60439 USA
[5] Argonne Natl Lab, Biosci Div, Argonne, IL 60439 USA
[6] Florida Int Univ, Dept Biol, Miami, FL 33199 USA
[7] Univ Delaware, Sch Marine Sci & Policy, Lewes, DE 19958 USA
[8] Univ Aix Marseille 2, URMITE, CNRS, UMR IRD 6236, Marseille, France
[9] Massachusetts Inst Technol & Harvard, Broad Inst, Cambridge, MA USA
[10] Chinese Acad Sci, Inst Microbiol, Key Lab Pathogen Microbiol & Immunol, Beijing, Peoples R China
[11] MaRS Ctr, Ontario Inst Canc Res, Toronto, ON, Canada
[12] Univ Georgia, Coll Vet Med, Poultry Diagnost & Res Ctr, Athens, GA USA
[13] Univ Calif San Diego, Sch Med, San Diego, CA 92103 USA
Funding
National High Technology Research and Development Program of China (863 Program); U.S. National Science Foundation;
Keywords
SPECTRAL ABUNDANCE FACTORS; FIELD GEL-ELECTROPHORESIS; STATISTICAL SIGNIFICANCE; ONLINE TOOL; DIVERSITY; VIRUSES; VIRIOPLANKTON; INFORMATION; RESOURCE; PHAGES;
DOI
10.1371/journal.pcbi.1000593
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Metagenomic studies characterize both the composition and diversity of uncultured viral and microbial communities. BLAST-based comparisons have typically been used for such analyses; however, sampling biases, high percentages of unknown sequences, and the use of arbitrary thresholds to find significant similarities can decrease the accuracy and validity of estimates. Here, we present Genome relative Abundance and Average Size (GAAS), a complete software package that provides improved estimates of community composition and average genome length for metagenomes in both textual and graphical formats. GAAS implements a novel methodology to control for sampling bias via length normalization, to adjust for multiple BLAST similarities by similarity weighting, and to select significant similarities using relative alignment lengths. In benchmark tests, the GAAS method was robust to both high percentages of unknown sequences and to variations in metagenomic sequence read lengths. Re-analysis of the Sargasso Sea virome using GAAS indicated that standard methodologies for metagenomic analysis may dramatically underestimate the abundance and importance of organisms with small genomes in environmental systems. Using GAAS, we conducted a meta-analysis of microbial and viral average genome lengths in over 150 metagenomes from four biomes to determine whether genome lengths vary consistently between and within biomes, and between microbial and viral communities from the same environment. Significant differences between biomes and within aquatic sub-biomes (oceans, hypersaline systems, freshwater, and microbialites) suggested that average genome length is a fundamental property of environments driven by factors at the sub-biome level. The behavior of paired viral and microbial metagenomes from the same environment indicated that microbial and viral average genome sizes are independent of each other, but indicative of community responses to stressors and environmental conditions.
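The three corrections named in the abstract (filtering by relative alignment length, similarity weighting of multiple hits, and genome length normalization) can be illustrated with a minimal Python sketch. This is not the GAAS implementation: the `Hit` record, the generic `score` field, the proportional weighting scheme, and the `min_rel_align` threshold of 0.6 are all assumptions made for illustration, and the abstract does not specify how GAAS derives its weights or cutoffs.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Hit:
    """One similarity between a metagenomic read and a reference genome.

    All field names are hypothetical; `score` stands in for whatever
    similarity measure is used (higher means more similar).
    """
    read_id: str
    genome: str
    score: float
    align_len: int
    read_len: int


def estimate_composition(hits, genome_lengths, min_rel_align=0.6):
    """Return (relative abundance per genome, average genome length)."""
    # 1. Keep only hits whose alignment covers enough of the read
    #    (relative alignment length = alignment length / read length).
    by_read = defaultdict(list)
    for h in hits:
        if h.read_len > 0 and h.align_len / h.read_len >= min_rel_align:
            by_read[h.read_id].append(h)

    # 2. Similarity weighting: each read contributes a total weight of 1,
    #    split across its hits in proportion to their scores.
    raw = defaultdict(float)
    for read_hits in by_read.values():
        total = sum(h.score for h in read_hits)
        if total <= 0:
            continue
        for h in read_hits:
            raw[h.genome] += h.score / total

    # 3. Length normalization: a longer genome yields more reads per copy,
    #    so divide each genome's weight by its length, then renormalize.
    per_length = {g: w / genome_lengths[g] for g, w in raw.items()}
    norm = sum(per_length.values())
    if norm == 0:
        return {}, 0.0
    abundance = {g: v / norm for g, v in per_length.items()}

    # 4. Average genome length: abundance-weighted mean of genome lengths.
    avg_len = sum(abundance[g] * genome_lengths[g] for g in abundance)
    return abundance, avg_len


if __name__ == "__main__":
    lengths = {"small": 5_000, "large": 50_000}
    demo_hits = [
        Hit("r1", "small", 100.0, 90, 100),
        Hit("r2", "large", 100.0, 90, 100),
    ]
    print(estimate_composition(demo_hits, lengths))
```

In this toy setup, one read from each genome does not imply equal abundance: after dividing by genome length, the small genome receives the larger share, which mirrors the abstract's point that uncorrected hit counts may underestimate organisms with small genomes.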
Pages: 10