Introducing DOTUR, a computer program for defining operational taxonomic units and estimating species richness

被引:2063
作者
Schloss, PD [1 ]
Handelsman, J [1 ]
机构
[1] Univ Wisconsin, Dept Plant Pathol, Madison, WI 53706 USA
关键词
D O I
10.1128/AEM.71.3.1501-1506.2005
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Although copious qualitative information describes the members of the diverse microbial communities on Earth, statistical approaches for quantifying and comparing the numbers and compositions of lineages in communities are lacking. We present a method that addresses the challenge of assigning sequences to operational taxonomic units (OTUs) based on the genetic distances between sequences. We developed a computer program, DOTUR, which assigns sequences to OTUs by using either the furthest, average, or nearest neighbor algorithm for each distance level. DOTUR uses the frequency at which each OTU is observed to construct rarefaction and collector's curves for various measures of richness and diversity. We analyzed 16S rRNA gene libraries derived from Scottish and Amazonian soils and the Sargasso Sea with DOTUR, which assigned sequences to OTUs rapidly and reliably based on the genetic distances between sequences and identified previous inconsistencies and errors in assigning sequences to OTUs. An analysis of the two 16S rRNA gene libraries from soil demonstrated that they do not contain enough sequences to support a claim that they contain different numbers of bacterial lineages with statistical confidence (P > 0.05), nor do they contain enough sequences to provide a robust estimate of species richness when an OTU is defined as containing sequences that are no more than 3% different from each other. In contrast, the richness of OTUs at the 3% level in the Sargasso Sea collection began to plateau after the sampling of 690 sequences. We anticipate that an equivalent extent of sampling for soil would require sampling more than 10,000 sequences, almost 100 times the size of typical sequence collections obtained from soil.
引用
收藏
页码:1501 / 1506
页数:6
相关论文
共 30 条
[21]   A computer-simulated restriction fragment length polymorphism analysis of bacterial small-subunit rRNA genes: Efficacy of selected tetrameric restriction enzymes for studies of microbial diversity in nature? [J].
Moyer, CL ;
Tiedje, JM ;
Dobbs, FC ;
Karl, DM .
APPLIED AND ENVIRONMENTAL MICROBIOLOGY, 1996, 62 (07) :2501-2507
[22]   THE RECONSTRUCTED EVOLUTIONARY PROCESS [J].
NEE, S ;
MAY, RM ;
HARVEY, PH .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY OF LONDON SERIES B-BIOLOGICAL SCIENCES, 1994, 344 (1309) :305-311
[23]   Cultivation of globally distributed soil bacteria from phylogenetic lineages previously only detected in cultivation-independent surveys [J].
Sait, M ;
Hugenholtz, P ;
Janssen, PH .
ENVIRONMENTAL MICROBIOLOGY, 2002, 4 (11) :654-666
[24]   Integration of microbial ecology and statistics: a test to compare gene libraries [J].
Schloss, PD ;
Larget, BR ;
Handelsman, J .
APPLIED AND ENVIRONMENTAL MICROBIOLOGY, 2004, 70 (09) :5485-5492
[25]   FastGroup: A program to dereplicate libraries of 16S rDNA sequences [J].
Seguritan, Victor ;
Rohwer, Forest .
BMC BIOINFORMATICS, 2001, 2 (1)
[26]   Quantitative comparisons of 16S rRNA gene sequence libraries from environmental samples [J].
Singleton, DR ;
Furlong, MA ;
Rathbun, SL ;
Whitman, WB .
APPLIED AND ENVIRONMENTAL MICROBIOLOGY, 2001, 67 (09) :4374-4376
[27]   NONPARAMETRIC-ESTIMATION OF SPECIES RICHNESS [J].
SMITH, EP ;
VANBELLE, G .
BIOMETRICS, 1984, 40 (01) :119-129
[28]   A PLACE FOR DNA-DNA REASSOCIATION AND 16S RIBOSOMAL-RNA SEQUENCE-ANALYSIS IN THE PRESENT SPECIES DEFINITION IN BACTERIOLOGY [J].
STACKEBRANDT, E ;
GOEBEL, BM .
INTERNATIONAL JOURNAL OF SYSTEMATIC BACTERIOLOGY, 1994, 44 (04) :846-849
[29]   HIGH DIVERSITY IN DNA OF SOIL BACTERIA [J].
TORSVIK, V ;
GOKSOYR, J ;
DAAE, FL .
APPLIED AND ENVIRONMENTAL MICROBIOLOGY, 1990, 56 (03) :782-787
[30]   Environmental genome shotgun sequencing of the Sargasso Sea [J].
Venter, JC ;
Remington, K ;
Heidelberg, JF ;
Halpern, AL ;
Rusch, D ;
Eisen, JA ;
Wu, DY ;
Paulsen, I ;
Nelson, KE ;
Nelson, W ;
Fouts, DE ;
Levy, S ;
Knap, AH ;
Lomas, MW ;
Nealson, K ;
White, O ;
Peterson, J ;
Hoffman, J ;
Parsons, R ;
Baden-Tillson, H ;
Pfannkoch, C ;
Rogers, YH ;
Smith, HO .
SCIENCE, 2004, 304 (5667) :66-74