Clustering 16S rRNA for OTU prediction: a method of unsupervised Bayesian clustering
Cited by: 190
Authors:

Hao, Xiaolin
Affiliations:
Univ So Calif, Dept Biol, Mol & Computat Biol Program, Los Angeles, CA 90089 USA

Jiang, Rui
Affiliations:
Tsinghua Univ, Dept Automat, MOE Key Lab Bioinformat, TNLIST, Beijing 100084, Peoples R China
Tsinghua Univ, Dept Automat, TNLIST, Bioinformat Div, Beijing 100084, Peoples R China

Chen, Ting
Affiliations:
Univ So Calif, Dept Biol, Mol & Computat Biol Program, Los Angeles, CA 90089 USA
Affiliations:
[1] Univ So Calif, Dept Biol, Mol & Computat Biol Program, Los Angeles, CA 90089 USA
[2] Tsinghua Univ, Dept Automat, MOE Key Lab Bioinformat, TNLIST, Beijing 100084, Peoples R China
[3] Tsinghua Univ, Dept Automat, TNLIST, Bioinformat Div, Beijing 100084, Peoples R China
Funding:
US National Institutes of Health;
US National Science Foundation;
Keywords:
MULTIPLE SEQUENCE ALIGNMENT;
MICROBIAL DIVERSITY;
UNKNOWN NUMBER;
RARE BIOSPHERE;
COMPONENTS;
SEARCH;
DOI:
10.1093/bioinformatics/btq725
Chinese Library Classification (CLC) number:
Q5 [Biochemistry];
Discipline classification codes:
071010 ;
081704 ;
Abstract:
Motivation: With the advancements of next-generation sequencing technology, it is now possible to study samples directly obtained from the environment. Particularly, 16S rRNA gene sequences have been frequently used to profile the diversity of organisms in a sample. However, such studies are still taxed to determine both the number of operational taxonomic units (OTUs) and their relative abundance in a sample. Results: To address these challenges, we propose an unsupervised Bayesian clustering method termed Clustering 16S rRNA for OTU Prediction (CROP). CROP can find clusters based on the natural organization of data without setting a hard cut-off threshold (3%/5%) as required by hierarchical clustering methods. By applying our method to several datasets, we demonstrate that CROP is robust against sequencing errors and that it produces more accurate results than conventional hierarchical clustering methods.
Pages: 611-618
Number of pages: 8
Related papers
23 items in total
[1] Efficient functional clustering of protein sequences using the Dirichlet process [J]. Brown, Duncan P. BIOINFORMATICS, 2008, 24 (16): 1765-1771

Brown, Duncan P.
Affiliations:
Univ Calif Berkeley, Dept Bioengn, San Francisco, CA 94158 USA
Merck & Co Inc, San Francisco, CA 94158 USA
[2] The Ribosomal Database Project: improved alignments and new tools for rRNA analysis [J]. Cole, J. R.; Wang, Q.; Cardenas, E.; Fish, J.; Chai, B.; Farris, R. J.; Kulam-Syed-Mohideen, A. S.; McGarrell, D. M.; Marsh, T.; Garrity, G. M.; Tiedje, J. M. NUCLEIC ACIDS RESEARCH, 2009, 37: D141-D145

Cole, J. R.
Affiliations:
Michigan State Univ, Ctr Microbial Ecol, E Lansing, MI 48824 USA

Wang, Q.
Affiliations:
Michigan State Univ, Ctr Microbial Ecol, E Lansing, MI 48824 USA

Cardenas, E.
Affiliations:
Michigan State Univ, Ctr Microbial Ecol, E Lansing, MI 48824 USA

Fish, J.
Affiliations:
Michigan State Univ, Dept Microbiol & Mol Genet, E Lansing, MI 48824 USA

Chai, B.
Affiliations:
Michigan State Univ, Ctr Microbial Ecol, E Lansing, MI 48824 USA

Farris, R. J.
Affiliations:
Michigan State Univ, Ctr Microbial Ecol, E Lansing, MI 48824 USA

Kulam-Syed-Mohideen, A. S.
Affiliations:
Michigan State Univ, Ctr Microbial Ecol, E Lansing, MI 48824 USA

McGarrell, D. M.
Affiliations:
Michigan State Univ, Ctr Microbial Ecol, E Lansing, MI 48824 USA

Garrity, G. M.
Affiliations:
Michigan State Univ, Dept Microbiol & Mol Genet, E Lansing, MI 48824 USA

Tiedje, J. M.
Affiliations:
Michigan State Univ, Ctr Microbial Ecol, E Lansing, MI 48824 USA
Michigan State Univ, Dept Microbiol & Mol Genet, E Lansing, MI 48824 USA
[3] Bacterial Community Variation in Human Body Habitats Across Space and Time [J]. Costello, Elizabeth K.; Lauber, Christian L.; Hamady, Micah; Fierer, Noah; Gordon, Jeffrey I.; Knight, Rob. SCIENCE, 2009, 326 (5960): 1694-1697

Costello, Elizabeth K.
Affiliations:
Univ Colorado, Dept Chem & Biochem, Boulder, CO 80309 USA

Lauber, Christian L.
Affiliations:
Univ Colorado, Cooperat Inst Res Environm Sci, Boulder, CO 80309 USA

Hamady, Micah
Affiliations:
Univ Colorado, Dept Comp Sci, Boulder, CO 80309 USA

Fierer, Noah
Affiliations:
Univ Colorado, Cooperat Inst Res Environm Sci, Boulder, CO 80309 USA
Univ Colorado, Dept Ecol & Evolutionary Biol, Boulder, CO 80309 USA

Gordon, Jeffrey I.
Affiliations:
Washington Univ, Sch Med, Ctr Genome Sci, St Louis, MO 63108 USA

Knight, Rob
Affiliations:
Univ Colorado, Dept Chem & Biochem, Boulder, CO 80309 USA
Howard Hughes Med Inst, Chevy Chase, MD USA
[4] NAST: a multiple sequence alignment server for comparative analysis of 16S rRNA genes [J]. DeSantis, T. Z.; Hugenholtz, P.; Keller, K.; Brodie, E. L.; Larsen, N.; Piceno, Y. M.; Phan, R.; Andersen, G. L. NUCLEIC ACIDS RESEARCH, 2006, 34: W394-W399

DeSantis, T. Z.
Affiliations:
Univ Calif Berkeley, Lawrence Berkeley Lab, Ctr Environm Biotechnol, Berkeley, CA 94720 USA

Hugenholtz, P.
Affiliations:
Univ Calif Berkeley, Lawrence Berkeley Lab, Ctr Environm Biotechnol, Berkeley, CA 94720 USA

Keller, K.
Affiliations:
Univ Calif Berkeley, Lawrence Berkeley Lab, Ctr Environm Biotechnol, Berkeley, CA 94720 USA

Brodie, E. L.
Affiliations:
Univ Calif Berkeley, Lawrence Berkeley Lab, Ctr Environm Biotechnol, Berkeley, CA 94720 USA

Larsen, N.
Affiliations:
Univ Calif Berkeley, Lawrence Berkeley Lab, Ctr Environm Biotechnol, Berkeley, CA 94720 USA

Piceno, Y. M.
Affiliations:
Univ Calif Berkeley, Lawrence Berkeley Lab, Ctr Environm Biotechnol, Berkeley, CA 94720 USA

Phan, R.
Affiliations:
Univ Calif Berkeley, Lawrence Berkeley Lab, Ctr Environm Biotechnol, Berkeley, CA 94720 USA

Andersen, G. L.
Affiliations:
Univ Calif Berkeley, Lawrence Berkeley Lab, Ctr Environm Biotechnol, Berkeley, CA 94720 USA
[5] Environmental shotgun sequencing: Its potential and challenges for studying the hidden world of microbes [J]. Eisen, Jonathan A. PLOS BIOLOGY, 2007, 5 (03): 384-388

Eisen, Jonathan A.
Affiliations:
Univ Calif Davis, Genome Ctr, Sect Evolut & Ecol, Dept Med Microbiol & Immunol, Davis, CA 95616 USA
[6] An efficient algorithm for large-scale detection of protein families [J]. Enright, AJ; Van Dongen, S; Ouzounis, CA. NUCLEIC ACIDS RESEARCH, 2002, 30 (07): 1575-1584

Enright, AJ
Affiliations:
European Bioinformat Inst, EMBL Cambridge Outstn, Computat Gen Grp, Cambridge CB10 1SD, England

Van Dongen, S
Affiliations:
European Bioinformat Inst, EMBL Cambridge Outstn, Computat Gen Grp, Cambridge CB10 1SD, England

Ouzounis, CA
Affiliations:
European Bioinformat Inst, EMBL Cambridge Outstn, Computat Gen Grp, Cambridge CB10 1SD, England
[7] Topographical and Temporal Diversity of the Human Skin Microbiome [J]. Grice, Elizabeth A.; Kong, Heidi H.; Conlan, Sean; Deming, Clayton B.; Davis, Joie; Young, Alice C.; Bouffard, Gerard G.; Blakesley, Robert W.; Murray, Patrick R.; Green, Eric D.; Turner, Maria L.; Segre, Julia A. SCIENCE, 2009, 324 (5931): 1190-1192

Grice, Elizabeth A.
Affiliations:
NHGRI, Genet & Mol Biol Branch, Bethesda, MD 20892 USA

Kong, Heidi H.
Affiliations:
NCI, Dermatol Branch, Ctr Canc Res, Bethesda, MD 20892 USA

Conlan, Sean
Affiliations:
NHGRI, Genet & Mol Biol Branch, Bethesda, MD 20892 USA

Deming, Clayton B.
Affiliations:
NHGRI, Genet & Mol Biol Branch, Bethesda, MD 20892 USA

Davis, Joie
Affiliations:
NHGRI, Off Translat Res, Bethesda, MD 20892 USA

Young, Alice C.
Affiliations:
NHGRI, NIH, Intramural Sequencing Ctr, Bethesda, MD 20892 USA

Bouffard, Gerard G.
Affiliations:
NHGRI, NIH, Intramural Sequencing Ctr, Bethesda, MD 20892 USA
NHGRI, Genome Technol Branch, Bethesda, MD 20892 USA

Blakesley, Robert W.
Affiliations:
NHGRI, NIH, Intramural Sequencing Ctr, Bethesda, MD 20892 USA
NHGRI, Genome Technol Branch, Bethesda, MD 20892 USA

Murray, Patrick R.
Affiliations:
NIH, Clin Microbiol Lab, Dept Lab Med, Ctr Clin, Bethesda, MD 20892 USA

Green, Eric D.
Affiliations:
NHGRI, NIH, Intramural Sequencing Ctr, Bethesda, MD 20892 USA
NHGRI, Genome Technol Branch, Bethesda, MD 20892 USA

Turner, Maria L.
Affiliations:
NCI, Dermatol Branch, Ctr Canc Res, Bethesda, MD 20892 USA

Segre, Julia A.
Affiliations:
NHGRI, Genet & Mol Biol Branch, Bethesda, MD 20892 USA
[8] Accuracy and quality of massively parallel DNA pyrosequencing [J]. Huse, Susan M.; Huber, Julie A.; Morrison, Hilary G.; Sogin, Mitchell L.; Mark Welch, David. GENOME BIOLOGY, 2007, 8 (07)

Huse, Susan M.
Affiliations:
Josephine Bay Paul Ctr, Marine Biol Lab, Woods Hole, MA 02543 USA

Huber, Julie A.
Affiliations:
Josephine Bay Paul Ctr, Marine Biol Lab, Woods Hole, MA 02543 USA

Morrison, Hilary G.
Affiliations:
Josephine Bay Paul Ctr, Marine Biol Lab, Woods Hole, MA 02543 USA

Sogin, Mitchell L.
Affiliations:
Josephine Bay Paul Ctr, Marine Biol Lab, Woods Hole, MA 02543 USA

Mark Welch, David
Affiliations:
Josephine Bay Paul Ctr, Marine Biol Lab, Woods Hole, MA 02543 USA
[9] Ironing out the wrinkles in the rare biosphere through improved OTU clustering [J]. Huse, Susan M.; Welch, David Mark; Morrison, Hilary G.; Sogin, Mitchell L. ENVIRONMENTAL MICROBIOLOGY, 2010, 12 (07): 1889-1898

Huse, Susan M.
Affiliations:
Marine Biol Lab, Josephine Bay Paul Ctr, Woods Hole, MA 02543 USA

Welch, David Mark
Affiliations:
Marine Biol Lab, Josephine Bay Paul Ctr, Woods Hole, MA 02543 USA

Morrison, Hilary G.
Affiliations:
Marine Biol Lab, Josephine Bay Paul Ctr, Woods Hole, MA 02543 USA

Sogin, Mitchell L.
Affiliations:
Marine Biol Lab, Josephine Bay Paul Ctr, Woods Hole, MA 02543 USA
[10] HIERARCHICAL CLUSTERING SCHEMES [J]. JOHNSON, SC. PSYCHOMETRIKA, 1967, 32 (03): 241-254

JOHNSON, SC
Affiliations:
BELL TEL LAB, MURRAY HILL, NJ USA