An Automated Phylogenetic Tree-Based Small Subunit rRNA Taxonomy and Alignment Pipeline (STAP)

被引:37
作者
Wu, Dongying [1 ]
Hartman, Amber [1 ,6 ]
Ward, Naomi [4 ,5 ]
Eisen, Jonathan A. [1 ,2 ,3 ]
机构
[1] Univ Calif Davis, UC Davis Genome Ctr, Davis, CA 95616 USA
[2] Univ Calif Davis, Coll Biol Sci, Sect Evol & Ecol, Davis, CA USA
[3] Univ Calif Davis, Sch Med, Dept Med Microbiol & Immunol, Davis, CA USA
[4] Univ Wyoming, Dept Mol Biol, Laramie, WY USA
[5] Ctr Marine Biotechnol, Baltimore, MD USA
[6] Johns Hopkins Univ, Dept Biol, Baltimore, MD USA
来源
PLOS ONE | 2008年 / 3卷 / 07期
基金
美国国家科学基金会;
关键词
D O I
10.1371/journal.pone.0002566
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Comparative analysis of small-subunit ribosomal RNA (ss-rRNA) gene sequences forms the basis for much of what we know about the phylogenetic diversity of both cultured and uncultured microorganisms. As sequencing costs continue to decline and throughput increases, sequences of ss-rRNA genes are being obtained at an ever-increasing rate. This increasing flow of data has opened many new windows into microbial diversity and evolution, and at the same time has created significant methodological challenges. Those processes which commonly require time-consuming human intervention, such as the preparation of multiple sequence alignments, simply cannot keep up with the flood of incoming data. Fully automated methods of analysis are needed. Notably, existing automated methods avoid one or more steps that, though computationally costly or difficult, we consider to be important. In particular, we regard both the building of multiple sequence alignments and the performance of high quality phylogenetic analysis to be necessary. We describe here our fully-automated ss-rRNA taxonomy and alignment pipeline (STAP). It generates both high-quality multiple sequence alignments and phylogenetic trees, and thus can be used for multiple purposes including phylogenetically-based taxonomic assignments and analysis of species diversity in environmental samples. The pipeline combines publicly-available packages (PHYML, BLASTN and CLUSTALW) with our automatic alignment, masking, and tree-parsing programs. Most importantly, this automated process yields results comparable to those achievable by manual analysis, yet offers speed and capacity that are unattainable by manual efforts.
引用
收藏
页数:10
相关论文
共 46 条
[11]   Phylogenomics: Improving functional predictions for uncharacterized genes by evolutionary analysis [J].
Eisen, JA .
GENOME RESEARCH, 1998, 8 (03) :163-167
[12]   Horizontal gene transfer among microbial genomes: new insights from complete genome analysis [J].
Eisen, JA .
CURRENT OPINION IN GENETICS & DEVELOPMENT, 2000, 10 (06) :606-611
[13]   Environmental shotgun sequencing: Its potential and challenges for studying the hidden world of microbes [J].
Eisen, Jonathan A. .
PLOS BIOLOGY, 2007, 5 (03) :384-388
[14]  
Fisher MM, 1999, APPL ENVIRON MICROB, V65, P4630
[15]   Re-evaluating prokaryotic species [J].
Gevers, D ;
Cohan, FM ;
Lawrence, JG ;
Spratt, BG ;
Coenye, T ;
Feil, EJ ;
Stackebrandt, E ;
Van de Peer, Y ;
Vandamme, P ;
Thompson, FL ;
Swings, J .
NATURE REVIEWS MICROBIOLOGY, 2005, 3 (09) :733-739
[16]   A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood [J].
Guindon, S ;
Gascuel, O .
SYSTEMATIC BIOLOGY, 2003, 52 (05) :696-704
[17]   Impact of culture-independent studies on the emerging phylogenetic view of bacterial diversity [J].
Hugenholtz, P ;
Goebel, BM ;
Pace, NR .
JOURNAL OF BACTERIOLOGY, 1998, 180 (18) :4765-4774
[18]   Counting the uncountable: Statistical approaches to estimating microbial diversity [J].
Hughes, JB ;
Hellmann, JJ ;
Ricketts, TH ;
Bohannan, BJM .
APPLIED AND ENVIRONMENTAL MICROBIOLOGY, 2001, 67 (10) :4399-4406
[19]   Recommended standards for the description of new species of anoxygenic phototrophic bacteria [J].
Imhoff, JF ;
Caumette, P .
INTERNATIONAL JOURNAL OF SYSTEMATIC AND EVOLUTIONARY MICROBIOLOGY, 2004, 54 :1415-1421
[20]   The closest BLAST hit is often not the nearest neighbor [J].
Koski, LB ;
Golding, GB .
JOURNAL OF MOLECULAR EVOLUTION, 2001, 52 (06) :540-542