Self-organizing and self-correcting classifications of biological data

被引:9
作者
Garrity, GM [1 ]
Lilburn, TG
机构
[1] Michigan State Univ, Dept Microbiol & Mol Genet, E Lansing, MI 48824 USA
[2] Amer Type Culture Collect, Sci Informat Syst, Manassas, VA USA
关键词
D O I
10.1093/bioinformatics/bti346
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Rapid, automated means of organizing biological data are required if we hope to keep abreast of the flood of data emanating from sequencing, microarray and similar high-throughput analyses. Faced with the need to validate the annotation of thousands of sequences and to generate biologically meaningful classifications based on the sequence data, we turned to statistical methods in order to automate these processes. Results: An algorithm for automated classification based on evolutionary distance data was written in S. The algorithm was tested on a dataset of 1436 small subunit ribosomal RNA sequences and was able to classify the sequences according to an extant scheme, use statistical measurements of group membership to detect sequences that were misclassified within this scheme and produce a new classification. In this study, the use of the algorithm to address problems in prokaryotic taxonomy is discussed.
引用
收藏
页码:2309 / 2314
页数:6
相关论文
共 10 条
  • [1] [Anonymous], WFCC NEWSL
  • [2] Becker R., 1988, The new S language
  • [3] The Ribosomal Database Project (RDP-II): previewing a new autoaligner that allows regular updates and the new prokaryotic taxonomy
    Cole, JR
    Chai, B
    Marsh, TL
    Farris, RJ
    Wang, Q
    Kulam, SA
    Chandra, S
    McGarrell, DM
    Schmidt, TM
    Garrity, GM
    Tiedje, JM
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (01) : 442 - 443
  • [4] Garrity G.M., 2001, BERGEYS MANUAL SYSTE
  • [5] GARRITY GM, 2003, BERGEYS MANUAL SYSTE, DOI DOI 10.1007/BERGEYSOUTLINE
  • [6] Exploring prokaryotic taxonomy
    Lilburn, TG
    Garrity, GM
    [J]. INTERNATIONAL JOURNAL OF SYSTEMATIC AND EVOLUTIONARY MICROBIOLOGY, 2004, 54 : 7 - 13
  • [7] Paradis E, 2004, BIOINFORMATICS, V20, P289, DOI [10.1093/bioinformatics/bty633, 10.1093/bioinformatics/btg412]
  • [8] A phylogenomic approach to microbial evolution
    Sicheritz-Pontén, T
    Andersson, SGE
    [J]. NUCLEIC ACIDS RESEARCH, 2001, 29 (02) : 545 - 552
  • [9] Sneath P. H. A., 1973, NUMERICAL TAXONOMY
  • [10] Venables WN, 2002, INTRO R