Taxonomic Reliability of DNA Sequences in Public Sequence Databases: A Fungal Perspective

被引:480
作者
Nilsson, R. Henrik [1 ]
Ryberg, Martin [1 ]
Kristiansson, Erik [2 ]
Abarenkov, Kessy [3 ]
Larsson, Karl-Henrik [1 ]
Koljalg, Urmas [3 ]
机构
[1] Univ Gothenburg, Dept Plant & Environm Sci, Gothenburg, Sweden
[2] Chalmers Univ Technol, Dept Math Stat, S-41296 Gothenburg, Sweden
[3] Univ Tartu, Inst Bot & Ecol, EE-50090 Tartu, Estonia
来源
PLOS ONE | 2006年 / 1卷 / 01期
关键词
DIVERSITY; IDENTIFICATION; EXAMPLE;
D O I
10.1371/journal.pone.0000059
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background. DNA sequences are increasingly seen as one of the primary information sources for species identification in many organism groups. Such approaches, popularly known as barcoding, are underpinned by the assumption that the reference databases used for comparison are sufficiently complete and feature correctly and informatively annotated entries. Methodology/Principal Findings. The present study uses a large set of fungal DNA sequences from the inclusive International Nucleotide Sequence Database to show that the taxon sampling of fungi is far from complete, that about 20% of the entries may be incorrectly identified to species level, and that the majority of entries lack descriptive and up-to-date annotations. Conclusions. The problems with taxonomic reliability and insufficient annotations in public DNA repositories form a tangible obstacle to sequence-based species identification, and it is manifest that the greatest challenges to biological barcoding will be of taxonomical, rather than technical, nature.
引用
收藏
页数:4
相关论文
共 25 条
  • [1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [2] Ribosomal ITS sequences and plant phylogenetic inference
    Alvarez, I
    Wendel, JF
    [J]. MOLECULAR PHYLOGENETICS AND EVOLUTION, 2003, 29 (03) : 417 - 434
  • [3] Benson Dennis A, 2005, Nucleic Acids Res, V33, pD34
  • [4] Defining operational taxonomic units using DNA barcode data
    Blaxter, M
    Mann, J
    Chapman, T
    Thomas, F
    Whitton, C
    Floyd, R
    Abebe, E
    [J]. PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2005, 360 (1462) : 1935 - 1943
  • [5] On the unreliability of published DNA sequences
    Bridge, PD
    Roberts, PJ
    Spooner, BM
    Panchal, G
    [J]. NEW PHYTOLOGIST, 2003, 160 (01) : 43 - 48
  • [6] Bruns TD, 2004, CAN J BOT, V82, P1122, DOI [10.1139/b04-021, 10.1139/B04-021]
  • [7] Glomales rRNA gene diversity - all that glisten's is not necessarily glomalean?
    Clapp, JP
    Rodriguez, A
    Dodd, JC
    [J]. MYCORRHIZA, 2002, 12 (05) : 269 - 270
  • [8] What are bacterial species?
    Cohan, FM
    [J]. ANNUAL REVIEW OF MICROBIOLOGY, 2002, 56 : 457 - 487
  • [9] DNA barcoding is no substitute for taxonomy
    Ebach, MC
    Holdrege, C
    [J]. NATURE, 2005, 434 (7034) : 697 - 697
  • [10] Critical factors for assembling a high volume of DNA barcodes
    Hajibabaei, M
    DeWaard, JR
    Ivanova, NV
    Ratnasingham, S
    Dooh, RT
    Kirk, SL
    Mackie, PM
    Hebert, PDN
    [J]. PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2005, 360 (1462) : 1959 - 1967