A comparison of algorithms for the identification of specimens using DNA barcodes: examples from gymnosperms

被引:132
作者
Little, Damon P. [1 ]
Stevenson, Dennis Wm. [1 ]
机构
[1] New York Bot Garden, Lewis B & Dorothy Cullman Program Mol Systemat St, Bronx, NY 10458 USA
关键词
D O I
10.1111/j.1096-0031.2006.00126.x
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In order to use DNA sequences for specimen identification (e.g., barcoding, fingerprinting) an algorithm to compare query sequences with a reference database is needed. Precision and accuracy of query sequence identification was estimated for hierarchical clustering (parsimony and neighbor joining), similarity methods (BLAST, BLAT and megaBLAST), combined clustering/similarity methods (BLAST/parsimony and BLAST/neighbor joining), diagnostic methods (DNA-BAR and DOME ID), and a new method (ATIM). We offer two novel alignment-free algorithmic solutions (DOME ID and ATIM) to identify query sequences for the purposes of DNA barcoding. Publicly available gymnosperm nrITS 2 and plastid matK sequences were used as test data sets. On the test data sets, almost all of the methods were able to accurately identify sequences to genus; however, no method was able to accurately identify query sequences to species at a frequency that would be considered useful for routine specimen identification (42-71% unambiguously correct). Clustering methods performed the worst (perhaps due to alignment issues). Similarity methods, ATIM, DNA-BAR, and DOME ID all performed at approximately the same level. Given the relative precision of the algorithms (median = 67% unambiguous), the low accuracy of species-level identification observed could be ascribed to the lack of correspondence between patterns of allelic similarity and species delimitations. Application of DNA barcoding to sequences of CITES listed cycads (Cycadopsida) provides an example of the potential application of DNA barcoding to enforcement of conservation laws. (c) The Willi Hennig Society 2006.
引用
收藏
页码:1 / 21
页数:21
相关论文
共 92 条
[61]   AN AMPLIFICATION OF THE PHYLOGENETIC SPECIES CONCEPT [J].
NIXON, KC ;
WHEELER, QD .
CLADISTICS-THE INTERNATIONAL JOURNAL OF THE WILLI HENNIG SOCIETY, 1990, 6 (03) :211-223
[62]   Making sense of EST sequences by CLOBBing them [J].
Parkinson, J ;
Guiliano, DB ;
Blaxter, M .
BMC BIOINFORMATICS, 2002, 3 (1)
[63]   IMPROVED TOOLS FOR BIOLOGICAL SEQUENCE COMPARISON [J].
PEARSON, WR ;
LIPMAN, DJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1988, 85 (08) :2444-2448
[64]   Molecular coproscopy:: Dung and diet of the extinct ground sloth Nothrotheriops shastensis [J].
Poinar, HN ;
Hofreiter, M ;
Spaulding, WG ;
Martin, PS ;
Stankiewicz, BA ;
Bland, H ;
Evershed, RP ;
Possnert, G ;
Pääbo, S .
SCIENCE, 1998, 281 (5375) :402-406
[65]   Comment on "Identifying spiders through DNA barcodes" [J].
Prendini, L .
CANADIAN JOURNAL OF ZOOLOGY, 2005, 83 (03) :498-504
[66]  
Rash S., 2002, P 6 ANN INT C COMP B, P254, DOI DOI 10.1145/565196.565229
[67]  
Raymond M, 1995, EVOLUTION, V49, P1280, DOI 10.1111/j.1558-5646.1995.tb04456.x
[68]   Vertical distribution of ectomycorrhizal fungal taxa in a podzol soil profile [J].
Rosling, A ;
Landeweert, R ;
Lindahl, BD ;
Larsson, KH ;
Kuyper, TW ;
Taylor, AFS ;
Finlay, RD .
NEW PHYTOLOGIST, 2003, 159 (03) :775-783
[69]   THE NEIGHBOR-JOINING METHOD - A NEW METHOD FOR RECONSTRUCTING PHYLOGENETIC TREES [J].
SAITOU, N ;
NEI, M .
MOLECULAR BIOLOGY AND EVOLUTION, 1987, 4 (04) :406-425
[70]   Applying DNA barcoding to red macroalgae: a preliminary appraisal holds promise for future applications [J].
Saunders, GW .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2005, 360 (1462) :1879-1888