Testing the reliability of genetic methods of species identification via simulation

被引:277
作者
Ross, Howard A. [1 ]
Murugan, Sumathi [1 ]
Li, Wai Lok Sibon [1 ]
机构
[1] Univ Auckland, Bioinformat Inst, Auckland 1142, New Zealand
关键词
BLAST; DNA barcoding; phylogenetic; species identification;
D O I
10.1080/10635150802032990
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Although genetic methods of species identification, especially DNA barcoding, are strongly debated, tests of these methods have been restricted to a few empirical cases for pragmatic reasons. Here we use simulation to test the performance of methods based on sequence comparison (BLAST and genetic distance) and tree topology over a wide range of evolutionary scenarios. Sequences were simulated on a range of gene trees spanning almost three orders of magnitude in tree depth and in coalescent depth; that is, deep or shallow trees with deep or shallow coalescences. When the query's conspecific sequences were included in the reference alignment, the rate of positive identification was related to the degree to which different species were genetically differentiated. The BLAST, distance, and liberal tree-based methods returned higher rates of correct identification than did the strict tree-based requirement that the query was within, but not sister to, a single-species clade. Under this more conservative approach, ambiguous outcomes occurred in inverse proportion to the number of reference sequences per species. When the query's conspecific sequences were not in the reference alignment, only the strict tree-based approach was relatively immune to making false-positive identifications. Thresholds affected the rates at which false-positive identifications were made when the query's species was unrepresented in the reference alignment but did not otherwise influence outcomes. A conservative approach using the strict tree-based method should be used initially in large-scale identification systems, with effort made to maximize sequence sampling within species. Once the genetic variation within a taxonomic group is well characterized and the taxonomy resolved, then the choice of method used should be dictated by considerations of computational efficiency. The requirement for extensive genetic sampling may render these techniques inappropriate in some circumstances.
引用
收藏
页码:216 / 230
页数:15
相关论文
共 47 条
[1]   A step toward barcoding life: A model-based, decision-theoretic method to assign genes to preexisting species groups [J].
Abdo, Zaid ;
Golding, G. Brian .
SYSTEMATIC BIOLOGY, 2007, 56 (01) :44-56
[2]   Comparative accuracy of methods for protein sequence similarity search [J].
Agarwal, P ;
States, DJ .
BIOINFORMATICS, 1998, 14 (01) :40-47
[3]   Searching DNA databases for similarities to DNA sequences: when is a match significant? [J].
Anderson, I ;
Brass, A .
BIOINFORMATICS, 1998, 14 (04) :349-356
[4]   Molecular genetic identification of whale and dolphin products from commercial markets in Korea and Japan [J].
Baker, CS ;
Cipriano, F ;
Palumbi, SR .
MOLECULAR ECOLOGY, 1996, 5 (05) :671-685
[5]   WHICH WHALES ARE HUNTED - A MOLECULAR-GENETIC APPROACH TO MONITORING WHALING [J].
BAKER, CS ;
PALUMBI, SR .
SCIENCE, 1994, 265 (5178) :1538-1539
[6]   Identifying spiders through DNA barcodes [J].
Barrett, RDH ;
Hebert, PDN .
CANADIAN JOURNAL OF ZOOLOGY, 2005, 83 (03) :481-491
[7]   The promise of a DNA taxonomy [J].
Blaxter, ML .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2004, 359 (1444) :669-679
[8]   Who will actually use DNA barcoding and what will it cost? [J].
Cameron, Stephen ;
Rubinoff, Daniel ;
Will, Kipling .
SYSTEMATIC BIOLOGY, 2006, 55 (05) :844-847
[9]   Land plants and DNA barcodes: short-term and long-term goals [J].
Chase, MW ;
Salamin, N ;
Wilkinson, M ;
Dunwell, JM ;
Kesanakurthi, RP ;
Haider, N ;
Savolainen, V .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2005, 360 (1462) :1889-1895
[10]   Molecular genetic identification of southern hemisphere beaked whales (Cetacea: Ziphiidae) [J].
Dalebout, ML ;
Van Helden, A ;
Van Waerebeek, K ;
Baker, CS .
MOLECULAR ECOLOGY, 1998, 7 (06) :687-694