Missing data, incomplete taxa, and phylogenetic accuracy

被引:509
作者
Wiens, JJ [1 ]
机构
[1] SUNY Stony Brook, Dept Ecol & Evolut, Stony Brook, NY 11794 USA
关键词
combining data; computer simulations; fossils; incomplete taxa; missing data; phylogenetic accuracy; supertrees;
D O I
10.1080/10635150390218330
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The problem of missing data is often considered to be the most important obstacle in reconstructing the phylogeny of fossil taxa and in combining data from diverse characters and taxa for phylogenetic analysis. Empirical and theoretical studies show that including highly incomplete taxa can lead to multiple equally parsimonious trees, poorly resolved consensus trees, and decreased phylogenetic accuracy. However, the mechanisms that cause incomplete taxa to be problematic have remained unclear. It has been widely assumed that incomplete taxa are problematic because of the proportion or amount of missing data that they bear. In this study, I use simulations to show that the reduced accuracy associated with including incomplete taxa is caused by these taxa bearing too few complete characters rather than too many missing data cells. This seemingly subtle distinction has a number of important implications. First, the so-called missing data problem for incomplete taxa is, paradoxically, not directly related to their amount or proportion of missing data. Thus, the level of completeness alone should not guide the exclusion of taxa (contrary to common practice), and these results may explain why empirical studies have sometimes found little relationship between the completeness of a taxon and its impact on an analysis. These results also (1) suggest a more effective strategy for dealing with incomplete taxa, (2) call into question a justification of the controversial phylogenetic supertree approach, and (3) show the potential for the accurate phylogenetic placement of highly incomplete taxa, both when combining diverse data sets and when analyzing relationships of fossil taxa.
引用
收藏
页码:528 / 538
页数:11
相关论文
共 32 条
[1]  
Anderson JS, 2001, SYST BIOL, V50, P170
[2]  
Ax P., 1987, PHYLOGENETIC SYSTEM
[3]   Assessment of the accuracy of matrix representation with parsimony analysis supertree construction [J].
Bininda-Emonds, ORP ;
Sanderson, MJ .
SYSTEMATIC BIOLOGY, 2001, 50 (04) :565-579
[4]   THE IMPORTANCE OF FOSSILS IN PHYLOGENY RECONSTRUCTION [J].
DONOGHUE, MJ ;
DOYLE, JA ;
GAUTHIER, J ;
KLUGE, AG ;
ROWE, T .
ANNUAL REVIEW OF ECOLOGY AND SYSTEMATICS, 1989, 20 :431-460
[5]   Phylogeny of the trilobite subgenus Acanthopyge (Lobopyge) [J].
Ebach, MC ;
Ahyong, ST .
CLADISTICS-THE INTERNATIONAL JOURNAL OF THE WILLI HENNIG SOCIETY, 2001, 17 (01) :1-10
[6]  
Gao Keqin, 1998, American Museum Novitates, V3230, P1
[7]   Resolution of a supertree/supermatrix paradox [J].
Gatesy, J ;
Matthee, C ;
DeSalle, R ;
Hayashi, C .
SYSTEMATIC BIOLOGY, 2002, 51 (04) :652-664
[8]   AMNIOTE PHYLOGENY AND THE IMPORTANCE OF FOSSILS [J].
GAUTHIER, J ;
KLUGE, AG ;
ROWE, T .
CLADISTICS, 1988, 4 (02) :105-209
[9]  
Gauthier J., 1986, Memoirs of the California academy of Science, V8, P1
[10]   A comprehensive phylogenetic study of amiid fishes (Amiidae) based on comparative skeletal anatomy. An empirical search for interconnected patterns of natural history [J].
Grande, L ;
Bemis, WE .
JOURNAL OF VERTEBRATE PALEONTOLOGY, 1998, 18 (01) :1-+