Error, bias, and long-branch attraction in data for two chloroplast photosystem genes in seed plants

Cited by: 152
Authors
Sanderson, MJ
Wojciechowski, MF
Hu, JM
Khan, TS
Brady, SG
Affiliations
[1] Univ Calif Davis, Sect Evolut & Ecol, Davis, CA 95616 USA
[2] Univ Calif Berkeley, Jepson Herbaria, Berkeley, CA 94720 USA
[3] Univ Calif Berkeley, Museum Paleontol, Berkeley, CA 94720 USA
[4] Univ Calif Davis, Ctr Populat Biol, Davis, CA 95616 USA
Keywords
statistical consistency; maximum likelihood; parsimony
DOI
10.1093/oxfordjournals.molbev.a026357
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
Sequences of two chloroplast photosystem genes, psaA and psbB, together comprising about 3,500 bp, were obtained for all five major groups of extant seed plants and several outgroups among other vascular plants. Strongly supported, but significantly conflicting, phylogenetic signals were obtained in parsimony analyses from partitions of the data into first and second codon positions versus third positions. In the former, both genes agreed on monophyletic gymnosperms, with Gnetales closely related to certain conifers. In the latter, Gnetales are inferred to be the sister group of all other seed plants, with gymnosperms paraphyletic. None of the data supported the modern "anthophyte hypothesis," which places Gnetales as the sister group of flowering plants. A series of simulation studies was undertaken to examine the error rate for parsimony inference. Three kinds of errors were examined: random error, systematic bias (both properties of finite data sets), and statistical inconsistency owing to long-branch attraction (an asymptotic property). Parsimony reconstructions were extremely biased for third-position data for psbB. Regardless of the true underlying tree, a tree in which Gnetales are sister to all other seed plants was likely to be reconstructed for these data. None of the combinations of genes or partitions permits the anthophyte tree to be reconstructed with high probability. Simulations of progressively larger data sets indicate the existence of long-branch attraction (statistical inconsistency) for third-position psbB data if either the anthophyte tree or the gymnosperm tree is correct. This is also true for the anthophyte tree using either psaA third positions or psbB first and second positions. A factor contributing to bias and inconsistency is extremely short branches at the base of the seed plant radiation, coupled with extremely high rates in Gnetales and nonseed plant outgroups.
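The distinction the abstract draws between finite-sample error and long-branch attraction as an asymptotic property can be illustrated with a toy simulation. The sketch below is not the authors' procedure: it assumes a four-taxon tree ((A,B),(C,D)), a symmetric two-state substitution model, and hypothetical per-branch change probabilities (`p_long` for the branches to A and C, `p_short` for all other branches). All function names and parameter values are chosen for illustration only.

```python
import random

# The three possible unrooted topologies for four taxa; "AB|CD" is the true tree.
TOPOLOGIES = ("AB|CD", "AC|BD", "AD|BC")


def simulate_pattern_counts(n_sites, p_long, p_short, rng):
    """Count parsimony-informative site patterns on the true tree ((A,B),(C,D)).

    Assumed model: two character states, symmetric change; the branches
    leading to A and C change with probability p_long, all others
    (B, D, and the internal branch) with probability p_short.
    """
    counts = dict.fromkeys(TOPOLOGIES, 0)
    for _ in range(n_sites):
        u = rng.randrange(2)                 # state at internal node joining A and B
        v = u ^ (rng.random() < p_short)     # state at internal node joining C and D
        a = u ^ (rng.random() < p_long)      # long branch
        b = u ^ (rng.random() < p_short)
        c = v ^ (rng.random() < p_long)      # long branch
        d = v ^ (rng.random() < p_short)
        if a == b and c == d and a != c:
            counts["AB|CD"] += 1
        elif a == c and b == d and a != b:
            counts["AC|BD"] += 1
        elif a == d and b == c and a != b:
            counts["AD|BC"] += 1
    return counts


def parsimony_choice(counts):
    """Return the topology supported by the most informative sites.

    For four taxa and binary characters this is the most parsimonious tree.
    Ties return None and are treated as a failure to recover the true tree.
    """
    best = max(counts.values())
    winners = [t for t in TOPOLOGIES if counts[t] == best]
    return winners[0] if len(winners) == 1 else None


def fraction_correct(n_sites, reps, p_long, p_short, seed=0):
    """Fraction of replicate data sets in which parsimony recovers AB|CD."""
    rng = random.Random(seed)
    hits = sum(
        parsimony_choice(simulate_pattern_counts(n_sites, p_long, p_short, rng)) == "AB|CD"
        for _ in range(reps)
    )
    return hits / reps


if __name__ == "__main__":
    for n in (50, 500, 5000):
        unequal = fraction_correct(n, 100, p_long=0.4, p_short=0.05)
        equal = fraction_correct(n, 100, p_long=0.05, p_short=0.05)
        print(f"{n:>5} sites  unequal rates: {unequal:.2f}  equal rates: {equal:.2f}")
```

Under these assumed settings, adding sites does not rescue parsimony when two non-sister branches are long: the grouping of A with C is expected to win more often as the data set grows, which is the signature of inconsistency rather than random error. With equal short branches the true tree is recovered reliably.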
Pages: 782-797
Page count: 16