Inferring angiosperm phylogeny from EST data with widespread gene duplication

被引:71
作者
Sanderson, Michael J. [1 ]
McMahon, Michelle M.
机构
[1] Univ Arizona, Dept Ecol & Evolutionary Biol, Tucson, AZ 85721 USA
[2] Univ Arizona, Dept Plant Sci, Tucson, AZ 85721 USA
关键词
D O I
10.1186/1471-2148-7-S1-S3
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Most studies inferring species phylogenies use sequences from single copy genes or sets of orthologs culled from gene families. For taxa such as plants, with very high levels of gene duplication in their nuclear genomes, this has limited the exploitation of nuclear sequences for phylogenetic studies, such as those available in large EST libraries. One rarely used method of inference, gene tree parsimony, can infer species trees from gene families undergoing duplication and loss, but its performance has not been evaluated at a phylogenomic scale for EST data in plants. Results: A gene tree parsimony analysis based on EST data was undertaken for six angiosperm model species and Pinus, an outgroup. Although a large fraction of the tentative consensus sequences obtained from the TIGR database of ESTs was assembled into homologous clusters too small to be phylogenetically informative, some 557 clusters contained promising levels of information. Based on maximum likelihood estimates of the gene trees obtained from these clusters, gene tree parsimony correctly inferred the accepted species tree with strong statistical support. A slight variant of this species tree was obtained when maximum parsimony was used to infer the individual gene trees instead. Conclusion: Despite the complexity of the EST data and the relatively small fraction eventually used in inferring a species tree, the gene tree parsimony method performed well in the face of very high apparent rates of duplication.
引用
收藏
页数:14
相关论文
共 77 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   Phylogeny of the New World diploid cottons (Gossypium L., Malvaceae) based on sequences of three low-copy nuclear genes [J].
Alvarez, I ;
Cronn, R ;
Wendel, JF .
PLANT SYSTEMATICS AND EVOLUTION, 2005, 252 (3-4) :199-214
[3]   Gene conversion and the evolution of euryarchaeal chaperonins: A maximum likelihood-based method for detecting conflicting phylogenetic signals [J].
Archibald, JM ;
Roger, AJ .
JOURNAL OF MOLECULAR EVOLUTION, 2002, 55 (02) :232-245
[4]   Bayesian gene/species tree reconciliation and orthology analysis using MCMC [J].
Arvestad, Lars ;
Berglund, Ann-Charlotte ;
Lagergren, Jens ;
Sennblad, Bengt .
BIOINFORMATICS, 2003, 19 :i7-i15
[5]   The analysis of 100 genes supports the grouping of three highly divergent amoebae:: Dictyostelium, Entamoeba, and Mastigamoeba [J].
Bapteste, E ;
Brinkmann, H ;
Lee, JA ;
Moore, DV ;
Sensen, CW ;
Gordon, P ;
Duruflé, L ;
Gaasterland, T ;
Lopez, P ;
Müller, M ;
Philippe, H .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (03) :1414-1419
[6]   The evolution of supertrees [J].
Bininda-Emonds, ORP .
TRENDS IN ECOLOGY & EVOLUTION, 2004, 19 (06) :315-322
[7]   Evolutionary sequence analysis of complete eukaryote genomes [J].
Blair, JE ;
Shah, P ;
Hedges, SB .
BMC BIOINFORMATICS, 2005, 6 (1)
[8]   A simple and robust statistical test for detecting the presence of recombination [J].
Bruen, TC ;
Philippe, H ;
Bryant, D .
GENETICS, 2006, 172 (04) :2665-2681
[9]  
CHANG WC, 2005, THESIS IOWA STATE U
[10]   NOTUNG: A program for dating gene duplications and optimizing gene family trees [J].
Chen, K ;
Durand, D ;
Farach-Colton, M .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2000, 7 (3-4) :429-447