Simultaneous Bayesian gene tree reconstruction and reconciliation analysis

被引:107
作者
Akerborg, Oerjan [1 ]
Sennblad, Bengt [2 ]
Arvestad, Lars [1 ]
Lagergren, Jens [1 ]
机构
[1] Royal Inst Technol, Sch Comp Sci & Commun, SE-10044 Stockholm, Sweden
[2] Stockholm Univ, Stockholm Bioinformat Ctr, AlbaNova, SE-10691 Stockholm, Sweden
基金
瑞典研究理事会;
关键词
comparative genomics; probabilistic modeling; genome evolution; yeast; ADH; PROTEIN DOMAINS; DIVERGENCE TIME; DNA-SEQUENCES; EVOLUTION; PHYLOGENY; DUPLICATION; DEATH; BIRTH; DISTRIBUTIONS; INFERENCE;
D O I
10.1073/pnas.0806251106
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
We present GSR, a probabilistic model integrating gene duplication, sequence evolution, and a relaxed molecular clock for substitution rates, that enables genomewide analysis of gene families. The gene duplication and loss process is a major cause for incongruence between gene and species tree, and deterministic methods have been developed to explain such differences through tree reconciliations. Although probabilistic methods for phylogenetic inference have been around for decades, probabilistic reconciliation methods are far less established. Based on our model, we have implemented a Bayesian analysis tool, PrIME-GSR, for gene tree inference that takes a known species tree into account. Our implementation is sound and we demonstrate its utility for genomewide gene-family analysis by applying it to recently presented yeast data. We validate PrIME-GSR by comparing with previous analyses of these data that take advantage of gene order information. In a case study we apply our method to the ADH gene family and are able to draw biologically relevant conclusions concerning gene duplications creating key yeast phenotypes. On a higher level this shows the biological relevance of our method. The obtained results demonstrate the value of a relaxed molecular clock. Our good performance will extend to species where gene order conservation is insufficient.
引用
收藏
页码:5714 / 5719
页数:6
相关论文
共 51 条
[1]   Birth-death prior on phylogeny and speed dating [J].
Akerborg, Orjan ;
Sennblad, Bengt ;
Lagergren, Jens .
BMC EVOLUTIONARY BIOLOGY, 2008, 8 (1)
[2]   Effects of models of rate evolution on estimation of divergence dates with special reference to the metazoan 18S ribosomal RNA Phylogeny [J].
Aris-Brosou, S ;
Yang, ZH .
SYSTEMATIC BIOLOGY, 2002, 51 (05) :703-714
[3]  
Arvestad L., 2004, P 8 ANN INT C RESAER, P326, DOI DOI 10.1145/974614.974657
[4]   Bayesian gene/species tree reconciliation and orthology analysis using MCMC [J].
Arvestad, Lars ;
Berglund, Ann-Charlotte ;
Lagergren, Jens ;
Sennblad, Bengt .
BIOINFORMATICS, 2003, 19 :i7-i15
[5]   The modern molecular clock [J].
Bromham, L ;
Penny, D .
NATURE REVIEWS GENETICS, 2003, 4 (03) :216-224
[6]   The Yeast Gene Order Browser: Combining curated homology and syntenic context reveals gene fate in polyploid species [J].
Byrne, KP ;
Wolfe, KH .
GENOME RESEARCH, 2005, 15 (10) :1456-1461
[7]   A METHOD FOR DEDUCING BRANCHING SEQUENCES IN PHYLOGENY [J].
CAMIN, JH ;
SOKAL, RR .
EVOLUTION, 1965, 19 (03) :311-326
[8]  
Degnan JH, 2005, EVOLUTION, V59, P24
[9]   Relaxed phylogenetics and dating with confidence [J].
Drummond, Alexei J. ;
Ho, Simon Y. W. ;
Phillips, Matthew J. ;
Rambaut, Andrew .
PLOS BIOLOGY, 2006, 4 (05) :699-710
[10]   EVOLUTIONARY TREES FROM DNA-SEQUENCES - A MAXIMUM-LIKELIHOOD APPROACH [J].
FELSENSTEIN, J .
JOURNAL OF MOLECULAR EVOLUTION, 1981, 17 (06) :368-376