The Gene Evolution Model and Computing Its Associated Probabilities

被引:55
作者
Arvestad, Lars [1 ]
Lagergren, Jens [1 ]
Sennblad, Bengt [2 ]
机构
[1] Royal Inst Technol, AlbaNova Univ Ctr, Sch Comp Sci & Commun, SE-10691 Stockholm, Sweden
[2] Stockholm Univ, AlbaNova Univ Ctr, Dept Biochem & Biophys, SE-10691 Stockholm, Sweden
基金
瑞典研究理事会;
关键词
Algorithms; Theory; Phylogeny; gene; evolution; probability; duplication; loss; reconciliation; DNA-SEQUENCES; SIGNED PERMUTATIONS; TREES; PHYLOGENY; INFERENCE; DISTRIBUTIONS; ALGORITHMS; ORTHOLOGS; LINEAGE;
D O I
10.1145/1502793.1502796
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Phylogeny is both a fundamental tool in biology and a rich source of fascinating modeling and algorithmic problems. Today's wealth of sequenced genomes makes it increasingly important to understand evolutionary events such as duplications, losses, transpositions, inversions, lateral transfers, and domain shuffling. We focus on the gene duplication event, that constitutes a major force in the creation of genes with new function [Ohno 1970; Lynch and Force 2000] and, thereby also, of biodiversity. We introduce the probabilistic gene evolution model, which describes how a gene tree evolves within a given species tree with respect to speciation, gene duplication, and gene loss. The actual relation between gene tree and species tree is captured by a reconciliation, a concept which we generalize for more expressiveness. The model is a canonical generalization of the classical linear birth-death process, obtained by replacing the interval where the process takes place by a tree. For the gene evolution model, we derive efficient algorithms for some associated probability distributions: the probability of a reconciled tree, the probability of a gene tree, the maximum probability reconciliation, the posterior probability of a reconciliation, and sampling reconciliations with respect to the posterior probability. These algorithms provides the basis for several applications, including species tree construction, reconciliation analysis, orthology analysis, biogeography, and host-parasite co-evolution.
引用
收藏
页数:44
相关论文
共 63 条
[51]   Explosive lineage-specific expansion of the orphan nuclear receptor HNF4 in nematodes [J].
Robinson-Rechavi, M ;
Maina, CV ;
Gissendanner, CR ;
Laudet, V ;
Sluder, A .
JOURNAL OF MOLECULAR EVOLUTION, 2005, 60 (05) :577-586
[52]   Genome-scale approaches to resolving incongruence in molecular phylogenies [J].
Rokas, A ;
Williams, BL ;
King, N ;
Carroll, SB .
NATURE, 2003, 425 (6960) :798-804
[53]   Comprehensive analysis of orthologous protein domains using the HOPS database [J].
Storm, CEV ;
Sonnhammer, ELL .
GENOME RESEARCH, 2003, 13 (10) :2353-2362
[54]   Automated ortholog inference from phylogenetic trees and calculation of orthology reliability [J].
Storm, CEV ;
Sonnhammer, ELL .
BIOINFORMATICS, 2002, 18 (01) :92-99
[55]   Overcredibility of molecular phylogenies obtained by Bayesian phylogenetics [J].
Suzuki, Y ;
Glazko, GV ;
Nei, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (25) :16138-16143
[56]   Nothofagus biogeography revisited with special emphasis on the enigmatic distribution of subgenus Brassospora in New Caledonia [J].
Swenson, U ;
Backlund, A ;
McLoughlin, S ;
Hill, RS .
CLADISTICS, 2001, 17 (01) :28-47
[57]  
TAJIMA F, 1983, GENETICS, V105, P437
[58]  
Tannier E, 2004, LECT NOTES COMPUT SC, V3109, P1
[59]   A genomic perspective on protein families [J].
Tatusov, RL ;
Koonin, EV ;
Lipman, DJ .
SCIENCE, 1997, 278 (5338) :631-637
[60]  
Thompson E.A., 1975, HUMAN EVOLUTIONARY T