Inference and Characterization of Horizontally Transferred Gene Families Using Stochastic Mapping

被引:51
作者
Cohen, Ofir [1 ]
Pupko, Tal [1 ]
机构
[1] Tel Aviv Univ, George S Wise Fac Life Sci, Dept Cell Res & Immunol, IL-69978 Tel Aviv, Israel
基金
以色列科学基金会;
关键词
phyletic pattern; probabilistic-evolutionary models; mixture models; genome evolution; horizontal gene transfer; gene-family content; MAXIMUM-LIKELIHOOD-ESTIMATION; NUCLEOTIDE SUBSTITUTION; PHYLOGENETIC ANALYSIS; GENOME INNOVATION; MIXTURE MODEL; DNA-SEQUENCES; BACTERIAL; EVOLUTION; RECONSTRUCTION; SITES;
D O I
10.1093/molbev/msp240
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Macrogenomic events, in which genes are gained and lost, play a pivotal evolutionary role in microbial evolution. Nevertheless, probabilistic-evolutionary models describing such events and methods for their robust inference are considerably less developed than existing methodologies for analyzing site-specific sequence evolution. Here, we present a novel method for the inference of gains and losses of gene families. First, we develop probabilistic-evolutionary models describing the dynamics of gene-family content, which are more biologically realistic than previously suggested models. In our likelihood-based models, gains and losses are represented by transitions between presence and absence, given an underlying phylogeny. We employ a mixture-model approach in which we allow both the gain rate and the loss rate to vary among gene families. Second, we use these models together with the analytic implementation of stochastic mapping to infer branch-specific events. Our novel methodology allows us to infer and quantify horizontal gene transfer (HGT) events. This enables us to rank various gene families and lineages according to their propensity to undergo gains and losses. Applying our methodology to 4,873 gene families shows that: 1) the novel mixture models describe the observed variability in gene-family content among microbes significantly better than previous models; 2) The stochastic mapping approach enables accurate inference of gain and loss events based on simulations; 3) At least 34% of the gene families analyzed are inferred to have experienced HGT at least once during their evolution; and 4) Gene families that were inferred to experience HGT are both enriched and depleted with respect to specific functional categories.
引用
收藏
页码:703 / 713
页数:11
相关论文
共 84 条
[1]  
[Anonymous], 1996, Stochastic Processes
[2]  
[Anonymous], 2002, Algorithms for Minimization Without Derivatives
[3]   Highways of gene sharing in prokaryotes [J].
Beiko, RG ;
Harlow, TJ ;
Ragan, MA .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (40) :14332-14337
[4]   Controlling the false discovery rate in behavior genetics research [J].
Benjamini, Y ;
Drai, D ;
Elmer, G ;
Kafkafi, N ;
Golani, I .
BEHAVIOURAL BRAIN RESEARCH, 2001, 125 (1-2) :279-284
[5]   Evolution of microbial genomes: Sequence acquisition and loss [J].
Berg, OG ;
Kurland, CG .
MOLECULAR BIOLOGY AND EVOLUTION, 2002, 19 (12) :2265-2276
[6]   Efficient likelihood computations with nonreversible models of evolution [J].
Boussau, Bastien ;
Gouy, Manolo .
SYSTEMATIC BIOLOGY, 2006, 55 (05) :756-768
[7]   Patterns of intron gain and conservation in eukaryotic genes [J].
Carmel, Liran ;
Rogozin, Igor B. ;
Wolf, Yuri I. ;
Koonin, Eugene V. .
BMC EVOLUTIONARY BIOLOGY, 2007, 7 (1)
[8]   Global extent of horizontal gene transfer [J].
Choi, In-Geol ;
Kim, Sung-Hou .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (11) :4489-4494
[9]   Toward automatic reconstruction of a highly resolved tree of life [J].
Ciccarelli, FD ;
Doerks, T ;
von Mering, C ;
Creevey, CJ ;
Snel, B ;
Bork, P .
SCIENCE, 2006, 311 (5765) :1283-1287
[10]   A likelihood framework to analyse phyletic patterns [J].
Cohen, Ofir ;
Rubinstein, Nimrod D. ;
Stern, Adi ;
Gophna, Uri ;
Pupko, Tal .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2008, 363 (1512) :3903-3911