FastML: a web server for probabilistic reconstruction of ancestral sequences

Times cited: 242
Authors
Ashkenazy, Haim [1]
Penn, Osnat [1]
Doron-Faigenboim, Adi [3]
Cohen, Ofir [1]
Cannarozzi, Gina [2]
Zomer, Oren [1]
Pupko, Tal [1]
Affiliations
[1] Tel Aviv Univ, George S Wise Fac Life Sci, Dept Cell Res & Immunol, IL-69978 Tel Aviv, Israel
[2] Univ Bern, Inst Plant Sci, CH-3013 Bern, Switzerland
[3] ARO, Volcani Ctr, Inst Plant Sci, IL-50250 Bet Dagan, Israel
Funding
Israel Science Foundation
Keywords
AMINO-ACID-SEQUENCES; MITOCHONDRIAL-DNA; PHYLETIC PATTERNS; PROTEIN SEQUENCES; INFERENCE; MODEL; SUBSTITUTION; SITES; DIVERSITY; ALGORITHM;
DOI
10.1093/nar/gks498
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
Ancestral sequence reconstruction is essential to a variety of evolutionary studies. Here, we present the FastML web server, a user-friendly tool for the reconstruction of ancestral sequences. FastML implements various novel features that differentiate it from existing tools: (i) FastML uses an indel-coding method, in which each gap, possibly spanning multiple sites, is coded as binary data. FastML then reconstructs ancestral indel states assuming a continuous-time Markov process. FastML provides the most likely ancestral sequences, integrating both indels and characters; (ii) FastML accounts for uncertainty in ancestral states: it provides not only the posterior probabilities for each character and indel at each sequence position, but also a sample of ancestral sequences from this posterior distribution, and a list of the k most likely ancestral sequences; (iii) FastML implements a large array of evolutionary models, which makes it generic and applicable to nucleotide, protein and codon sequences; and (iv) a graphical representation of the results is provided, including, for example, a graphical logo of the inferred ancestral sequences. The utility of FastML is demonstrated by reconstructing ancestral sequences of the Env protein from various HIV-1 subtypes. FastML is freely available to all academic users at http://fastml.tau.ac.il/.
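The indel-coding idea in point (i) can be illustrated with a minimal sketch. It is not FastML's implementation: the abstract does not give the exact coding rules, so the code below assumes a simplified scheme in which every distinct maximal gap, identified by its start and end columns, becomes one binary character (1 = the sequence has exactly that gap, 0 = it does not). The function names `find_gaps` and `code_indels` and the toy alignment are hypothetical.

```python
from typing import Dict, List, Tuple

Gap = Tuple[int, int]  # (start, end) alignment columns, end exclusive


def find_gaps(seq: str, gap_char: str = "-") -> List[Gap]:
    """Return the maximal runs of gap characters in one aligned sequence."""
    gaps: List[Gap] = []
    start = None
    for i, ch in enumerate(seq):
        if ch == gap_char:
            if start is None:
                start = i          # a new gap run begins here
        elif start is not None:
            gaps.append((start, i))  # the run ended at the previous column
            start = None
    if start is not None:            # gap run reaching the end of the sequence
        gaps.append((start, len(seq)))
    return gaps


def code_indels(alignment: Dict[str, str]) -> Tuple[List[Gap], Dict[str, List[int]]]:
    """Code each distinct gap as one binary (presence/absence) character.

    Assumption: a gap spanning several sites counts as a single character,
    and two gaps are the same character only if both boundaries match.
    """
    per_seq = {name: set(find_gaps(seq)) for name, seq in alignment.items()}
    all_gaps = sorted(set().union(*per_seq.values()))
    matrix = {
        name: [1 if gap in gaps else 0 for gap in all_gaps]
        for name, gaps in per_seq.items()
    }
    return all_gaps, matrix


# Toy alignment (hypothetical data, for illustration only).
toy = {"s1": "AC--GT", "s2": "AC--GT", "s3": "ACTTGT", "s4": "A---GT"}
gaps, matrix = code_indels(toy)
```

For the toy alignment, `gaps` holds two indel characters, `(1, 4)` and `(2, 4)`, and each sequence gets a two-element 0/1 vector. A binary matrix of this kind is the sort of input on which ancestral presence/absence states could then be inferred along a tree with a two-state continuous-time Markov model, as the abstract describes.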
Pages: W580-W584
Number of pages: 5