A novel method for estimating ancestral amino acid composition and its application to proteins of the Last Universal Ancestor

被引:23
作者
Brooks, DJ
Fresco, JR
Singh, M [1 ]
机构
[1] Princeton Univ, Dept Comp Sci, Princeton, NJ 08544 USA
[2] Washington Univ, Sch Med, Dept Genet, St Louis, MO 63110 USA
[3] Princeton Univ, Dept Mol Biol, Princeton, NJ 08544 USA
[4] Princeton Univ, Lewis Sigler Inst Integrat Gen, Princeton, NJ 08544 USA
关键词
D O I
10.1093/bioinformatics/bth235
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Knowledge of how proteomic amino acid composition has changed over time is important for constructing realistic models of protein evolution and increasing our understanding of molecular evolutionary history. The proteomic amino acid composition of the Last Universal Ancestor (LUA) of life is of particular interest, since that might provide insight into the early evolution of proteins and the nature of the LUA itself. Results: We introduce a method to estimate ancestral amino acid composition that is based on expectation-maximization. On simulated data, the approach was found to be very effective in estimating ancestral amino acid composition, with accuracy improving as the number of residues in the dataset was increased. The method was then used to infer the amino acid composition of a set of proteins in the LUA. In general, as compared with the modern protein set, LUA proteins were found to be richer in amino acids that are believed to have been most abundant in the prebiotic environment and poorer in those believed to have been unavailable or scarce. Additionally, we found the inferred amino acid composition of this protein set in the LUA to be more similar to the observed composition of the same set in extant thermophilic species than in extant mesophilic species, supporting the idea that the LUA lived in a thermophilic environment.
引用
收藏
页码:2251 / 2257
页数:7
相关论文
共 24 条
[1]   Phylogenetic depth of the bacterial genera Aquifex and Thermotoga inferred from analysis of ribosomal protein, elongation factor, and RNA polymerase subunit sequences [J].
Bocchetta, M ;
Gribaldo, S ;
Sanangelantoni, A ;
Cammarano, P .
JOURNAL OF MOLECULAR EVOLUTION, 2000, 50 (04) :366-380
[2]   Phylogeny - A non-hyperthermophilic ancestor for bacteria [J].
Brochier, C ;
Philippe, H .
NATURE, 2002, 417 (6886) :244-244
[3]   Greater GNN pattern bias in sequence elements encoding conserved residues of ancient proteins may be an indicator of amino acid composition of early proteins [J].
Brooks, DJ ;
Fresco, JR .
GENE, 2003, 303 :177-185
[4]   Evolution of amino acid frequencies in proteins over deep time: Inferred order of introduction of amino acids into the genetic code [J].
Brooks, DJ ;
Fresco, JR ;
Lesk, AM ;
Singh, M .
MOLECULAR BIOLOGY AND EVOLUTION, 2002, 19 (10) :1645-1655
[5]   Increased frequency of cysteine, tyrosine, and phenylalanine residues since the last universal ancestor [J].
Brooks, DJ ;
Fresco, JR .
MOLECULAR & CELLULAR PROTEOMICS, 2002, 1 (02) :125-131
[6]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[7]   The universal ancestor was a thermophile or a hyperthermophile [J].
Di Giulio, M .
GENE, 2001, 281 (1-2) :11-17
[8]  
FELSENSTEIN J, 1993, PHYLIP VERSION 35C D
[9]   Inferring pattern and process: Maximum-likelihood implementation of a nonhomogeneous model of DNA sequence evolution for phylogenetic analysis [J].
Galtier, N ;
Gouy, M .
MOLECULAR BIOLOGY AND EVOLUTION, 1998, 15 (07) :871-879
[10]   A nonhyperthermophilic common ancestor to extant life forms [J].
Galtier, N ;
Tourasse, N ;
Gouy, M .
SCIENCE, 1999, 283 (5399) :220-221