Reconstruction of ancestral protein sequences and its applications

被引:98
作者
Cai, W
Pei, JM
Grishin, NV
机构
[1] Univ Texas, SW Med Ctr, Howard Hughes Med Inst, Dallas, TX 75390 USA
[2] Univ Texas, SW Med Ctr, Dept Biochem, Dallas, TX 75390 USA
关键词
D O I
10.1186/1471-2148-4-33
中图分类号
Q [生物科学];
学科分类号
07 [理学]; 0710 [生物学]; 09 [农学];
摘要
Background: Modern-day proteins were selected during long evolutionary history as descendants of ancient life forms. In silico reconstruction of such ancestral protein sequences facilitates our understanding of evolutionary processes, protein classification and biological function. Additionally, reconstructed ancestral protein sequences could serve to fill in sequence space thus aiding remote homology inference. Results: We developed ANCESCON, a package for distance-based phylogenetic inference and reconstruction of ancestral protein sequences that takes into account the observed variation of evolutionary rates between positions that more precisely describes the evolution of protein families. To improve the accuracy of evolutionary distance estimation and ancestral sequence reconstruction, two approaches are proposed to estimate position-specific evolutionary rates. Comparisons show that at large evolutionary distances our method gives more accurate ancestral sequence reconstruction than PAML, PHYLIP and PAUP*. We apply the reconstructed ancestral sequences to homology inference and functional site prediction. We show that the usage of hypothetical ancestors together with the present day sequences improves profile-based sequence similarity searches; and that ancestral sequence reconstruction methods can be used to predict positions with functional specificity. Conclusions: As a computational tool to reconstruct ancestral protein sequences from a given multiple sequence alignment, ANCESCON shows high accuracy in tests and helps detection of remote homologs and prediction of functional sites. ANCESCON is freely available for noncommercial use. Pre-compiled versions for several platforms can be downloaded from ftp:// iole.swmed.edu/ pub/ANCESCON/.
引用
收藏
页数:23
相关论文
共 58 条
[1]
HIGH-RESOLUTION STRUCTURES OF ADENYLATE KINASE FROM YEAST LIGATED WITH INHIBITOR AP(5)A, SHOWING THE PATHWAY OF PHOSPHORYL TRANSFER [J].
ABELE, U ;
SCHULZ, GE .
PROTEIN SCIENCE, 1995, 4 (07) :1262-1271
[2]
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]
Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkp985, 10.1093/nar/gkr1065, 10.1093/nar/gkh121]
[4]
The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[5]
The structural basis of the activation of Ras by Sos [J].
Boriack-Sjodin, PA ;
Margarit, SM ;
Bar-Sagi, D ;
Kuriyan, J .
NATURE, 1998, 394 (6691) :337-343
[6]
Weighted neighbor joining: A likelihood-based approach to distance-based phylogeny reconstruction [J].
Bruno, WJ ;
Socci, ND ;
Halpern, AL .
MOLECULAR BIOLOGY AND EVOLUTION, 2000, 17 (01) :189-197
[7]
Peptide ligands of pp60(c-src) SH2 domains: A thermodynamic and structural study [J].
Charifson, PS ;
Shewchuk, LM ;
Rocque, W ;
Hummel, CW ;
Jordan, SR ;
Mohr, C ;
Pacofsky, GJ ;
Peel, MR ;
Rodriguez, M ;
Sternbach, DD ;
Consler, TG .
BIOCHEMISTRY, 1997, 36 (21) :6283-6293
[8]
Crystal structures of a complexed and peptide-free membrane protein-binding domain: Molecular basis of peptide recognition by PDZ [J].
Doyle, DA ;
Lee, A ;
Lewis, J ;
Kim, E ;
Sheng, M ;
MacKinnon, R .
CELL, 1996, 85 (07) :1067-1076
[9]
Profile hidden Markov models [J].
Eddy, SR .
BIOINFORMATICS, 1998, 14 (09) :755-763
[10]
EVOLUTIONARY TREES FROM DNA-SEQUENCES - A MAXIMUM-LIKELIHOOD APPROACH [J].
FELSENSTEIN, J .
JOURNAL OF MOLECULAR EVOLUTION, 1981, 17 (06) :368-376