ProteinHistorian: Tools for the Comparative Analysis of Eukaryote Protein Origin

Cited by: 64
Authors
Capra, John A. [1]
Williams, Alexander G. [1]
Pollard, Katherine S. [1, 2, 3]
Affiliations
[1] Univ Calif San Francisco, J David Gladstone Inst, San Francisco, CA 94143 USA
[2] Univ Calif San Francisco, Inst Human Genet, San Francisco, CA 94143 USA
[3] Univ Calif San Francisco, Div Biostat, San Francisco, CA 94143 USA
Funding
US National Institutes of Health (NIH);
Keywords
WEB-BASED TOOL; GENE ONTOLOGY; PHYLOGENETIC PROFILES; PURIFYING SELECTION; EVOLUTIONARY RATE; EMERGENCE; AGE;
DOI
10.1371/journal.pcbi.1002567
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Discipline classification code
071010; 081704;
Abstract
The evolutionary history of a protein reflects the functional history of its ancestors. Recent phylogenetic studies identified distinct evolutionary signatures that characterize proteins involved in cancer, Mendelian disease, and different ontogenic stages. Despite the potential to yield insight into the cellular functions and interactions of proteins, such comparative phylogenetic analyses are rarely performed, because they require custom algorithms. We developed ProteinHistorian to make tools for performing analyses of protein origins widely available. Given a list of proteins of interest, ProteinHistorian estimates the phylogenetic age of each protein, quantifies enrichment for proteins of specific ages, and compares variation in protein age with other protein attributes. ProteinHistorian allows flexibility in the definition of protein age by including several algorithms for estimating ages from different databases of evolutionary relationships. We illustrate the use of ProteinHistorian with three example analyses. First, we demonstrate that proteins with high expression in human, compared to chimpanzee and rhesus macaque, are significantly younger than those with human-specific low expression. Next, we show that human proteins with annotated regulatory functions are significantly younger than proteins with catalytic functions. Finally, we compare protein length and age in many eukaryotic species and, as expected from previous studies, find a positive, though often weak, correlation between protein age and length. ProteinHistorian is available through a web server with an intuitive interface and as a set of command line tools; this allows biologists and bioinformaticians alike to integrate these approaches into their analysis pipelines. ProteinHistorian's modular, extensible design facilitates the integration of new datasets and algorithms. 
The ProteinHistorian web server, source code, and pre-computed ages for 32 eukaryotic genomes are freely available under the GNU public license at http://lighthouse.ucsf.edu/ProteinHistorian/.
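The enrichment step described in the abstract (testing whether a protein list contains more proteins of a given age than expected) can be illustrated with a small sketch. The abstract does not say which statistical test ProteinHistorian uses, so the one-sided hypergeometric test below is an assumption. The function names, the age labels, and the input layout (a dictionary mapping protein IDs to age labels) are hypothetical and are not part of the tool's interface.

```python
from collections import Counter
from math import comb


def hypergeom_upper_tail(k, K, n, N):
    """Return P(X >= k) for a hypergeometric draw.

    N: background size, K: background proteins with the age of interest,
    n: query size, k: query proteins with the age of interest.
    """
    upper = min(K, n)
    favorable = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return favorable / comb(N, n)


def age_enrichment(background_ages, query_ids):
    """Compute a one-sided enrichment p-value for every age label.

    background_ages: dict of protein ID -> age label (hypothetical format).
    query_ids: protein IDs of interest; all must be keys of background_ages.
    """
    N = len(background_ages)
    n = len(query_ids)
    background_counts = Counter(background_ages.values())
    query_counts = Counter(background_ages[p] for p in query_ids)
    return {
        age: hypergeom_upper_tail(query_counts.get(age, 0), K, n, N)
        for age, K in background_counts.items()
    }
```

In practice the p-values would need a multiple-testing correction, because one test is run per age category.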
Pages: 9