Domain tree-based analysis of protein architecture evolution

被引:73
作者
Forslund, Kristoffer [1 ]
Henricson, Anna [2 ]
Hollich, Volker [2 ]
Sonnhammer, Erik L. L. [1 ]
机构
[1] Stockholm Univ, Stockholm Bioinformat Ctr, S-10691 Stockholm, Sweden
[2] Karolinska Inst, Dept Cell & Mol Biol, Stockholm, Sweden
关键词
protein; domain; architecture; evolution;
D O I
10.1093/molbev/msm254
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Understanding the dynamics behind domain architecture evolution is of great importance to unravel the functions of proteins. Complex architectures have been created throughout evolution by rearrangement and duplication events. An interesting question is how many times a particular architecture has been created, a form of convergent evolution or domain architecture reinvention. Previous studies have approached this issue by comparing architectures found in different species. We wanted to achieve a finer-grained analysis by reconstructing protein architectures on complete domain trees. The prevalence of domain architecture reinvention in 96 genomes was investigated with a novel domain tree-based method that uses maximum parsimony for inferring ancestral protein architectures. Domain architectures were taken from Pfam. To ensure robustness, we applied the method to bootstrap trees and only considered results with strong statistical support. We detected multiple origins for 12.4% of the scored architectures. In a much smaller data set, the subset of completely domain-assigned proteins, the figure was 5.6%. These results indicate that domain architecture reinvention is a much more common phenomenon than previously thought. We also determined which domains are most frequent in multiply created architectures and assessed whether specific functions could be attributed to them. However, no strong, functional bias was found in architectures with multiple origins.
引用
收藏
页码:254 / 264
页数:11
相关论文
共 23 条
  • [1] Domain combinations in archaeal, eubacterial and eukaryotic proteomes
    Apic, G
    Gough, J
    Teichmann, SA
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2001, 310 (02) : 311 - 325
  • [2] Gene Ontology: tool for the unification of biology
    Ashburner, M
    Ball, CA
    Blake, JA
    Botstein, D
    Butler, H
    Cherry, JM
    Davis, AP
    Dolinski, K
    Dwight, SS
    Eppig, JT
    Harris, MA
    Hill, DP
    Issel-Tarver, L
    Kasarskis, A
    Lewis, S
    Matese, JC
    Richardson, JE
    Ringwald, M
    Rubin, GM
    Sherlock, G
    [J]. NATURE GENETICS, 2000, 25 (01) : 25 - 29
  • [3] Protein length in eukaryotic and prokaryotic proteomes
    Brocchieri, L
    Karlin, S
    [J]. NUCLEIC ACIDS RESEARCH, 2005, 33 (10) : 3390 - 3400
  • [4] Global extent of horizontal gene transfer
    Choi, In-Geol
    Kim, Sung-Hou
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (11) : 4489 - 4494
  • [5] THE MULTIPLICITY OF DOMAINS IN PROTEINS
    DOOLITTLE, RF
    [J]. ANNUAL REVIEW OF BIOCHEMISTRY, 1995, 64 : 287 - 314
  • [6] Multi-domain proteins in the three kingdoms of life:: Orphan domains and other unassigned regions
    Ekman, D
    Björklund, ÅK
    Frey-Skött, J
    Elofsson, A
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2005, 348 (01) : 231 - 243
  • [7] Pfam:: clans, web tools and services
    Finn, Robert D.
    Mistry, Jaina
    Schuster-Bockler, Benjamin
    Griffiths-Jones, Sam
    Hollich, Volker
    Lassmann, Timo
    Moxon, Simon
    Marshall, Mhairi
    Khanna, Ajay
    Durbin, Richard
    Eddy, Sean R.
    Sonnhammer, Erik L. L.
    Bateman, Alex
    [J]. NUCLEIC ACIDS RESEARCH, 2006, 34 : D247 - D251
  • [8] Modeling the evolution of protein domain architectures using maximum parsimony
    Fong, Jessica H.
    Geer, Lewis Y.
    Panchenko, Anna R.
    Bryant, Stephen H.
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2007, 366 (01) : 307 - 315
  • [9] Convergent evolution of domain architectures (is rare)
    Gough, J
    [J]. BIOINFORMATICS, 2005, 21 (08) : 1464 - 1471
  • [10] Glycoside hydrolases and glycosyltransferases. Families, modules, and implications for genomics
    Henrissat, B
    Davies, GJ
    [J]. PLANT PHYSIOLOGY, 2000, 124 (04) : 1515 - 1519