Domain rearrangements in protein evolution

被引:147
作者
Björklund, ÅK [1 ]
Ekman, D [1 ]
Light, S [1 ]
Frey-Skött, J [1 ]
Elofsson, A [1 ]
机构
[1] Stockholm Univ, Stockholm Bioinformat Ctr, SE-10691 Stockholm, Sweden
关键词
protein evolution; multi-domain proteins; proteome; GOGraph; Pfam;
D O I
10.1016/j.jmb.2005.08.067
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Most eukaryotic proteins are multi-domain proteins that are created from fusions of genes, deletions and internal repetitions. An investigation of such evolutionary events requires a method to find the domain architecture from which each protein originates. Therefore, we defined a novel measure, domain distance, which is calculated as the number of domains that differ between two domain architectures. Using this measure the evolutionary events that distinguish a protein from its closest ancestor have been studied and it was found that indels are more common than internal repetition and that the exchange of a domain is rare. Indels and repetitions are common at both the N and C-terminals while they are rare between domains. The evolution of the majority of multi-domain proteins can be explained by the stepwise insertions of single domains, with the exception of repeats that sometimes are duplicated several domains in tandem. We show that domain distances agree with sequence similarity and semantic similarity based on gene ontology annotations. In addition, we demonstrate the use of the domain distance measure to build evolutionary trees. Finally, the evolution of multi-domain proteins is exemplified by a closer study of the evolution of two protein families, non-receptor tyrosine kinases and RhoGEFs. (c) 2005 Elsevier Ltd. All rights reserved.
引用
收藏
页码:911 / 923
页数:13
相关论文
共 42 条
[1]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[2]   Domain combinations in archaeal, eubacterial and eukaryotic proteomes [J].
Apic, G ;
Gough, J ;
Teichmann, SA .
JOURNAL OF MOLECULAR BIOLOGY, 2001, 310 (02) :311-325
[3]   Domain insertions in protein structures [J].
Aroul-Selvam, R ;
Hubbard, T ;
Sasidharan, R .
JOURNAL OF MOLECULAR BIOLOGY, 2004, 338 (04) :633-641
[4]  
BAIROCH A, 1996, NUCLEIC ACIDS RES, V24, P17
[5]   The N-terminal moiety of CDC25(Mm), a GDP/GTP exchange factor of Ras proteins, controls the activity of the catalytic domain - Modulation by calmodulin and calpain [J].
Baouz, S ;
Jacquet, E ;
Bernardi, A ;
Parmeggiani, A .
JOURNAL OF BIOLOGICAL CHEMISTRY, 1997, 272 (10) :6671-6676
[6]   Regulatory and signaling properties of the Vav family [J].
Bustelo, XR .
MOLECULAR AND CELLULAR BIOLOGY, 2000, 20 (05) :1461-1477
[7]   The multidomain protein Trio binds the LAR transmembrane tyrosine phosphatase, contains a protein kinase domain, and has separate rac-specific and rho-specific guanine nucleotide exchange factor domains [J].
Debant, A ;
SerraPages, C ;
Seipel, K ;
OBrien, S ;
Tang, M ;
Park, SH ;
Streuli, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1996, 93 (11) :5466-5471
[8]   B plexins activate Rho through PDZ-RhoGEF [J].
Driessens, MHE ;
Olivo, C ;
Nagata, K ;
Inagaki, M ;
Collard, JG .
FEBS LETTERS, 2002, 529 (2-3) :168-172
[9]   Multi-domain proteins in the three kingdoms of life:: Orphan domains and other unassigned regions [J].
Ekman, D ;
Björklund, ÅK ;
Frey-Skött, J ;
Elofsson, A .
JOURNAL OF MOLECULAR BIOLOGY, 2005, 348 (01) :231-243
[10]   CDART: Protein homology by domain architecture [J].
Geer, LY ;
Domrachev, M ;
Lipman, DJ ;
Bryant, SH .
GENOME RESEARCH, 2002, 12 (10) :1619-1623