Domain deletions and substitutions in the modular protein evolution

被引:96
作者
Weiner, J [1 ]
Beaussart, F [1 ]
Bornberg-Bauer, E [1 ]
机构
[1] Univ Munster, Sch Biol Sci, Div Bioinformat, D-48149 Munster, Germany
关键词
domain loss; fission; fusion; protein domains; protein evolution;
D O I
10.1111/j.1742-4658.2006.05220.x
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The main mechanisms shaping the modular evolution of proteins are gene duplication, fusion and fission, recombination and loss of fragments. While a large body of research has focused on duplications and fusions, we concentrated, in this study, on how domains are lost. We investigated motif databases and introduced a measure of protein similarity that is based on domain arrangements. Proteins are represented as strings of domains and comparison was based on the classic dynamic alignment scheme. We found that domain losses and duplications were more frequent at the ends of proteins. We showed that losses can be explained by the introduction of start and stop codons which render the terminal domains nonfunctional, such that further shortening, until the whole domain is lost, is not evolutionarily selected against. We demonstrated that domains which also occur as single-domain proteins are less than at the C terminus and in the middle, than at the C terminus. We conclude that fission/fusion events with single-domain proteins Occur mostly at the C terminus. We found that domain substitutions are rare, in particular in the middle of proteins. We also showed that many cases of substitutions or losses result from erroneous annotations, but we were also able to find courses of evolutionary events where domains vanish over time. This is explained by a case study on the bacterial formate dehydrogenases.
引用
收藏
页码:2037 / 2047
页数:11
相关论文
共 29 条
[1]  
Apic G, 2001, Bioinformatics, V17 Suppl 1, pS83
[2]   Domain combinations in archaeal, eubacterial and eukaryotic proteomes [J].
Apic, G ;
Gough, J ;
Teichmann, SA .
JOURNAL OF MOLECULAR BIOLOGY, 2001, 310 (02) :311-325
[3]  
Apic Gordana, 2003, Journal of Structural and Functional Genomics, V4, P67, DOI 10.1023/A:1026113408773
[4]   The geometry of domain combination in proteins [J].
Bashton, M ;
Chothia, C .
JOURNAL OF MOLECULAR BIOLOGY, 2002, 315 (04) :927-939
[5]  
Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkp985, 10.1093/nar/gkr1065, 10.1093/nar/gkh121]
[6]   The evolution of domain arrangements in proteins and interaction networks [J].
Bornberg-Bauer, E ;
Beaussart, F ;
Kummerfeld, S ;
Teichmann, S ;
Weiner, J .
CELLULAR AND MOLECULAR LIFE SCIENCES, 2005, 62 (04) :435-445
[7]   Randomness, structural uniqueness, modularity and neutral evolution in sequence space of model proteins [J].
Bornberg-Bauer, E .
ZEITSCHRIFT FUR PHYSIKALISCHE CHEMIE-INTERNATIONAL JOURNAL OF RESEARCH IN PHYSICAL CHEMISTRY & CHEMICAL PHYSICS, 2002, 216 :139-154
[8]   Crystal structure of formate dehydrogenase H: Catalysis involving Mo, molybdopterin, selenocysteine, and an Fe4S4 cluster [J].
Boyington, JC ;
Gladyshev, VN ;
Khangulov, SV ;
Stadtman, TC ;
Sun, PD .
SCIENCE, 1997, 275 (5304) :1305-1308
[9]   Sequence permutations in the molecular evolution of DNA methyltransferases [J].
Bujnicki, Janusz M. .
BMC EVOLUTIONARY BIOLOGY, 2002, 2 (1)
[10]   ProDom and ProDom-CG: tools for protein domain analysis and whole genome comparisons [J].
Corpet, F ;
Servant, F ;
Gouzy, J ;
Kahn, D .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :267-269