A phylogenomic reconstruction of the protein world based on a genomic census of protein fold architecture

Cited by: 32
Authors
Wang, Minglei
Boca, Simina Maria
Kalelkar, Rakhee
Mittenthal, Jay E.
Caetano-Anolles, Gustavo [1]
Affiliations
[1] Univ Illinois, Dept Crop Sci, Urbana, IL 61801 USA
[2] Univ Illinois, Dept Cell & Dev Biol, Urbana, IL 61801 USA
[3] Univ Illinois, Dept Math, Urbana, IL 61801 USA
Keywords
evolutionary funnel; organismal diversification; origins of life; protein fold structure; architectural diversification
DOI
10.1002/cplx.20141
Chinese Library Classification (CLC) number
O1 [Mathematics]
Discipline classification code
0701; 070101
Abstract
The protein world has a hierarchical and redundant organization that can be specified in terms of evolutionary units of molecular structure, the protein domains. The Structural Classification of Proteins (SCOP) has unified domains into a comparatively small set of folding architectures, the protein fold families and superfamilies, and these have been further grouped into protein folds. In this study, we reconstruct the evolution of the protein world using information embedded in a structural genomic census of fold architectures defined by a phylogenomic analysis of 185 completely sequenced genomes using advanced hidden Markov models and 776 folds described in SCOP release 1.67. Our study confirms the existence of defined evolutionary patterns of architectural diversification and explores how phylogenomic trees generated from folds relate to those reconstructed from fold superfamilies. Evolutionary patterns help us propose a general conceptual model that describes the growth of architectures in the protein world. (c) 2006 Wiley Periodicals, Inc.
Pages: 27-40
Page count: 14