Protein family expansions and biological complexity

被引:167
作者
Vogel, Christine [1 ]
Chothia, Cyrus
机构
[1] MRC, Mol Biol Lab, Cambridge, England
[2] Univ Texas, Inst Cellular & Mol Biol, Austin, TX 78712 USA
基金
英国医学研究理事会;
关键词
D O I
10.1371/journal.pcbi.0020048
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
During the course of evolution, new proteins are produced very largely as the result of gene duplication, divergence and, in many cases, combination. This means that proteins or protein domains belong to families or, in cases where their relationships can only be recognised on the basis of structure, superfamilies whose members descended from a common ancestor. The size of superfamilies can vary greatly. Also, during the course of evolution organisms of increasing complexity have arisen. In this paper we determine the identity of those superfamilies whose relative sizes in different organisms are highly correlated to the complexity of the organisms. As a measure of the complexity of 38 uni- and multicellular eukaryotes we took the number of different cell types of which they are composed. Of 1,219 superfamilies, there are 194 whose sizes in the 38 organisms are strongly correlated with the number of cell types in the organisms. We give outline descriptions of these superfamilies. Half are involved in extracellular processes or regulation and smaller proportions in other types of activity. Half of all superfamilies have no significant correlation with complexity. We also determined whether the expansions of large superfamilies correlate with each other. We found three large clusters of correlated expansions: one involves expansions in both vertebrates and plants, one just in vertebrates, and one just in plants. Our work identifies important protein families and provides one explanation of the discrepancy between the total number of genes and the apparent physiological complexity of eukaryotic organisms.
引用
收藏
页码:370 / 382
页数:13
相关论文
共 46 条
[1]   SCOP database in 2004: refinements integrate structure and sequence family data [J].
Andreeva, A ;
Howorth, D ;
Brenner, SE ;
Hubbard, TJP ;
Chothia, C ;
Murzin, AG .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D226-D229
[2]   Domain combinations in archaeal, eubacterial and eukaryotic proteomes [J].
Apic, G ;
Gough, J ;
Teichmann, SA .
JOURNAL OF MOLECULAR BIOLOGY, 2001, 310 (02) :311-325
[3]   Origin of multicellular eukaryotes - insights from proteome comparisons [J].
Aravind, L ;
Subramanian, G .
CURRENT OPINION IN GENETICS & DEVELOPMENT, 1999, 9 (06) :688-694
[4]   An ontology for cell types [J].
Bard, J ;
Rhee, SY ;
Ashburner, M .
GENOME BIOLOGY, 2005, 6 (02)
[5]   Ensembl 2006 [J].
Birney, E. ;
Andrews, D. ;
Caccamo, M. ;
Chen, Y. ;
Clarke, L. ;
Coates, G. ;
Cox, T. ;
Cunningham, F. ;
Curwen, V. ;
Cutts, T. ;
Down, T. ;
Durbin, R. ;
Fernandez-Suarez, X. M. ;
Flicek, P. ;
Graf, S. ;
Hammond, M. ;
Herrero, J. ;
Howe, K. ;
Iyer, V. ;
Jekosch, K. ;
Kahari, A. ;
Kasprzyk, A. ;
Keefe, D. ;
Kokocinski, F. ;
Kulesha, E. ;
London, D. ;
Longden, I. ;
Melsopp, C. ;
Meidl, P. ;
Overduin, B. ;
Parker, A. ;
Proctor, G. ;
Prlic, A. ;
Rae, M. ;
Rios, D. ;
Redmond, S. ;
Schuster, M. ;
Sealy, I. ;
Searle, S. ;
Severin, J. ;
Slater, G. ;
Smedley, D. ;
Smith, J. ;
Stabenau, A. ;
Stalker, J. ;
Trevanion, S. ;
Ureta-Vidal, A. ;
Vogel, J. ;
White, S. ;
Woodwark, C. .
NUCLEIC ACIDS RESEARCH, 2006, 34 :D556-D561
[6]   Protein variety and functional diversity: Swiss-Prot annotation in its biological context [J].
Boeckmann, B ;
Blatter, MC ;
Famiglietti, L ;
Hinz, U ;
Lane, L ;
Roechert, B ;
Bairoch, A .
COMPTES RENDUS BIOLOGIES, 2005, 328 (10-11) :882-899
[7]   SHUFFLED DOMAINS IN EXTRACELLULAR PROTEINS [J].
BORK, P .
FEBS LETTERS, 1991, 286 (1-2) :47-54
[8]   Alternative splicing and genome complexity [J].
Brett, D ;
Pospisil, H ;
Valcárcel, J ;
Reich, J ;
Bork, P .
NATURE GENETICS, 2002, 30 (01) :29-30
[9]   A common rule for the scaling of carnivore density [J].
Carbone, C ;
Gittleman, JL .
SCIENCE, 2002, 295 (5563) :2273-2276
[10]   Comparison of the complete protein sets of worm and yeast: Orthology and divergence [J].
Chervitz, SA ;
Aravind, L ;
Sherlock, G ;
Ball, CA ;
Koonin, EV ;
Dwight, SS ;
Harris, MA ;
Dolinski, K ;
Mohr, S ;
Smith, T ;
Weng, S ;
Cherry, JM ;
Botstein, D .
SCIENCE, 1998, 282 (5396) :2022-2028