Evolution of function in protein superfamilies, from a structural perspective

被引:530
作者
Todd, AE
Orengo, CA
Thornton, JM
机构
[1] UCL, Dept Biochem & Mol Biol, London WC1E 6BT, England
[2] Univ London Birkbeck Coll, Dept Crystallog, London WC1E 7HX, England
基金
英国生物技术与生命科学研究理事会; 英国医学研究理事会;
关键词
enzyme superfamilies; three-dimensional structure; function; diversity; evolution;
D O I
10.1006/jmbi.2001.4513
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The recent growth in protein databases has revealed the functional diversity of many protein superfamilies. We have assessed the functional variation of homologous enzyme superfamilies containing two or more enzymes, as defined by the CATH protein structure classification, by way of the Enzyme Commission (EC) scheme. Combining sequence and structure information to identify relatives, the majority of superfamilies display variation in enzyme function, with 25 % of superfamilies in the PDB having members of different enzyme types. We determined the extent of functional similarity at different levels of sequence identity for 486,000 homologous pairs (enzyme/enzyme and enzyme/non-enzyme), with structural and sequence relatives included. For single and multi-domain proteins, variation in EC number is rare above 40 % sequence identity, and above 30 %, the first three digits may be predicted with an accuracy of at least 90 %. For more distantly related proteins sharing less than 30 % sequence identity, functional variation is significant, and below this threshold, structural data are essential for understanding the molecular basis of observed functional differences. To explore the mechanisms for generating functional diversity during evolution, we have studied in detail 31 diverse structural enzyme superfamilies for which structural data are available. A large number of variations and peculiarities are observed, at the atomic level through to gross structural rearrangements. Almost all superfamilies exhibit functional diversity generated by local sequence variation and domain shuffling. Commonly, substrate specificity is diverse across a superfamily, whilst the reaction chemistry is maintained. In many superfamilies, the position of catalytic residues may vary despite playing equivalent functional roles in related proteins. The implications of functional diversity within supefamilies for the structural genomics projects are discussed. More detailed information on these superfamilies is available at http:/ /www.biochem.ucl.ac.uk/bsm/ FAM-EC /. (C) 2001 Academic Press.
引用
收藏
页码:1113 / 1143
页数:31
相关论文
共 113 条
[1]  
AEvarsson A, 1999, NAT STRUCT BIOL, V6, P785
[2]   EVOLUTIONARY RELATIONSHIPS AMONG PYRIDOXAL-5'-PHOSPHATE-DEPENDENT ENZYMES - REGIO-SPECIFIC ALPHA-FAMILY, BETA-FAMILY, AND GAMMA-FAMILY [J].
ALEXANDER, FW ;
SANDMEIER, E ;
MEHTA, PK ;
CHRISTEN, P .
EUROPEAN JOURNAL OF BIOCHEMISTRY, 1994, 219 (03) :953-960
[3]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[4]   The catalytic domain of the P-type ATPase has the haloacid dehalogenase fold [J].
Aravind, L ;
Galperin, MY ;
Koonin, EV .
TRENDS IN BIOCHEMICAL SCIENCES, 1998, 23 (04) :127-129
[5]  
Artymiuk PJ, 1997, NATURE, V388, P33, DOI 10.1038/40310
[6]  
ASHBURNER M, 1994, DEVELOPMENT, V120, P2077
[7]   Understanding enzyme superfamilies - Chemistry as the fundamental determinant in the evolution of new catalytic activities [J].
Babbitt, PC ;
Gerlt, JA .
JOURNAL OF BIOLOGICAL CHEMISTRY, 1997, 272 (49) :30591-30594
[8]   The ENZYME database in 2000 [J].
Bairoch, A .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :304-305
[9]   The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000 [J].
Bairoch, A ;
Apweiler, R .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :45-48
[10]   ELECTRON-DENSITY MAP OF APOFERRITIN AT 2.8-A RESOLUTION [J].
BANYARD, SH ;
STAMMERS, DK ;
HARRISON, PM .
NATURE, 1978, 271 (5642) :282-284