The CATH Hierarchy Revisited-Structural Divergence in Domain Superfamilies and the Continuity of Fold Space

被引:34
作者
Cuff, Alison [1 ]
Redfern, Oliver C. [1 ]
Greene, Lesley [1 ]
Sillitoe, Ian [1 ]
Lewis, Tony [1 ]
Dibley, Mark [1 ]
Reid, Adam [1 ]
Pearl, Frances [1 ]
Dallman, Tim [1 ]
Todd, Annabel [2 ]
Garratt, Richard [3 ]
Thornton, Janet [1 ,2 ]
Orengo, Christine [1 ]
机构
[1] UCL, Inst Struct & Mol Biol, London WC1E 6BT, England
[2] European Bioinformat Inst, Cambridge CB10 1SD, England
[3] Univ Sao Paulo, Inst Fis Sao Carlos, Sao Carlos, SP, Brazil
基金
英国生物技术与生命科学研究理事会;
关键词
PROTEIN-STRUCTURE; SEMANTIC SIMILARITY; STRUCTURE DATABASE; ALIGNMENT METHODS; GENE ONTOLOGY; CLASSIFICATION; EVOLUTION; FAMILIES; GENOMES; SEQUENCE;
D O I
10.1016/j.str.2009.06.015
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
This paper explores the structural continuum in CATH and the extent to which superfamilies adopt distinct folds. Although most superfamilies are structurally conserved, in some of the most highly populated superfamilies (4% of all superfamilies) there is considerable structural divergence. While relatives share a similar fold in the evolutionary conserved core, diverse elaborations to this core can result in significant differences in the global structures. Applying similar protocols to examine the extent to which structural overlaps occur between different fold groups, it appears this effect is confined to just a few architectures and is largely due to small, recurring super-secondary motifs (e.g., alpha beta-motifs, alpha-hairpins). Although 24% of superfamilies overlap with superfamilies having different folds, only 14% of nonredundant structures in CATH are involved in overlaps. Nevertheless, the existence of these overlaps suggests that, in some regions of structure space, the fold universe should be seen as more continuous.
引用
收藏
页码:1051 / 1062
页数:12
相关论文
共 45 条
[1]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[2]   Gene3D: Structural assignment for whole genes and genomes using the CATH domain structure database [J].
Buchan, DWA ;
Shepherd, AJ ;
Lee, D ;
Pearl, FMG ;
Rison, SCG ;
Thornton, JM ;
Orengo, CA .
GENOME RESEARCH, 2002, 12 (03) :503-514
[3]   Implications of structural genomics target selection strategies: Pfam5000, whole genome, and random approaches [J].
Chandonia, JM ;
Brenner, SE .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 58 (01) :166-179
[4]   THE RELATION BETWEEN THE DIVERGENCE OF SEQUENCE AND STRUCTURE IN PROTEINS [J].
CHOTHIA, C ;
LESK, AM .
EMBO JOURNAL, 1986, 5 (04) :823-826
[5]  
Dengler U, 2001, PROTEINS, V42, P332, DOI 10.1002/1097-0134(20010215)42:3<332::AID-PROT40>3.0.CO
[6]  
2-S
[7]   Fragnostic: walking through protein structure space [J].
Friedberg, I ;
Godzik, A .
NUCLEIC ACIDS RESEARCH, 2005, 33 :W249-W251
[8]   The structure of protein evolution and the evolution of protein structure [J].
Goldstein, Richard A. .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 2008, 18 (02) :170-177
[9]   Progress towards mapping the universe of protein folds [J].
Grant, A ;
Lee, D ;
Orengo, C .
GENOME BIOLOGY, 2004, 5 (05)
[10]   The CATH domain structure database: new protocols and classification levels give a more comprehensive resource for exploring evolution [J].
Greene, Lesley H. ;
Lewis, Tony E. ;
Addou, Sarah ;
Cuff, Alison ;
Dallman, Tim ;
Dibley, Mark ;
Redfern, Oliver ;
Pearl, Frances ;
Nambudiry, Rekha ;
Reid, Adam ;
Sillitoe, Ian ;
Yeats, Corin ;
Thornton, Janet M. ;
Orengo, Christine A. .
NUCLEIC ACIDS RESEARCH, 2007, 35 :D291-D297