A systematic comparison of protein structure classifications: SCOP, CATH and FSSP

被引:126
作者
Hadley, C [1 ]
Jones, DT [1 ]
机构
[1] Univ Warwick, Dept Biol Sci, Prot Struct Grp, Coventry CV4 7AL, W Midlands, England
来源
STRUCTURE WITH FOLDING & DESIGN | 1999年 / 7卷 / 09期
关键词
evolution; protein folds; protein-structure classification; taxonomy;
D O I
10.1016/S0969-2126(99)80177-4
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Background: Several methods of structural classification have been developed to introduce some order to the large amount of data present in the Protein Data Bank. Such methods facilitate structural comparisons and provide a greater understanding of structure and function. The most widely used and comprehensive databases are SCOP, OATH and FSSP, which represent three unique methods of classifying protein structures: purely manual, a combination of manual and automated, and purely automated, respectively. In order to develop reliable template libraries and benchmarks for protein-fold recognition, a systematic comparison of these databases has been carried out to determine their overall agreement in classifying protein structures. Results: Approximately two-thirds of the protein chains in each database are common to all three databases. Despite employing different methods, and basing their systems on different rules of protein structure and taxonomy, SCOP, CATH and FSSP agree on the majority of their classifications. Discrepancies and inconsistencies are accounted for by a small number of explanations. Other interesting features have been identified, and Various differences between manual and automatic classification methods are presented. Conclusions: Using these databases requires an understanding of the rules upon which they are based; each method offers certain advantages depending on the biological requirements and knowledge of the user. The degree of discrepancy between the systems also has an impact on reliability of prediction methods that employ these schemes as benchmarks. To generate accurate fold templates for threading, we extract information from a consensus database, encompassing agreements between SCOP, CATH and FSSP.
引用
收藏
页码:1099 / 1112
页数:14
相关论文
共 36 条
[11]   PARSER FOR PROTEIN-FOLDING UNITS [J].
HOLM, L ;
SANDER, C .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 1994, 19 (03) :256-268
[12]  
HOLM L, 1994, NUCLEIC ACIDS RES, V22, P3600
[13]   A DATABASE OF PROTEIN-STRUCTURE FAMILIES WITH COMMON FOLDING MOTIFS [J].
HOLM, L ;
OUZOUNIS, C ;
SANDER, C ;
TUPAREV, G ;
VRIEND, G .
PROTEIN SCIENCE, 1992, 1 (12) :1691-1698
[14]  
Jones S, 1998, PROTEIN SCI, V7, P233
[15]   MOLSCRIPT - A PROGRAM TO PRODUCE BOTH DETAILED AND SCHEMATIC PLOTS OF PROTEIN STRUCTURES [J].
KRAULIS, PJ .
JOURNAL OF APPLIED CRYSTALLOGRAPHY, 1991, 24 :946-950
[16]   STRUCTURAL PATTERNS IN GLOBULAR PROTEINS [J].
LEVITT, M ;
CHOTHIA, C .
NATURE, 1976, 261 (5561) :552-558
[17]   THREADING A DATABASE OF PROTEIN CORES [J].
MADEJ, T ;
GIBRAT, JF ;
BRYANT, SH .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 1995, 23 (03) :356-369
[18]   Protein folds and functions [J].
Martin, AC ;
Orengo, CA ;
Hutchinson, EG ;
Jones, S ;
Karmirantzou, M ;
Laskowski, RA ;
Mitchell, JB ;
Taroni, C ;
Thornton, JM .
STRUCTURE, 1998, 6 (07) :875-884
[19]   Analysis of domain structural class using an automated class assignment protocol [J].
Michie, AD ;
Orengo, CA ;
Thornton, JM .
JOURNAL OF MOLECULAR BIOLOGY, 1996, 262 (02) :168-185
[20]   HOMSTRAD: A database of protein structure alignments for homologous families [J].
Mizuguchi, K ;
Deane, CM ;
Blundell, TL ;
Overington, JP .
PROTEIN SCIENCE, 1998, 7 (11) :2469-2471