SCOPe: Structural Classification of Proteins-extended, integrating SCOP and ASTRAL data and classification of new structures

被引:516
作者
Fox, Naomi K. [1 ]
Brenner, Steven E. [1 ,2 ]
Chandonia, John-Marc [1 ]
机构
[1] Univ Calif Berkeley, Lawrence Berkeley Natl Lab, Phys Biosci Div, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Dept Plant & Microbial Biol, Berkeley, CA 94720 USA
基金
美国国家卫生研究院;
关键词
COMPENDIUM; ASSIGNMENT; SEQUENCES; GENOMICS;
D O I
10.1093/nar/gkt1240
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Structural Classification of Proteins-extended (SCOPe, http://scop.berkeley.edu) is a database of protein structural relationships that extends the SCOP database. SCOP is a manually curated ordering of domains from the majority of proteins of known structure in a hierarchy according to structural and evolutionary relationships. Development of the SCOP 1.x series concluded with SCOP 1.75. The ASTRAL compendium provides several databases and tools to aid in the analysis of the protein structures classified in SCOP, particularly through the use of their sequences. SCOPe extends version 1.75 of the SCOP database, using automated curation methods to classify many structures released since SCOP 1.75. We have rigorously benchmarked our automated methods to ensure that they are as accurate as manual curation, though there are many proteins to which our methods cannot be applied. SCOPe is also partially manually curated to correct some errors in SCOP. SCOPe aims to be backward compatible with SCOP, providing the same parseable files and a history of changes between all stable SCOP and SCOPe releases. SCOPe also incorporates and updates the ASTRAL database. The latest release of SCOPe, 2.03, contains 59 514 Protein Data Bank (PDB) entries, increasing the number of structures classified in SCOP by 55% and including more than 65% of the protein structures in the PDB.
引用
收藏
页码:D304 / D309
页数:6
相关论文
共 11 条
[1]   Data growth and its impact on the SCOP database: new developments [J].
Andreeva, Antonina ;
Howorth, Dave ;
Chandonia, John-Marc ;
Brenner, Steven E. ;
Hubbard, Tim J. P. ;
Chothia, Cyrus ;
Murzin, Alexey G. .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D419-D425
[2]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[3]   The ASTRAL compendium for protein structure and sequence analysis [J].
Brenner, SE ;
Koehl, P ;
Levitt, R .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :254-256
[4]  
Brenner SE, 1996, METHOD ENZYMOL, V266, P635
[5]   The ASTRAL Compendium in 2004 [J].
Chandonia, JM ;
Hon, G ;
Walker, NS ;
Lo Conte, L ;
Koehl, P ;
Levitt, M ;
Brenner, SE .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D189-D192
[6]   ASTRAL compendium enhancements [J].
Chandonia, JM ;
Walker, NS ;
Conte, LL ;
Koehl, P ;
Levitt, M ;
Brenner, SE .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :260-263
[7]   SCOPmap: Automated assignment of protein structures to evolutionary superfamilies [J].
Cheek, S ;
Qi, Y ;
Krishna, SS ;
Kinch, LN ;
Grishin, NV .
BMC BIOINFORMATICS, 2004, 5 (1)
[8]   Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structure [J].
Gough, J ;
Karplus, K ;
Hughey, R ;
Chothia, C .
JOURNAL OF MOLECULAR BIOLOGY, 2001, 313 (04) :903-919
[9]  
Lo Conte L, 2002, NUCLEIC ACIDS RES, V30, P264
[10]   SCOP - A STRUCTURAL CLASSIFICATION OF PROTEINS DATABASE FOR THE INVESTIGATION OF SEQUENCES AND STRUCTURES [J].
MURZIN, AG ;
BRENNER, SE ;
HUBBARD, T ;
CHOTHIA, C .
JOURNAL OF MOLECULAR BIOLOGY, 1995, 247 (04) :536-540