Data growth and its impact on the SCOP database: new developments

被引:715
作者
Andreeva, Antonina [1 ]
Howorth, Dave [1 ]
Chandonia, John-Marc [2 ,3 ]
Brenner, Steven E. [2 ]
Hubbard, Tim J. P. [4 ]
Chothia, Cyrus [5 ]
Murzin, Alexey G. [1 ]
机构
[1] MRC Ctr Protien Engn, Cambridge CB2 0QH, England
[2] Univ Calif Berkeley, Dept Plant & Microbial Biol, Berkeley, CA 94720 USA
[3] Berkeley Natl Lab, Phys Biosci Div, Berkeley, CA 94720 USA
[4] Wellcome Trust Sanger Inst, Cambridge CB10 1SA, England
[5] MRC Lab Mol Biol, Cambridge CB2 0QH, England
基金
英国医学研究理事会; 英国惠康基金;
关键词
D O I
10.1093/nar/gkm993
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The Structural Classification of Proteins (SCOP) database is a comprehensive ordering of all proteins of known structure, according to their evolutionary and structural relationships. The SCOP hierarchy comprises the following levels: Species, Protein, Family, Superfamily, Fold and Class. While keeping the original classification scheme intact, we have changed the production of SCOP in order to cope with a rapid growth of new structural data and to facilitate the discovery of new protein relationships. We describe ongoing developments and new features implemented in SCOP. A new update protocol supports batch classification of new protein structures by their detected relationships at Family and Superfamily levels in contrast to our previous sequential handling of new structural data by release date. We introduce pre-SCOP, a preview of the SCOP developmental version that enables earlier access to the information on new relationships. We also discuss the impact of worldwide Structural Genomics initiatives, which are producing new protein structures at an increasing rate, on the rates of discovery and growth of protein families and superfamilies. SCOP can be accessed at http://scop.mrc-lmb.cam.ac.uk/scop.
引用
收藏
页码:D419 / D425
页数:7
相关论文
共 18 条
  • [1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [2] SCOP database in 2004: refinements integrate structure and sequence family data
    Andreeva, A
    Howorth, D
    Brenner, SE
    Hubbard, TJP
    Chothia, C
    Murzin, AG
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 : D226 - D229
  • [3] SISYPHUS - structural alignments for proteins with non-trivial relationships
    Andreeva, Antonina
    Prlic, Andreas
    Hubbard, Tim J. P.
    Murzin, Alexey G.
    [J]. NUCLEIC ACIDS RESEARCH, 2007, 35 : D253 - D259
  • [4] Evolution of protein fold in the presence of functional constraints
    Andreeva, Antonina
    Murzin, Alexey G.
    [J]. CURRENT OPINION IN STRUCTURAL BIOLOGY, 2006, 16 (03) : 399 - 408
  • [5] The universal protein resource (UniProt)
    Bairoch, Amos
    Bougueleret, Lydie
    Altairac, Severine
    Amendolia, Valeria
    Auchincloss, Andrea
    Puy, Ghislaine Argoud
    Axelsen, Kristian
    Baratin, Delphine
    Blatter, Marie-Claude
    Boeckmann, Brigitte
    Bollondi, Laurent
    Boutet, Emmanuel
    Quintaje, Silvia Braconi
    Breuza, Lionel
    Bridge, Alan
    deCastro, Edouard
    Coral, Danielle
    Coudert, Elisabeth
    Cusin, Isabelle
    Dobrokhotov, Pavel
    Dornevil, Dolnide
    Duvaud, Severine
    Estreicher, Anne
    Famiglietti, Livia
    Feuermann, Marc
    Gehant, Sebastian
    Farriol-Mathis, Nathalie
    Ferro, Serenella
    Gasteiger, Elisabeth
    Gateau, Alain
    Gerritsen, Vivienne
    Gos, Arnaud
    Gruaz-Gumowski, Nadine
    Hinz, Ursula
    Hulo, Chantal
    Hulo, Nicolas
    Ioannidis, Vassilios
    Ivanyi, Ivan
    James, Janet
    Jain, Eric
    Jimenez, Silvia
    Jungo, Florence
    Junker, Vivien
    Keller, Guillaume
    Lachaize, Corinne
    Lane-Guermonprez, Lydie
    Langendijk-Genevaux, Petra
    Lara, Vicente
    Lemercier, Philippe
    Le Saux, Virginie
    [J]. NUCLEIC ACIDS RESEARCH, 2007, 35 : D193 - D197
  • [6] The Protein Data Bank
    Berman, HM
    Westbrook, J
    Feng, Z
    Gilliland, G
    Bhat, TN
    Weissig, H
    Shindyalov, IN
    Bourne, PE
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 235 - 242
  • [7] The ASTRAL Compendium in 2004
    Chandonia, JM
    Hon, G
    Walker, NS
    Lo Conte, L
    Koehl, P
    Levitt, M
    Brenner, SE
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 : D189 - D192
  • [8] The impact of structural genomics: Expectations and outcomes
    Chandonia, JM
    Brenner, SE
    [J]. SCIENCE, 2006, 311 (5759) : 347 - 351
  • [9] DONG A, 2004, PDB ID 1T1J CRYSTAL
  • [10] Pfam:: clans, web tools and services
    Finn, Robert D.
    Mistry, Jaina
    Schuster-Bockler, Benjamin
    Griffiths-Jones, Sam
    Hollich, Volker
    Lassmann, Timo
    Moxon, Simon
    Marshall, Mhairi
    Khanna, Ajay
    Durbin, Richard
    Eddy, Sean R.
    Sonnhammer, Erik L. L.
    Bateman, Alex
    [J]. NUCLEIC ACIDS RESEARCH, 2006, 34 : D247 - D251