Definition and classification of evaluation units for CASP10

被引:18
作者
Taylor, Todd J. [1 ]
Tai, Chin-Hsien [1 ]
Huang, Yuanpeng J. [2 ]
Block, Jeremy [2 ]
Bai, Hongjun [1 ]
Kryshtafovych, Andriy [3 ]
Montelione, Gaetano T. [2 ]
Lee, Byungkook [1 ]
机构
[1] NCI, Mol Biol Lab, Ctr Canc Res, NIH, Bethesda, MD 20892 USA
[2] Rutgers State Univ, Robert Wood Johnson Med Sch, Northeast Struct Genom Consortium, Ctr Adv Biotechnol & Med,Dept Mol Biol & Biochem, Piscataway, NJ 08854 USA
[3] Univ Calif Davis, Genome Ctr, Davis, CA 95616 USA
关键词
PROTEIN STRUCTURES; TARGET CLASSIFICATION; STRUCTURE ALIGNMENT; DOMAIN DEFINITION; GENERATION; PRECISION; ALGORITHM; ACCURACY; PROGRAM; SERVER;
D O I
10.1002/prot.24434
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
For the 10th experiment on Critical Assessment of the techniques of protein Structure Prediction (CASP), the prediction target proteins were broken into independent evaluation units (EUs), which were then classified into template-based modeling (TBM) or free modeling (FM) categories. We describe here how the EUs were defined and classified, what issues arose in the process, and how we resolved them. EUs are frequently not the whole target proteins but the constituting structural domains. However, the assessors from CASP7 on combined more than one domain into 1 EU for some targets, which implied that the assessment also included evaluation of the prediction of the relative position and orientation of these domains. In CASP10, we followed and expanded this notion by defining multidomain EUs for a number of targets. These included 3 EUs, each made of two domains of familiar fold but arranged in a novel manner and for which the focus of evaluation was the interdomain arrangement. An EU was classified to the TBM category if a template could be found by sequence similarity searches and to FM if a structural template could not be found by structural similarity searches. The EUs that did not fall cleanly in either of these cases were classified case-by-case, often including consideration of the overall quality and characteristics of the predictions. © 2013 Wiley Periodicals, Inc.
引用
收藏
页码:14 / 25
页数:12
相关论文
共 33 条
[1]   PDP: protein domain parser [J].
Alexandrov, N ;
Shindyalov, I .
BIOINFORMATICS, 2003, 19 (03) :429-430
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[4]   Assessing model accuracy using the homology modeling automatically software [J].
Bhattacharya, Aneerban ;
Wunderlich, Zeba ;
Monleon, Daniel ;
Tejero, Roberto ;
Montelione, Gaetano T. .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2008, 70 (01) :105-118
[5]   Evaluating protein structures determined by structural genomics consortia [J].
Bhattacharya, Aneerban ;
Tejero, Roberto ;
Montelione, Gaetano T. .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2007, 66 (04) :778-795
[6]   KinImmerse: Macromolecular VR for NMR ensembles [J].
Block, Jeremy N. ;
Zielinski, David J. ;
Chen, Vincent B. ;
Davis, Ian W. ;
Vinson, Claire ;
Brady, Rachael ;
Richardson, Jane S. ;
Richardson, David C. .
SOURCE CODE FOR BIOLOGY AND MEDICINE, 2009, 4 (01)
[7]   KiNG (Kinemage, Next Generation): A versatile interactive molecular and scientific visualization program [J].
Chen, Vincent B. ;
Davis, Ian W. ;
Richardson, David C. .
PROTEIN SCIENCE, 2009, 18 (11) :2403-2409
[8]   Domain definition and target classification for CASP7 [J].
Clarke, Neil D. ;
Ezkurdia, Iakes ;
Kopp, Jurgen ;
Read, Randy J. ;
Schwede, Torsten ;
Tress, Michael .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2007, 69 :10-18
[9]   MolProbity: all-atom contacts and structure validation for proteins and nucleic acids [J].
Davis, Ian W. ;
Leaver-Fay, Andrew ;
Chen, Vincent B. ;
Block, Jeremy N. ;
Kapral, Gary J. ;
Wang, Xueyi ;
Murray, Laura W. ;
Arendall, W. Bryan, III ;
Snoeyink, Jack ;
Richardson, Jane S. ;
Richardson, David C. .
NUCLEIC ACIDS RESEARCH, 2007, 35 :W375-W383
[10]   Structural principles of Leucine-Rich repeat (LRR) proteins [J].
Enkhbayar, P ;
Kamiya, M ;
Osaki, M ;
Matsumoto, T ;
Matsushima, N .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2004, 54 (03) :394-403