Assessment of predictions in the model quality assessment category

被引:86
作者
Cozzetto, Domenico
Kryshtafovych, Andriy
Ceriani, Michele
Tramontano, Anna
机构
[1] Univ Roma La Sapienza, Dept Biochem Sci Rossi Fanelli, I-00185 Rome, Italy
[2] Univ Calif Davis, Genome Ctr, Prot Struct Predict Ctr, Davis, CA 95616 USA
[3] Univ Roma La Sapienza, Fdn Cenci Bolognetti, Ist Pasteur, I-00185 Rome, Italy
关键词
CASP; protein structure prediction; model quality assessment;
D O I
10.1002/prot.21669
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The article presents our evaluation of the predictions submitted to the model quality assessment (QA) category in CASP7. In this newly introduced category, predictors were asked to provide quality estimates for protein structure models. The QA category uses the automatically produced models that are traditionally distributed to CASP participants as input for predictions. Predictors were asked to provide an index of the quality of these individual models (QM1) as well as an index for the expected correctness of each of their residues (QM2). We computed the correlation between the observed and predicted quality of the models and of the individual residues achieved by the participating groups and evaluated the statistical significance of the differences. We also compared the results with those obtained by a "naive predictor" that assigns a quality score related to how close the model is to the structure of the most similar protein of known structure. The aims of a method for assessing the overall quality of a model can be twofold: selecting the best (or one of the best) model(s) among a set of plausible choices, or assigning a nonrelative quality value to an individual model. The applications of the two strategies are different, albeit equally important. Our assessment of the QA category demonstrates that methods for addressing the first task effectively do exist, while there is room for improvement as far as the second aspect is concerned. Notwithstanding the limited number of groups submitting predictions for residue-level accuracy, our data demonstrate that a respectable accuracy in this task can be achieved by methods relying on the comparison of different models for the same target.
引用
收藏
页码:175 / 183
页数:9
相关论文
共 25 条
  • [1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [2] Anderson TW., 2003, INTRO MULTIVARIATE S
  • [3] Protein structure prediction and structural genomics
    Baker, D
    Sali, A
    [J]. SCIENCE, 2001, 294 (5540) : 93 - 96
  • [4] The PMDB Protein Model Database
    Castrignano, Tiziana
    De Meo, Paolo D'Onorio
    Cozzetto, Domenico
    Talamo, Ivano Giuseppe
    Tramontano, Anna
    [J]. NUCLEIC ACIDS RESEARCH, 2006, 34 : D306 - D309
  • [5] THE RELATION BETWEEN THE DIVERGENCE OF SEQUENCE AND STRUCTURE IN PROTEINS
    CHOTHIA, C
    LESK, AM
    [J]. EMBO JOURNAL, 1986, 5 (04) : 823 - 826
  • [6] Relationship between multiple sequence alignments and quality of protein comparative models
    Cozzetto, D
    Tramontano, A
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 58 (01) : 151 - 157
  • [7] Evaluating the usefulness of protein structure models for molecular replacement
    Giorgetti, A
    Raimondo, D
    Miele, AE
    Tramontano, A
    [J]. BIOINFORMATICS, 2005, 21 : 72 - 76
  • [8] Assessment of CASP7 structure predictions for template free targets
    Jauch, Ralf
    Yeo, Hock Chuan
    Kolatkar, Prasanna R.
    Clarke, Neil D.
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2007, 69 : 57 - 67
  • [9] The SWISS-MODEL repository: new features and functionalities
    Kopp, Juergen
    Schwede, Torsten
    [J]. NUCLEIC ACIDS RESEARCH, 2006, 34 : D315 - D318
  • [10] Assessment of CASP7 predictions for template-based modeling targets
    Kopp, Jurgen
    Bordoli, Lorenza
    Battey, James N. D.
    Kiefer, Florian
    Schwede, Torsten
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2007, 69 : 38 - 56