Improving sequence-based fold recognition by using 3D model quality assessment

被引:49
作者
Pettitt, CS [1 ]
McGuffin, LJ [1 ]
Jones, DT [1 ]
机构
[1] UCL, Dept Comp Sci, Bioinformat Unit, London WC1E 6BT, England
基金
英国生物技术与生命科学研究理事会;
关键词
D O I
10.1093/bioinformatics/bti540
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The ability of a simple method (MODCHECK) to determine the sequence-structure compatibility of a set of structural models generated by fold recognition is tested in a thorough benchmark analysis. Four Model Quality Assessment Programs (MQAPs) were tested on 188 targets from the latest LiveBench-9 automated structure evaluation experiment. We systematically test and evaluate whether the MQAP methods can successfully detect native-likemodels. Results: We show that compared with the other three methods tested MODCHECK is the most reliable method for consistently performing the best top model selection and for ranking the models. In addition, we show that the choice of model similarity score used to assess a model's similarity to the experimental structure can influence the overall performance of these tools. Although these MQAP methods fail to improve the model selection performance for methods that already incorporate protein three dimension (3D) structural information, an improvement is observed for methods that are purely sequence-based, including the best profile-profile methods. This suggests that even the best sequence-based fold recognition methods can still be improved by taking into account the 3D structural information.
引用
收藏
页码:3509 / 3515
页数:7
相关论文
共 28 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   LiveBench-1: Continuous benchmarking of protein structure prediction servers [J].
Bujnicki, JM ;
Elofsson, A ;
Fischer, D ;
Rychlewski, L .
PROTEIN SCIENCE, 2001, 10 (02) :352-361
[3]   A study of quality measures for protein threading models [J].
Cristobal, Susana ;
Zemla, Adam ;
Fischer, Daniel ;
Rychlewski, Leszek ;
Elofsson, Arne .
BMC BIOINFORMATICS, 2001, 2 (1)
[4]   ORFeus: detection of distant homology using sequence profiles and predicted secondary structure [J].
Ginalski, K ;
Pas, J ;
Wyrwicz, LS ;
von Grotthuss, M ;
Bujnicki, JM ;
Rychlewski, L .
NUCLEIC ACIDS RESEARCH, 2003, 31 (13) :3804-3807
[5]   IDENTIFICATION OF NATIVE PROTEIN FOLDS AMONGST A LARGE NUMBER OF INCORRECT MODELS - THE CALCULATION OF LOW-ENERGY CONFORMATIONS FROM POTENTIALS OF MEAN FORCE [J].
HENDLICH, M ;
LACKNER, P ;
WEITCKUS, S ;
FLOECKNER, H ;
FROSCHAUER, R ;
GOTTSBACHER, K ;
CASARI, G ;
SIPPL, MJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 216 (01) :167-180
[6]   EVALUATION OF PROTEIN MODELS BY ATOMIC SOLVATION PREFERENCE [J].
HOLM, L ;
SANDER, C .
JOURNAL OF MOLECULAR BIOLOGY, 1992, 225 (01) :93-105
[7]   Improving the quality of twilight-zone alignments [J].
Jaroszewski, L ;
Rychlewski, L ;
Godzik, A .
PROTEIN SCIENCE, 2000, 9 (08) :1487-1496
[8]   GenTHREADER: An efficient and reliable protein fold recognition method for genomic sequences [J].
Jones, DT .
JOURNAL OF MOLECULAR BIOLOGY, 1999, 287 (04) :797-815
[9]   A NEW APPROACH TO PROTEIN FOLD RECOGNITION [J].
JONES, DT ;
TAYLOR, WR ;
THORNTON, JM .
NATURE, 1992, 358 (6381) :86-89
[10]   Hidden Markov models for detecting remote protein homologies [J].
Karplus, K ;
Barrett, C ;
Hughey, R .
BIOINFORMATICS, 1998, 14 (10) :846-856