SPICKER:: A clustering approach to identify near-native protein folds

被引:317
作者
Zhang, Y [1 ]
Skolnick, J [1 ]
机构
[1] SUNY Buffalo, Ctr Excellence Bioinformat, Buffalo, NY 14203 USA
关键词
SPICKER; near-native folds; TASSER;
D O I
10.1002/jcc.20011
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
We have developed SPICKER, a simple and efficient strategy to identify near-native folds by clustering protein structures generated during computer simulations. In general, the most populated clusters tend to be closer to the native conformation than the lowest energy structures. To assess the generality of the approach, we applied SPICKER to 1489 representative benchmark proteins less than or equal to200 residues that cover the PDB at the level of 35% sequence identity; each contains up to 280,000 structure decoys generated using the recently developed TASSER (Threading ASSembly Refinement) algorithm. The best of the top five identified folds has a root-mean-square deviation from native (RMSD) in the top 1.4% of all decoys. For 78% of the proteins, the difference in RMSD from native to the identified models and RMSD from native to the absolutely best individual decoy is below 1 Angstrom the majority belong to the targets with converged conformational distributions. Although native fold identification from divergent decoy structures remains a challenge, our overall results show significant improvement over our previous Clustering algorithms. (C) 2004 Wiley Periodicals, Inc.
引用
收藏
页码:865 / 871
页数:7
相关论文
共 14 条
[1]  
Betancourt MR, 2001, J COMPUT CHEM, V22, P339, DOI 10.1002/1096-987X(200102)22:3<339::AID-JCC1006>3.0.CO
[2]  
2-R
[3]   Pathways to a protein folding intermediate observed in a 1-microsecond simulation in aqueous solution [J].
Duan, Y ;
Kollman, PA .
SCIENCE, 1998, 282 (5389) :740-744
[4]   RESPECTIVE ROLES OF SHORT-RANGE AND LONG-RANGE INTERACTIONS IN PROTEIN FOLDING [J].
GO, N ;
TAKETOMI, H .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1978, 75 (02) :559-563
[5]   Parallel tempering algorithm for conformational studies of biological molecules [J].
Hansmann, UHE .
CHEMICAL PHYSICS LETTERS, 1997, 281 (1-3) :140-150
[6]   Recent improvements in prediction of protein structure by global optimization of a potential energy function [J].
Pillardy, A ;
Czaplewski, C ;
Liwo, A ;
Lee, J ;
Ripoll, DR ;
Kazmierkiewicz, R ;
Oldziej, S ;
Wedemeyer, WJ ;
Gibson, KD ;
Arnautova, YA ;
Saunders, J ;
Ye, YJ ;
Scheraga, HA .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (05) :2329-2333
[7]   COMPARATIVE PROTEIN MODELING BY SATISFACTION OF SPATIAL RESTRAINTS [J].
SALI, A ;
BLUNDELL, TL .
JOURNAL OF MOLECULAR BIOLOGY, 1993, 234 (03) :779-815
[8]   Clustering of low-energy conformations near the native structures of small proteins [J].
Shortle, D ;
Simons, KT ;
Baker, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (19) :11158-11162
[9]   Prospects for ab initio protein structural genomics [J].
Simons, KT ;
Strauss, C ;
Baker, D .
JOURNAL OF MOLECULAR BIOLOGY, 2001, 306 (05) :1191-1199
[10]  
SKOLNICK J, 2004, IN PRESS PROTEIN