A supersecondary structure library and search algorithm for modeling loops in protein structures

Cited by: 48
Authors
Fernandez-Fuentes, Narcis
Oliva, Baldomero
Fiser, Andras
Affiliations
[1] Albert Einstein Coll Med, Dept Biochem, Bronx, NY 10461 USA
[2] Albert Einstein Coll Med, Seaver Fdn Ctr Bioinformat, Bronx, NY 10461 USA
[3] Univ Pompeu Fabra, Struct Bioinformat Grp, Barcelona 08003, Catalonia, Spain
DOI
10.1093/nar/gkl156
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
We present a fragment-search-based method for predicting loop conformations in protein models. A hierarchical, multidimensional database has been set up that currently classifies 105 950 loop fragments and their flanking secondary structures. Besides loop length and the types of bracing secondary structures, the database is organized along four internal coordinates (a distance and three types of angles) that characterize the geometry of the stem regions. Candidate fragments are selected from this library by matching the length and the types of bracing secondary structures of the query and by satisfying the geometrical restraints of the stems. They are then inserted into the query protein framework, where their fit is assessed by the root mean square deviation (r.m.s.d.) of the stem regions and by the number of rigid body clashes with the environment. In the final step, the remaining candidate loops are ranked by a Z-score that combines information on sequence similarity and on the fit of predicted and observed phi/psi main chain dihedral angle propensities. Confidence Z-score cut-offs were determined for each loop length to identify those predicted fragments that outperform a competitive ab initio method. A web server implements the method, regularly updates the fragment library and performs predictions. Predicted segments are returned as is or, optionally, completed with side chain reconstruction and subsequently annealed in the environment of the query protein by conjugate gradient minimization. The prediction method was tested on artificially prepared search datasets from which all trivial sequence similarities at the SCOP superfamily level were removed. Under these conditions it is possible to predict loops of length 4, 8 and 12 with coverage of 98, 78 and 28% and an r.m.s.d. accuracy of at least 0.22, 1.38 and 2.47 Å, respectively. In a head-to-head comparison on loops extracted from freshly deposited new protein folds, the current method outperformed an earlier database search method in a ratio of approximately 5:1.
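The filter-then-rank pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the `Candidate` fields, the cut-off values (`max_stem_rmsd`, `max_clashes`) and the equal-weight sum of two per-term Z-scores are all assumptions chosen for illustration, since the abstract does not state how the sequence and dihedral terms are combined.

```python
from dataclasses import dataclass
from statistics import mean, pstdev


@dataclass
class Candidate:
    # All fields are hypothetical stand-ins for quantities named in the abstract.
    name: str
    stem_rmsd: float       # r.m.s.d. of stem regions after insertion, in angstroms
    clashes: int           # rigid body clashes with the environment
    seq_score: float       # sequence-similarity score (higher is better)
    dihedral_score: float  # phi/psi propensity fit (higher is better)


def z_scores(values):
    """Standardize values against their own population mean and deviation."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        return [0.0] * len(values)
    return [(v - mu) / sigma for v in values]


def rank_candidates(candidates, max_stem_rmsd=1.0, max_clashes=0):
    """Drop poorly fitting fragments, then rank the rest by a combined Z-score.

    The cut-offs and the equal weighting of the two terms are assumptions.
    """
    kept = [c for c in candidates
            if c.stem_rmsd <= max_stem_rmsd and c.clashes <= max_clashes]
    if not kept:
        return []
    z_seq = z_scores([c.seq_score for c in kept])
    z_dih = z_scores([c.dihedral_score for c in kept])
    combined = [a + b for a, b in zip(z_seq, z_dih)]
    # Highest combined Z-score first.
    return sorted(zip(kept, combined), key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    pool = [
        Candidate("A", 0.5, 0, 10.0, 5.0),
        Candidate("B", 0.4, 0, 20.0, 7.0),
        Candidate("C", 2.0, 0, 100.0, 100.0),  # rejected: stem r.m.s.d. too high
        Candidate("D", 0.3, 3, 50.0, 50.0),    # rejected: clashes with environment
    ]
    for cand, z in rank_candidates(pool):
        print(cand.name, round(z, 2))
```

Filtering on stem fit and clashes before scoring keeps geometrically implausible fragments from influencing the Z-score statistics, which mirrors the order of steps given in the abstract.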
Pages: 2085-2097
Number of pages: 13