Have we seen all structures corresponding to short protein fragments in the Protein Data Bank? An update

被引:41
作者
Du, PC
Andrec, M
Levy, RM
机构
[1] Rutgers State Univ, Dept Chem & Biol Chem, Piscataway, NJ 08854 USA
[2] Rutgers State Univ, BioMaPS Inst, Piscataway, NJ 08854 USA
来源
PROTEIN ENGINEERING | 2003年 / 16卷 / 06期
关键词
database; fold; loop modeling; protein fragment; protein structure;
D O I
10.1093/protein/gzg052
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Assembling short fragments from known structures has been a widely used approach to construct novel protein structures. To what extent there exist structurally similar fragments in the database of known structures for short fragments of a novel protein is a question that is fundamental to this approach. This work addresses that question for seven-, nine- and 15-residue fragments. For each fragment size, two databases, a query database and a template database of fragments from high-quality protein structures in SCOP20 and SCOP90, respectively, were constructed. For each fragment in the query database, the template database was scanned to find the lowest r.m.s.d. fragment among non-homologous structures. For seven-residue fragments, there is a 99% probability that there exists such a fragment within 0.7 Angstrom r.m.s.d. for each loop fragment. For nine-residue fragments there is a 96% probability of a fragment within 1 Angstrom r.m.s.d., while for 15-residue fragments there is a 91% probability of a fragment within 2 Angstrom r.m.s.d.. These results, which update previous studies, show that there exists sufficient coverage to model even a novel fold using fragments from the Protein Data Bank, as the current database of known structures has increased enormously in the last few years. We have also explored the use of a grid search method for loop homology modeling and make some observations about the use of a grid search compared with a database search for the loop modeling problem.
引用
收藏
页码:407 / 414
页数:8
相关论文
共 33 条
[1]   Protein backbone structure determination using only residual dipolar couplings from one ordering medium [J].
Andrec, M ;
Du, PC ;
Levy, RM .
JOURNAL OF BIOMOLECULAR NMR, 2001, 21 (04) :335-347
[2]  
Andrec Michael, 2002, Journal of Structural and Functional Genomics, V2, P103, DOI 10.1023/A:1020435630054
[3]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[4]   The ASTRAL compendium for protein structure and sequence analysis [J].
Brenner, SE ;
Koehl, P ;
Levitt, R .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :254-256
[5]   Prediction of local structure in proteins using a library of sequence-structure motifs [J].
Bystroff, C ;
Baker, D .
JOURNAL OF MOLECULAR BIOLOGY, 1998, 281 (03) :565-577
[6]   ASTRAL compendium enhancements [J].
Chandonia, JM ;
Walker, NS ;
Conte, LL ;
Koehl, P ;
Levitt, M ;
Brenner, SE .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :260-263
[7]   CONFORMATION OF TWISTED BETA-PLEATED SHEETS IN PROTEINS [J].
CHOTHIA, C .
JOURNAL OF MOLECULAR BIOLOGY, 1973, 75 (02) :295-302
[8]   Protein backbone angle restraints from searching a database for chemical shift and sequence homology [J].
Cornilescu, G ;
Delaglio, F ;
Bax, A .
JOURNAL OF BIOMOLECULAR NMR, 1999, 13 (03) :289-302
[9]   THE BUILDING OF PROTEIN STRUCTURES FROM ALPHA-CARBON COORDINATES [J].
CORREA, PE .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1990, 7 (04) :366-377
[10]  
Deane CM, 2000, PROTEINS, V40, P135, DOI 10.1002/(SICI)1097-0134(20000701)40:1<135::AID-PROT150>3.3.CO