The challenge of protein structure determination - lessons from structural genomics

Cited by: 101
Authors
Slabinski, Lukasz
Jaroszewski, Lukasz
Rodrigues, Ana P. C.
Rychlewski, Leszek
Wilson, Ian A.
Lesley, Scott A.
Godzik, Adam
Affiliations
[1] Burnham Inst Med Res, Joint Ctr Struct Genom, La Jolla, CA 92037 USA
[2] BioInfoBank Inst, PL-60744 Poznan, Poland
[3] Burnham Inst Med Res, Joint Ctr Mol Modeling, La Jolla, CA 92037 USA
[4] Scripps Res Inst, Joint Ctr Struct Genom, La Jolla, CA 92037 USA
[5] Novartis Res Fdn, Genom Inst, Joint Ctr Struct Genom, San Diego, CA 92121 USA
Keywords
X-ray crystallography; protein crystallization; protein structure initiative; structural genomics; target selection;
DOI
10.1110/ps.073037907
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Subject classification code
071010 ; 081704 ;
Abstract
The process of experimental determination of protein structure is marred with a high ratio of failures at many stages. With availability of large quantities of data from high-throughput structure determination in structural genomics centers, we can now learn to recognize protein features correlated with failures; thus, we can recognize proteins more likely to succeed and eventually learn how to modify those that are less likely to succeed. Here, we identify several protein features that correlate strongly with successful protein production and crystallization and combine them into a single score that assesses "crystallization feasibility." The formula derived here was tested with a jackknife procedure and validated on independent benchmark sets. The "crystallization feasibility" score described here is being applied to target selection in the Joint Center for Structural Genomics, and is now contributing to increasing the success rate, lowering the costs, and shortening the time for protein structure determination. Analyses of PDB depositions suggest that very similar features also play a role in non-high-throughput structure determination, suggesting that this crystallization feasibility score would also be of significant interest to structural biology, as well as to molecular and biochemistry laboratories.
Pages: 2472-2482
Number of pages: 11