Data mining crystallization databases: Knowledge-based approaches to optimize protein crystal screens

被引:72
作者
Kimber, MS
Vallee, F
Houston, S
Necakov, A
Skarina, T
Evdokimova, E
Beasley, S
Christendat, D
Savchenko, A
Arrowsmith, CH
Vedadi, M
Gerstein, M
Edwards, AM
机构
[1] Affinium Pharmaceut Inc, Toronto, ON M5J 1V6, Canada
[2] Univ Toronto, Ontario Canc Inst, Div Mol & Struct Biol, Toronto, ON, Canada
[3] Univ Toronto, Dept Med Biophys, Toronto, ON, Canada
[4] Yale Univ, New Haven, CT USA
来源
PROTEINS-STRUCTURE FUNCTION AND GENETICS | 2003年 / 51卷 / 04期
关键词
crystal screening; crystallization; data mining; structural proteomics;
D O I
10.1002/prot.10340
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Protein crystallization is a major bottleneck in protein X-ray crystallography, the workhorse of most structural proteomics projects. Because the principles that govern protein crystallization are too poorly understood to allow them to be used in a strongly predictive sense, the most common crystallization strategy entails screening a wide variety of solution conditions to identify the small subset that will support crystal nucleation and growth. We tested the hypothesis that more efficient crystallization strategies could be formulated by extracting useful patterns and correlations from the large data sets of crystallization trials created in structural proteomics projects. A database of crystallization conditions was constructed for 755 different proteins purified and crystallized under uniform conditions. Forty-five percent of the proteins formed crystals. Data mining identified the conditions that crystallize the most proteins, revealed that many conditions are highly correlated in their behavior, and showed that the crystallization success rate is markedly dependent on the organism from which proteins derive. Of the proteins that crystallized in a 48-condition experiment, 60% could be crystallized in as few as 6 conditions and 94% in 24 conditions. Consideration of the full range of information coming from crystal screening trials allows one to design screens that are maximally productive while consuming minimal resources, and also suggests further useful conditions for extending existing screens.
引用
收藏
页码:562 / 568
页数:7
相关论文
共 14 条
[1]  
CARTER CW, 1979, J BIOL CHEM, V254, P2219
[2]   PhyloDraw: a phylogenetic tree drawing system [J].
Choi, JH ;
Jung, HY ;
Kim, HS ;
Cho, HG .
BIOINFORMATICS, 2000, 16 (11) :1056-1058
[3]  
Christendat D, 2000, NAT STRUCT BIOL, V7, P903
[4]  
CUDNEY B, 1994, ACTA CRYSTALLOGR D, V50, P414, DOI 10.1107/S0907444994002660
[5]   BIOLOGICAL MACROMOLECULE CRYSTALLIZATION DATABASE, VERSION-3.0 - NEW FEATURES, DATA AND THE NASA ARCHIVE FOR PROTEIN CRYSTAL-GROWTH DATA [J].
GILLILAND, GL ;
TUNG, M ;
BLAKESLEE, DM ;
LADNER, JE .
ACTA CRYSTALLOGRAPHICA SECTION D-BIOLOGICAL CRYSTALLOGRAPHY, 1994, 50 :408-413
[6]  
JANKARIK J, 1991, J APPL CRYSTALLOGR, V24, P409
[7]   Multiple sequence alignment with Clustal x [J].
Jeanmougin, F ;
Thompson, JD ;
Gouy, M ;
Higgins, DG ;
Gibson, TJ .
TRENDS IN BIOCHEMICAL SCIENCES, 1998, 23 (10) :403-405
[8]   SEARCH DESIGNS FOR PROTEIN CRYSTALLIZATION BASED ON ORTHOGONAL ARRAYS [J].
KINGSTON, RL ;
BAKER, HM ;
BAKER, EN .
ACTA CRYSTALLOGRAPHICA SECTION D-BIOLOGICAL CRYSTALLOGRAPHY, 1994, 50 :429-440
[9]  
MCPHERSON A, 1999, CRYSTALLIZATION BIOL, P586
[10]  
Saridakis E, 2000, PROTEIN SCI, V9, P755