Re-clustering the database for crystallization of macromolecules

被引:15
作者
Farr, RG [1 ]
Perryman, AL [1 ]
Samudzi, CT [1 ]
机构
[1] Univ Missouri, Dept Biochem, Columbia, MO 65211 USA
关键词
statistical analysis; crystallization conditions; crystal growth;
D O I
10.1016/S0022-0248(97)00492-2
中图分类号
O7 [晶体学];
学科分类号
0702 ; 070205 ; 0703 ; 080501 ;
摘要
The current version of the biological macromolecule crystallization database (BMCD version 3.0) was statistically analyzed using clustering techniques. This is an effort to look for trends that may be useful in the crystallization of new macromolecules. Our previous statistical analysis of the BMCD was performed on version 1.0 [C.T. Samudzi, M.J. Fivash, J.M. Rosenberg, J. Crystal Growth 123 (1992) 47]. That database contained information on a total of 1025 crystallization experiments for 820 biological macromolecules (about 35% of those entries were incomplete and, thus, inappropriate for analysis). Version 3.0 of the BMCD is more than 90% complete and contains information on a total of about 2300 crystallization experiments for approximately 1500 biological macromolecules [G.L. Gilliland, M. Tung, D.M. Bakerslee, J.E. Ladner, Acta Cryst. D 50 (1994) 408]. With significantly more data in the BMCD, the question is whether trends have changed. The SAS software [SAS Institute Inc., SAS/STAT, Version 6, 4th ed., vol. 1] was used throughout the analysis. The following crystallization parameters were used in defining an experiment: pH, temperature, molecular weight, macromolecular concentration, precipitant type and crystallization method. Using these parameters, a measure of the differences between experiments was developed. Groups or clusters of similar experiments were identified as those close together based upon this difference measure. The database was successfully resolved into 25 clusters. The pseudo-F statistic for 25 clusters was 306.30 and is statistically significant (p < 0.0001). Although eight of these clusters can be treated as outliers, the other 17 clusters provide useful information in recognizing new patterns and developing strategies for crystallization of macromolecules. (C) 1998 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:653 / 668
页数:16
相关论文
共 14 条
[1]  
CARTER CW, 1979, J BIOL CHEM, V254, P2219
[2]   DETERMINATION OF LOCAL REFRACTIVE-INDEX FOR PROTEIN AND VIRUS CRYSTALS IN SOLUTION BY MACH-ZEHNDER INTERFEROMETRY [J].
COLE, T ;
KATHMAN, A ;
KOSZELAK, S ;
MCPHERSON, A .
ANALYTICAL BIOCHEMISTRY, 1995, 231 (01) :92-98
[3]   Protein crystallization [J].
Durbin, SD ;
Feher, G .
ANNUAL REVIEW OF PHYSICAL CHEMISTRY, 1996, 47 :171-204
[4]   BIOLOGICAL MACROMOLECULE CRYSTALLIZATION DATABASE, VERSION-3.0 - NEW FEATURES, DATA AND THE NASA ARCHIVE FOR PROTEIN CRYSTAL-GROWTH DATA [J].
GILLILAND, GL ;
TUNG, M ;
BLAKESLEE, DM ;
LADNER, JE .
ACTA CRYSTALLOGRAPHICA SECTION D-BIOLOGICAL CRYSTALLOGRAPHY, 1994, 50 :408-413
[6]  
HENNESSY D, 1994, ISMB, V94, P179
[7]  
KLEINBAUM DG, 1978, APPLIED REGRESSION A, P235
[8]   MECHANISMS OF GROWTH FOR PROTEIN AND VIRUS CRYSTALS [J].
MALKIN, AJ ;
KUZNETSOV, YG ;
LAND, TA ;
DEYOREO, JJ ;
MCPHERSON, A .
NATURE STRUCTURAL BIOLOGY, 1995, 2 (11) :956-959
[9]  
McPherson A., 1982, PREPARATION ANAL PRO
[10]  
MCPHERSON A, 1991, CRYSTALLIZATION MEMB, P1