Better models by discarding data?

被引:196
作者
Diederichs, K. [1 ]
Karplus, P. A. [2 ]
机构
[1] Univ Konstanz, Fac Biol, D-78457 Constance, Germany
[2] Oregon State Univ, Dept Biochem & Biophys, Corvallis, OR 97331 USA
来源
ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY | 2013年 / 69卷
关键词
D O I
10.1107/S0907444913001121
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
In macromolecular X-ray crystallography, typical data sets have substantial multiplicity. This can be used to calculate the consistency of repeated measurements and thereby assess data quality. Recently, the properties of a correlation coefficient, CC1/2, that can be used for this purpose were characterized and it was shown that CC1/2 has superior properties compared with 'merging' R values. A derived quantity, CC*, links data and model quality. Using experimental data sets, the behaviour of CC1/2 and the more conventional indicators were compared in two situations of practical importance: merging data sets from different crystals and selectively rejecting weak observations or (merged) unique reflections from a data set. In these situations controlled 'paired-refinement' tests show that even though discarding the weaker data leads to improvements in the merging R values, the refined models based on these data are of lower quality. These results show the folly of such data-filtering practices aimed at improving the merging R values. Interestingly, in all of these tests CC1/2 is the one data-quality indicator for which the behaviour accurately reflects which of the alternative data-handling strategies results in the best-quality refined model. Its properties in the presence of systematic error are documented and discussed.
引用
收藏
页码:1215 / 1222
页数:8
相关论文
共 17 条
[1]   PHENIX: a comprehensive Python']Python-based system for macromolecular structure solution [J].
Adams, Paul D. ;
Afonine, Pavel V. ;
Bunkoczi, Gabor ;
Chen, Vincent B. ;
Davis, Ian W. ;
Echols, Nathaniel ;
Headd, Jeffrey J. ;
Hung, Li-Wei ;
Kapral, Gary J. ;
Grosse-Kunstleve, Ralf W. ;
McCoy, Airlie J. ;
Moriarty, Nigel W. ;
Oeffner, Robert ;
Read, Randy J. ;
Richardson, David C. ;
Richardson, Jane S. ;
Terwilliger, Thomas C. ;
Zwart, Peter H. .
ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY, 2010, 66 :213-221
[2]   A COMPUTER-LINKED CATHODE-RAY TUBE MICRODENSITOMETER FOR X-RAY CRYSTALLOGRAPHY [J].
ARNDT, UW ;
CROWTHER, RA ;
MALLETT, JFW .
JOURNAL OF PHYSICS E-SCIENTIFIC INSTRUMENTS, 1968, 1 (05) :510-&
[3]   Improved R-factors for diffraction data analysis in macromolecular crystallography [J].
Diederichs, K ;
Karplus, PA .
NATURE STRUCTURAL BIOLOGY, 1997, 4 (04) :269-275
[4]  
Diederichs K., 2013, ADV METHODS BIOMOLEC
[5]   An introduction to data reduction: space-group determination, scaling and intensity statistics [J].
Evans, Philip R. .
ACTA CRYSTALLOGRAPHICA SECTION D-BIOLOGICAL CRYSTALLOGRAPHY, 2011, 67 :282-292
[6]   TREATMENT OF NEGATIVE INTENSITY OBSERVATIONS [J].
FRENCH, S ;
WILSON, K .
ACTA CRYSTALLOGRAPHICA SECTION A, 1978, 34 (JUL) :517-525
[7]   The application of hierarchical cluster analysis to the selection of isomorphous crystals [J].
Giordano, Rita ;
Leal, Ricardo M. F. ;
Bourenkov, Gleb P. ;
McSweeney, Sean ;
Popov, Alexander N. .
ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY, 2012, 68 :649-658
[8]   Integration, scaling, space-group assignment and post-refinement [J].
Kabsch, Wolfgang .
ACTA CRYSTALLOGRAPHICA SECTION D-BIOLOGICAL CRYSTALLOGRAPHY, 2010, 66 :133-144
[9]   XDS [J].
Kabsch, Wolfgang .
ACTA CRYSTALLOGRAPHICA SECTION D-BIOLOGICAL CRYSTALLOGRAPHY, 2010, 66 :125-132
[10]  
Karplus PA, 2012, SCIENCE, V336, P1030, DOI [10.1126/science.1218231, 10.1126/science.121823]