Protein NMR recall, precision, and F-measure scores (RPF scores): Structure quality assessment measures based on information retrieval statistics

Cited by: 197
Authors
Huang, YJ
Powers, R
Montelione, GT [1]
Affiliations
[1] Rutgers State Univ, Ctr Adv Biotechnol & Med, Piscataway, NJ 08854 USA
[2] Rutgers State Univ, Dept Mol Biol & Biochem, Piscataway, NJ 08854 USA
[3] NE Struct Genom & Consortium, Piscataway, NJ 08854 USA
[4] Robert Wood Johnson Med Sch, Piscataway, NJ 08854 USA
[5] Univ Nebraska, Dept Chem, Lincoln, NE 68588 USA
DOI
10.1021/ja047109h
Chinese Library Classification number
O6 [Chemistry]
Discipline classification code
0703
Abstract
One of the most important challenges in modern protein NMR is the development of fast and sensitive structure quality assessment measures that can be used to evaluate the "goodness-of-fit" of the 3D structure with NOESY data, to indicate the correctness of the fold and accuracy of the resulting structure. Quality assessment is especially critical for automated NOESY interpretation and structure determination approaches. This paper describes new NMR quality assessment scores, including Recall, Precision, and F-measure scores (referred to here as "NMR RPF" scores), which quickly provide global measures of the goodness-of-fit of the 3D structures with NOESY peak lists using methods from information retrieval statistics. The sensitivity of the F-measure is improved using a scaled Fold Discriminating Power (DP) score. These statistical RPF scores are quite rapid to compute since NOE assignments and complete relaxation matrix calculations are not required. A graphical method for site-specific assessment of structure quality based on the Precision statistic is also described. These statistical measures are demonstrated to be valuable for assessing protein NMR structure accuracy. Their relationships to other proposed NMR "R-factors" and structure quality assessment scores are also discussed.
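For readers unfamiliar with the information retrieval statistics named in the abstract, the following is a minimal sketch of the generic recall, precision, and F-measure definitions applied to two sets of items. It is not the paper's procedure: the abstract does not give the exact definitions used for NOESY peak lists or for the DP score, so the function name `rpf_scores`, the proton-pair labels, and the set-based comparison are all illustrative assumptions.

```python
def rpf_scores(observed, predicted):
    """Generic recall, precision and F-measure between two sets.

    observed  -- items supported by the experimental data (hypothetical)
    predicted -- items implied by the structure model (hypothetical)
    """
    observed, predicted = set(observed), set(predicted)
    true_pos = len(observed & predicted)  # items present in both sets

    # Recall: fraction of observed items that the model explains.
    recall = true_pos / len(observed) if observed else 0.0
    # Precision: fraction of model-implied items that are observed.
    precision = true_pos / len(predicted) if predicted else 0.0
    # F-measure: harmonic mean of recall and precision.
    denom = recall + precision
    f_measure = 2 * recall * precision / denom if denom else 0.0
    return recall, precision, f_measure


# Made-up proton-pair labels, purely for illustration.
observed = {("HA_5", "HN_6"), ("HB_12", "HN_40"),
            ("HA_7", "HN_8"), ("HG_3", "HD_30")}
predicted = {("HA_5", "HN_6"), ("HA_7", "HN_8"), ("HB_9", "HN_10")}

recall, precision, f_measure = rpf_scores(observed, predicted)
print(f"recall={recall:.3f} precision={precision:.3f} F={f_measure:.3f}")
```

In this toy case two of the four observed pairs are explained by the model and two of the three model-implied pairs are observed, so a low recall would flag unexplained data while a low precision would flag model contacts without experimental support.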
Pages: 1665-1674
Page count: 10