Protein NMR recall, precision, and F-measure scores (RPF scores): Structure quality assessment measures based on information retrieval statistics

Cited by: 197
Authors
Huang, YJ
Powers, R
Montelione, GT [1]
Affiliations
[1] Rutgers State Univ, Ctr Adv Biotechnol & Med, Piscataway, NJ 08854 USA
[2] Rutgers State Univ, Dept Mol Biol & Biochem, Piscataway, NJ 08854 USA
[3] NE Struct Genom & Consortium, Piscataway, NJ 08854 USA
[4] Robert Wood Johnson Med Sch, Piscataway, NJ 08854 USA
[5] Univ Nebraska, Dept Chem, Lincoln, NE 68588 USA
DOI
10.1021/ja047109h
Chinese Library Classification (CLC) number
O6 [Chemistry]
Discipline classification code
0703
Abstract
One of the most important challenges in modern protein NMR is the development of fast and sensitive structure quality assessment measures that can be used to evaluate the "goodness-of-fit" of the 3D structure with NOESY data, to indicate the correctness of the fold and accuracy of the resulting structure. Quality assessment is especially critical for automated NOESY interpretation and structure determination approaches. This paper describes new NMR quality assessment scores, including Recall, Precision, and F-measure scores (referred to here as "NMR RPF" scores), which quickly provide global measures of the goodness-of-fit of the 3D structures with NOESY peak lists using methods from information retrieval statistics. The sensitivity of the F-measure is improved using a scaled Fold Discriminating Power (DP) score. These statistical RPF scores are quite rapid to compute since NOE assignments and complete relaxation matrix calculations are not required. A graphical method for site-specific assessment of structure quality based on the Precision statistic is also described. These statistical measures are demonstrated to be valuable for assessing protein NMR structure accuracy. Their relationships to other proposed NMR "R-factors" and structure quality assessment scores are also discussed.
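As a rough illustration of the information-retrieval statistics the abstract refers to, the sketch below computes recall, precision, and F-measure between two sets of proton-pair contacts. It uses the generic textbook definitions only; the function name `rpf_scores`, the set-of-pairs representation, and the example data are assumptions made for illustration, and the paper's actual treatment of NOESY peak lists and its DP score is not reproduced here.

```python
def rpf_scores(observed, predicted):
    """Generic recall, precision, and F-measure for two collections of items.

    observed  -- contacts supported by the experimental data (hypothetical input)
    predicted -- contacts expected from the 3D structure (hypothetical input)
    """
    observed, predicted = set(observed), set(predicted)
    matched = observed & predicted  # items found in both sets

    # Recall: fraction of the observed items that the structure accounts for.
    recall = len(matched) / len(observed) if observed else 0.0
    # Precision: fraction of the structure's items that the data support.
    precision = len(matched) / len(predicted) if predicted else 0.0
    # F-measure: harmonic mean of recall and precision.
    total = recall + precision
    f_measure = 2 * recall * precision / total if total else 0.0
    return recall, precision, f_measure


# Made-up proton-pair labels, for illustration only.
observed_pairs = {("HA3", "HN7"), ("HB5", "HN6"), ("HA10", "HN11"), ("HG2", "HD9")}
predicted_pairs = {("HB5", "HN6"), ("HA10", "HN11"), ("HG2", "HD9"),
                   ("HA1", "HN2"), ("HB4", "HN8")}

recall, precision, f_measure = rpf_scores(observed_pairs, predicted_pairs)
print(f"recall={recall:.3f} precision={precision:.3f} F={f_measure:.3f}")
```

Both scores stay in the range 0 to 1, and the harmonic mean penalizes a structure that does well on one measure but poorly on the other.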
Pages: 1665-1674
Number of pages: 10