The NOESY JIGSAW: Automated protein secondary structure and main-chain assignment from sparse, unassigned NMR data

被引:67
作者
Bailey-Kellogg, C
Widge, A
Kelley, JJ
Berardi, MJ
Bushweller, JH
Donald, BR [1 ]
机构
[1] Dartmouth Comp Sci Dept, Sudikoff Lab 6211, Hanover, NH 03755 USA
[2] Dartmouth Chem Dept, Hanover, NH 03755 USA
[3] Univ Virginia, Charlottesville, VA 22906 USA
关键词
nuclear magnetic resonance spectroscopy; automated resonance assignment; structural genomics/proteomics; protein secondary structure; graph algorithms; probabilistic reasoning;
D O I
10.1089/106652700750050934
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
High-throughput, data-directed computational protocols for Structural Genomics (or Proteomics) are required in order to evaluate the protein products of genes for structure and function at rates comparable to current gene-sequencing technology. This paper presents the JIGSAW algorithm, a novel high-throughput, automated approach to protein structure characterization with nuclear magnetic resonance (NMR), JIGSAW applies graph algorithms and probabilistic reasoning techniques, enforcing first-principles consistency rules in order to overcome a 5-10% signal-to-noise ratio, It consists of two main components: (1) graph-based secondary structure pattern identification in unassigned heteronuclear NMR data, and (2) assignment of spectral peaks by probabilstic alignment of identified secondary structure elements against the primary sequence. Deferring assignment eliminates the bottleneck faced by traditional approaches, which begin by correlating peaks among dozens of experiments. JIGSAW utilizes only four experiments, none of which requires C-13-labeled protein, thus dramatically reducing both the amount and expense of wet lab molecular biology and the total spectrometer time, Results for three test proteins demonstrate that JIGSAW correctly identifies 79-100% of alpha -helical and 46-65% of beta -sheet NOE connectivities and correctly aligns 33-100% of secondary structure elements. JIGSAW is very fast, running in minutes on a Pentium-class Linux workstation, This approach yields quick and reasonably accurate (as opposed to the traditional slow and extremely accurate) structure calculations. It could be useful for quick structural assays to speed data to the biologist early in an investigation and could in principle be applied in an automation-like fashion to a large fraction of the proteome.
引用
收藏
页码:537 / 558
页数:22
相关论文
共 47 条
[1]  
[Anonymous], 2018, Protein nmr spectroscopy: principles and practice
[2]  
[Anonymous], [No title captured]
[3]   Enhanced protein fold recognition using secondary structure information from NMR [J].
Ayers, DJ ;
Gooley, PR ;
Widmer-Cooper, A ;
Torda, AE .
PROTEIN SCIENCE, 1999, 8 (05) :1127-1133
[4]  
Bartels C, 1997, J COMPUT CHEM, V18, P139, DOI 10.1002/(SICI)1096-987X(19970115)18:1<139::AID-JCC13>3.0.CO
[5]  
2-H
[6]   THE PROGRAM XEASY FOR COMPUTER-SUPPORTED NMR SPECTRAL-ANALYSIS OF BIOLOGICAL MACROMOLECULES [J].
BARTELS, C ;
XIA, TH ;
BILLETER, M ;
GUNTERT, P ;
WUTHRICH, K .
JOURNAL OF BIOMOLECULAR NMR, 1995, 6 (01) :1-10
[7]  
Chen T., 1999, P 3 ANN INT C COMP M, P94
[8]   MAPPING OF THE BINDING INTERFACES OF THE PROTEINS OF THE BACTERIAL PHOSPHOTRANSFERASE SYSTEM, HPR AND IIA(GLC) [J].
CHEN, Y ;
REIZER, J ;
SAIER, MH ;
FAIRBROTHER, WJ ;
WRIGHT, PE .
BIOCHEMISTRY, 1993, 32 (01) :32-37
[9]   Tools for the automated assignment of high-resolution three-dimensional protein NMR spectra based on pattern recognition techniques [J].
Croft, D ;
Kemmink, J ;
Neidig, KP ;
Oschkinat, H .
JOURNAL OF BIOMOLECULAR NMR, 1997, 10 (03) :207-219
[10]   JPred: a consensus secondary structure prediction server [J].
Cuff, JA ;
Clamp, ME ;
Siddiqui, AS ;
Finlay, M ;
Barton, GJ .
BIOINFORMATICS, 1998, 14 (10) :892-893