Probabilistic identification of spin systems and their assignments including coil-helix inference as output (PISTACHIO)

被引:46
作者
Eghbalnia, HR
Bahrami, A
Wang, LY
Assadi, A
Markley, JL
机构
[1] Natl Magnet Resonance Facil Madison, Dept Biochem, Madison, WI 53706 USA
[2] Univ Wisconsin, Ctr Eukaryot Struct Genom, Dept Biochem, Madison, WI 53706 USA
[3] Univ Wisconsin, Grad Program Biophys, Madison, WI 53706 USA
[4] Univ Wisconsin, Dept Math, Madison, WI 53706 USA
基金
美国国家卫生研究院;
关键词
automation; backbone assignments; particle interaction model; sidechain assignments;
D O I
10.1007/s10858-005-7944-6
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We present a novel automated strategy ( PISTACHIO) for the probabilistic assignment of backbone and sidechain chemical shifts in proteins. The algorithm uses peak lists derived from various NMR experiments as input and provides as output ranked lists of assignments for all signals recognized in the input data as constituting spin systems. PISTACHIO was evaluate00000000d by comparing its performance with raw peak-picked data from 15 proteins ranging from 54 to 300 residues; the results were compared with those achieved by experts analyzing the same datasets by hand. As scored against the best available independent assignments for these proteins, the first-ranked PISTACHIO assignments were 80-100% correct for backbone signals and 75-95% correct for sidechain signals. The independent assignments benefited, in a number of cases, from structural data (e. g. from NOESY spectra) that were unavailable to PISTACHIO. Any number of datasets in any combination can serve as input. Thus PISTACHIO can be used as datasets are collected to ascertain the current extent of secure assignments, to identify residues with low assignment probability, and to suggest the types of additional data needed to remove ambiguities. The current implementation of PISTACHIO, which is available from a server on the Internet, supports input data from 15 standard double- and triple-resonance experiments. The software can readily accommodate additional types of experiments, including data from selectively labeled samples. The assignment probabilities can be carried forward and refined in subsequent steps leading to a structure. The performance of PISTACHIO showed no direct dependence on protein size, but correlated instead with data quality ( completeness and signal-to-noise). PISTACHIO represents one component of a comprehensive probabilistic approach we are developing for the collection and analysis of protein NMR data.
引用
收藏
页码:219 / 233
页数:15
相关论文
共 57 条
[1]   Protein sequential resonance assignments by combinatorial enumeration using 13Cα chemical shifts and their (i, i-1) sequential connectivities [J].
Andrec, M ;
Levy, RM .
JOURNAL OF BIOMOLECULAR NMR, 2002, 23 (04) :263-270
[2]  
[Anonymous], 2000, SOLVE IT MODERN HEUR
[3]  
[Anonymous], 1989, GENETIC ALGORITHM SE
[4]  
[Anonymous], 1991, Handbook of genetic algorithms
[5]  
[Anonymous], GEN PROGR P 1 ANN C
[6]   A tracked approach for automated NMR assignments in proteins (TATAPRO) [J].
Atreya, HS ;
Sahu, SC ;
Chary, KVR ;
Govil, G .
JOURNAL OF BIOMOLECULAR NMR, 2000, 17 (02) :125-136
[7]   The NOESY JIGSAW: Automated protein secondary structure and main-chain assignment from sparse, unassigned NMR data [J].
Bailey-Kellogg, C ;
Widge, A ;
Kelley, JJ ;
Berardi, MJ ;
Bushweller, JH ;
Donald, BR .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2000, 7 (3-4) :537-558
[8]  
Bartels C, 1997, J COMPUT CHEM, V18, P139, DOI 10.1002/(SICI)1096-987X(19970115)18:1<139::AID-JCC13>3.0.CO
[9]  
2-H
[10]   THE PROGRAM XEASY FOR COMPUTER-SUPPORTED NMR SPECTRAL-ANALYSIS OF BIOLOGICAL MACROMOLECULES [J].
BARTELS, C ;
XIA, TH ;
BILLETER, M ;
GUNTERT, P ;
WUTHRICH, K .
JOURNAL OF BIOMOLECULAR NMR, 1995, 6 (01) :1-10