Probabilistic identification of spin systems and their assignments including coil-helix inference as output (PISTACHIO)

被引：46

作者：

Eghbalnia, HR

Bahrami, A

Wang, LY

Assadi, A

Markley, JL

机构：

[1] Natl Magnet Resonance Facil Madison, Dept Biochem, Madison, WI 53706 USA

[2] Univ Wisconsin, Ctr Eukaryot Struct Genom, Dept Biochem, Madison, WI 53706 USA

[3] Univ Wisconsin, Grad Program Biophys, Madison, WI 53706 USA

[4] Univ Wisconsin, Dept Math, Madison, WI 53706 USA

来源：

JOURNAL OF BIOMOLECULAR NMR | 2005年 / 32卷 / 03期

基金：

美国国家卫生研究院;

关键词：

automation; backbone assignments; particle interaction model; sidechain assignments;

D O I：

10.1007/s10858-005-7944-6

中图分类号：

Q5 [生物化学]; Q7 [分子生物学];

学科分类号：

071010 ; 081704 ;

摘要：

We present a novel automated strategy ( PISTACHIO) for the probabilistic assignment of backbone and sidechain chemical shifts in proteins. The algorithm uses peak lists derived from various NMR experiments as input and provides as output ranked lists of assignments for all signals recognized in the input data as constituting spin systems. PISTACHIO was evaluate00000000d by comparing its performance with raw peak-picked data from 15 proteins ranging from 54 to 300 residues; the results were compared with those achieved by experts analyzing the same datasets by hand. As scored against the best available independent assignments for these proteins, the first-ranked PISTACHIO assignments were 80-100% correct for backbone signals and 75-95% correct for sidechain signals. The independent assignments benefited, in a number of cases, from structural data (e. g. from NOESY spectra) that were unavailable to PISTACHIO. Any number of datasets in any combination can serve as input. Thus PISTACHIO can be used as datasets are collected to ascertain the current extent of secure assignments, to identify residues with low assignment probability, and to suggest the types of additional data needed to remove ambiguities. The current implementation of PISTACHIO, which is available from a server on the Internet, supports input data from 15 standard double- and triple-resonance experiments. The software can readily accommodate additional types of experiments, including data from selectively labeled samples. The assignment probabilities can be carried forward and refined in subsequent steps leading to a structure. The performance of PISTACHIO showed no direct dependence on protein size, but correlated instead with data quality ( completeness and signal-to-noise). PISTACHIO represents one component of a comprehensive probabilistic approach we are developing for the collection and analysis of protein NMR data.

引用

页码：219 / 233

页数：15

共 57 条

[1] Protein sequential resonance assignments by combinatorial enumeration using 13Cα chemical shifts and their (i, i-1) sequential connectivities [J].