Linear analysis of carbon-13 chemical shift differences and its application to the detection and correction of errors in referencing and spin system identifications

被引:59
作者
Wang, LY
Eghbalnia, HR
Bahrami, A
Markley, JL
机构
[1] Natl Magnet Resonance Facil, Dept Biochem, Madison, WI 53706 USA
[2] Ctr Eukaryot Struct Genom, Dept Biochem, Madison, WI 53706 USA
[3] Univ Wisconsin, Grad Program Biophys, Madison, WI 53706 USA
[4] Univ Wisconsin, Dept Math, Madison, WI 53706 USA
关键词
carbon-13 chemical shifts; linear analysis of chemical shifts (LACS); protein backbone geometry; proton chemical shifts; RefDB; TALOS;
D O I
10.1007/s10858-005-1717-0
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Statistical analysis reveals that the set of differences between the secondary shifts of the alpha- and beta-carbons for residues i of a protein (Delta delta C-13(i)alpha - Delta delta C-13(i)beta) provides the means to detect and correct referencing errors for H-1 and C-13 nuclei within a given dataset. In a correctly referenced protein dataset, linear regression plots of Delta delta C-13(i)alpha; Delta delta C-13(i)beta, or Delta delta H-1(i)alpha vs. (Delta delta C-13(i)alpha - Delta delta C-13(i)beta) pass through the origin from two directions, the helix-to-coil and strand-to-coil directions. Thus, linear analysis of chemical shifts (LACS) can be used to detect referencing errors and to recalibrate the H-1 and C-13 chemical shift scales if needed. The analysis requires only that the signals be identified with distinct residue types (intra-residue spin systems). LACS allows errors in calibration to be detected and corrected in advance of sequence-specific assignments and secondary structure determinations. Signals that do not fit the linear model (outliers) deserve scrutiny since they could represent errors in identifying signals with a particular residue, or interesting features such as a cis-peptide bond. LACS provides the basis for the automated detection of such features and for testing reassignment hypotheses. Early detection and correction of errors in referencing and spin system identifications can improve the speed and accuracy of chemical shift assignments and secondary structure determinations. We have used LACS to create a database of offset-corrected chemical shifts corresponding to nearly 1800 BMRB entries: similar to 300 with and similar to 1500 without corresponding three-dimensional (3D) structures. This database can serve as a resource for future analysis of the effects of amino acid sequence and protein secondary and tertiary structure on NMR chemical shifts.
引用
收藏
页码:13 / 22
页数:10
相关论文
共 28 条
[1]  
Barnett V., 1984, Outliers in Statistical Data, V2nd
[2]   Temperature dependence of H-1 chemical shifts in proteins [J].
Baxter, NJ ;
Williamson, MP .
JOURNAL OF BIOMOLECULAR NMR, 1997, 9 (04) :359-369
[3]   The effect of ring currents on carbon chemical shifts in cytochromes [J].
Blanchard, L ;
Hunter, CN ;
Williamson, MP .
JOURNAL OF BIOMOLECULAR NMR, 1997, 9 (04) :389-395
[4]   Protein backbone angle restraints from searching a database for chemical shift and sequence homology [J].
Cornilescu, G ;
Delaglio, F ;
Bax, A .
JOURNAL OF BIOMOLECULAR NMR, 1999, 13 (03) :289-302
[5]   RING CURRENT THEORIES IN NUCLEAR MAGNETIC-RESONANCE [J].
HAIGH, CW ;
MALLION, RB .
PROGRESS IN NUCLEAR MAGNETIC RESONANCE SPECTROSCOPY, 1979, 13 :303-344
[6]   ROBUST REGRESSION USING ITERATIVELY RE-WEIGHTED LEAST-SQUARES [J].
HOLLAND, PW ;
WELSCH, RE .
COMMUNICATIONS IN STATISTICS PART A-THEORY AND METHODS, 1977, 6 (09) :813-827
[7]   Accurate and automated classification of protein secondary structure with PsiCSI [J].
Hung, LH ;
Samudrala, R .
PROTEIN SCIENCE, 2003, 12 (02) :288-295
[8]   Cα and Cβ carbon-13 chemical shifts in proteins from an empirical database [J].
Iwadate, M ;
Asakura, T ;
Williamson, MP .
JOURNAL OF BIOMOLECULAR NMR, 1999, 13 (03) :199-211
[9]   DICTIONARY OF PROTEIN SECONDARY STRUCTURE - PATTERN-RECOGNITION OF HYDROGEN-BONDED AND GEOMETRICAL FEATURES [J].
KABSCH, W ;
SANDER, C .
BIOPOLYMERS, 1983, 22 (12) :2577-2637
[10]  
LE HB, 1994, J BIOMOL NMR, V4, P341