PRINCIPAL COMPONENT ANALYSIS OF FOURIER-TRANSFORM INFRARED AND/OR CIRCULAR-DICHROISM SPECTRA OF PROTEINS APPLIED IN A CALIBRATION OF PROTEIN SECONDARY STRUCTURE

Cited by: 26
Author
PRIBIC, R
Affiliation
[1] Faculty of Physics and Astronomy, Free University, Amsterdam
DOI
10.1006/abio.1994.1541
Chinese Library Classification
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Obtaining information on the secondary structure of a protein from its spectra is presented as a calibration problem. The secondary structures known from X-ray studies and the spectra of 21 proteins are represented by a linear model. Fourier transform infrared (FTIR) spectra from 1700 to 1600 cm⁻¹, circular dichroism (CD) spectra from 178 to 260 nm, and combined spectra are used; the secondary structure classes of interest are alpha-helices, antiparallel beta-sheets, parallel beta-sheets, beta-turns, and "other." The calibration is solved in two steps: (i) the dependencies between the structures and the spectra of the reference proteins are found using the least-squares estimator, and (ii) the secondary structure of a protein is predicted from its spectra using the information gained in the first step and principal component analysis. The information content of the reference spectra is analyzed using the linearly independent pieces of information, the so-called principal components, provided by singular value decomposition. Attention is paid to the number of principal components sufficient for the prediction, which may be less than the total number. A relative estimable parameter is used to determine unambiguously the number of components corresponding to the minimum mean square error of the predictor. The analysis gives the solutions to this linear calibration that are relevant to the underlying protein problem, thus reducing subjective assessments as well as computations. © 1994 Academic Press, Inc.
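The two-step calibration described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the data are synthetic (21 hypothetical proteins, 5 structure classes, 101 spectral points), the function names `fit_pcr`, `predict`, and `loo_mse` are made up for this example, the spectra are not mean-centered, and the number of components is chosen by leave-one-out cross-validation as a stand-in for the relative estimable parameter mentioned in the abstract, whose definition is not given here.

```python
import numpy as np

def fit_pcr(spectra, structures, k):
    """Least-squares calibration restricted to the first k principal components.

    spectra:    (n_proteins, n_points) reference spectra
    structures: (n_proteins, n_classes) known structure fractions
    Returns a (n_points, n_classes) coefficient matrix.
    """
    # Singular value decomposition of the reference spectra.
    U, s, Vt = np.linalg.svd(spectra, full_matrices=False)
    # Truncated pseudoinverse built from the k largest singular values.
    pinv_k = Vt[:k].T @ np.diag(1.0 / s[:k]) @ U[:, :k].T
    return pinv_k @ structures

def predict(coef, spectrum):
    """Predict structure fractions for one spectrum (or a stack of spectra)."""
    return spectrum @ coef

def loo_mse(spectra, structures, k):
    """Leave-one-out mean square prediction error for k components."""
    n = spectra.shape[0]
    errors = []
    for i in range(n):
        keep = np.arange(n) != i
        coef = fit_pcr(spectra[keep], structures[keep], k)
        residual = predict(coef, spectra[i]) - structures[i]
        errors.append(np.mean(residual ** 2))
    return float(np.mean(errors))

# Synthetic stand-in data (assumption: purely illustrative, not measured spectra).
rng = np.random.default_rng(0)
n_proteins, n_classes, n_points = 21, 5, 101
structures = rng.dirichlet(np.ones(n_classes), size=n_proteins)  # rows sum to 1
pure_spectra = rng.normal(size=(n_classes, n_points))            # hypothetical class spectra
spectra = structures @ pure_spectra + 0.01 * rng.normal(size=(n_proteins, n_points))

# Scan the number of retained components and keep the one with the lowest error.
mse_by_k = {k: loo_mse(spectra, structures, k) for k in range(1, 9)}
best_k = min(mse_by_k, key=mse_by_k.get)
print("leave-one-out MSE by number of components:", mse_by_k)
print("number of components with lowest MSE:", best_k)
```

In this synthetic setup the spectra are generated from five underlying class spectra, so retaining too few components discards signal while retaining many more mostly adds noise; that trade-off is the reason the number of components has to be chosen rather than taken as the total available.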
Pages: 26-34
Page count: 9