A MULTIVARIATE-ANALYSIS METHOD FOR DISCRIMINATING PROTEIN SECONDARY STRUCTURAL SEGMENTS

被引:35
作者
KANEHISA, M
机构
来源
PROTEIN ENGINEERING | 1988年 / 2卷 / 02期
关键词
D O I
10.1093/protein/2.2.87
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Using discriminant analysis, three types of protein secondary structure segments-helices, .beta.-strands and coils-are discriminated by amino acid sequence information alone. A variable in the discriminant analysis is defined by the amino acid index used to represent the sequence data and by the calculation method used to extract a feature in this representation. Thus, the three types of secondary structure segments derived from a set of non-homologous proteins from the Protein Data Bank are analyzed by 888 variables, which correspond to the mean, standard deviation, 3.6-residue periodicity and 2-residue periodicity for the numerical profiles determined from 222 published amino acid indices. These variables are combined to obtain best discrimination of the three types of segments. When up to three variables are combined, the best discrimination rate was 75%. The variables selected consist of the mean .alpha. propensity (or turn propensity), the mean of .beta. propensity, and the 3.6-residue periodicity of hydrophobicity. This variable selection procedure can also be applied to other types of discrimination problem, once groups of sequence data are properly organized.
引用
收藏
页码:87 / 92
页数:6
相关论文
共 24 条
[1]   PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES [J].
BERNSTEIN, FC ;
KOETZLE, TF ;
WILLIAMS, GJB ;
MEYER, EF ;
BRICE, MD ;
RODGERS, JR ;
KENNARD, O ;
SHIMANOUCHI, T ;
TASUMI, M .
JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) :535-542
[2]  
Chou P Y, 1978, Adv Enzymol Relat Areas Mol Biol, V47, P45
[3]   HYDROPHOBICITY SCALES AND COMPUTATIONAL TECHNIQUES FOR DETECTING AMPHIPATHIC STRUCTURES IN PROTEINS [J].
CORNETTE, JL ;
CEASE, KB ;
MARGALIT, H ;
SPOUGE, JL ;
BERZOFSKY, JA ;
DELISI, C .
JOURNAL OF MOLECULAR BIOLOGY, 1987, 195 (03) :659-685
[4]  
Dayhoff H., 1978, ALTAS PROTEIN SEQUEN, V5, P363
[5]   THE PROTEIN IDENTIFICATION RESOURCE (PIR) [J].
GEORGE, DG ;
BARKER, WC ;
HUNT, LT .
NUCLEIC ACIDS RESEARCH, 1986, 14 (01) :11-15
[6]   AMINO-ACID DIFFERENCE FORMULA TO HELP EXPLAIN PROTEIN EVOLUTION [J].
GRANTHAM, R .
SCIENCE, 1974, 185 (4154) :862-864
[7]   CHARACTERIZATION OF MULTIPLE BENDS IN PROTEINS [J].
ISOGAI, Y ;
NEMETHY, G ;
RACKOVSKY, S ;
LEACH, SJ ;
SCHERAGA, HA .
BIOPOLYMERS, 1980, 19 (06) :1183-1210
[8]   HOW GOOD ARE PREDICTIONS OF PROTEIN SECONDARY STRUCTURE [J].
KABSCH, W ;
SANDER, C .
FEBS LETTERS, 1983, 155 (02) :179-182
[9]   DICTIONARY OF PROTEIN SECONDARY STRUCTURE - PATTERN-RECOGNITION OF HYDROGEN-BONDED AND GEOMETRICAL FEATURES [J].
KABSCH, W ;
SANDER, C .
BIOPOLYMERS, 1983, 22 (12) :2577-2637
[10]   LOCAL HYDROPHOBICITY STABILIZES SECONDARY STRUCTURES IN PROTEINS [J].
KANEHISA, MI ;
TSONG, TY .
BIOPOLYMERS, 1980, 19 (09) :1617-1628