DISCOVERING STRUCTURAL CORRELATIONS IN ALPHA-HELICES

Cited by: 62
Authors
KLINGLER, TM
BRUTLAG, DL
Affiliations
[1] STANFORD UNIV, SCH MED, DEPT BIOCHEM, STANFORD, CA 94305 USA
[2] STANFORD UNIV, SCH MED, MED INFORMAT SECT, STANFORD, CA 94305 USA
Keywords
ALPHA-HELIX STRUCTURE; AMINO ACID CORRELATIONS; MOTIF MODELING; SEQUENCE ANALYSIS; SIDE-CHAIN INTERACTIONS; STRUCTURE ANALYSIS;
DOI
10.1002/pro.5560031024
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification codes
071010 ; 081704 ;
Abstract
We have developed a new representation for structural and functional motifs in protein sequences based on correlations between pairs of amino acids and applied it to alpha-helical and beta-sheet sequences. Existing probabilistic methods for representing and analyzing protein sequences have traditionally assumed conditional independence of evidence. In other words, amino acids are assumed to have no effect on each other. However, analyses of protein structures have repeatedly demonstrated the importance of interactions between amino acids in conferring both structure and function. Using Bayesian networks, we are able to model the relationships between amino acids at distinct positions in a protein sequence in addition to the amino acid distributions at each position. We have also developed an automated program for discovering sequence correlations using standard statistical tests and validation techniques. In this paper, we test this program on sequences from secondary structure motifs, namely alpha-helices and beta-sheets. In each case, the correlations our program discovers correspond well with known physical and chemical interactions between amino acids in structures. Furthermore, we show that, using different chemical alphabets for the amino acids, we discover structural relationships based on the same chemical principle used in constructing the alphabet. This new representation of 3-dimensional features in protein motifs, such as those arising from structural or functional constraints on the sequence, can be used to improve sequence analysis tools including pattern analysis and database search.
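The idea of testing whether amino acids at two positions are correlated can be illustrated with a small sketch. This is not the authors' program: it assumes a plain Pearson chi-square test of independence as one example of a "standard statistical test", and the function names (`pair_counts`, `chi_square`), the reduced alphabet `CHEMICAL_CLASS`, and the toy sequences in the usage note are all hypothetical.

```python
from collections import Counter
from itertools import product

# Hypothetical reduced chemical alphabet (an assumption, not the paper's):
# h = hydrophobic, p = polar, + = basic, - = acidic.
CHEMICAL_CLASS = {
    **dict.fromkeys("AVLIMFWC", "h"),
    **dict.fromkeys("STNQYGPH", "p"),
    **dict.fromkeys("KR", "+"),
    **dict.fromkeys("DE", "-"),
}


def pair_counts(sequences, i, j, alphabet=None):
    """Count co-occurring symbols at positions i and j across aligned sequences."""
    counts = Counter()
    for seq in sequences:
        if max(i, j) >= len(seq):
            continue  # skip sequences too short to contain both positions
        a, b = seq[i], seq[j]
        if alphabet is not None:
            # Map each residue to its chemical class before counting.
            a, b = alphabet[a], alphabet[b]
        counts[(a, b)] += 1
    return counts


def chi_square(counts):
    """Return (Pearson chi-square statistic, degrees of freedom) for independence."""
    total = sum(counts.values())
    row, col = Counter(), Counter()
    for (a, b), n in counts.items():
        row[a] += n
        col[b] += n
    stat = 0.0
    for a, b in product(row, col):
        # Expected count if the two positions were independent.
        expected = row[a] * col[b] / total
        observed = counts.get((a, b), 0)
        stat += (observed - expected) ** 2 / expected
    dof = (len(row) - 1) * (len(col) - 1)
    return stat, dof
```

As a usage example, calling `chi_square(pair_counts(["EAAAK"] * 10 + ["KAAAE"] * 10, 0, 4))` on this toy set, where an acidic residue at position i always pairs with a basic residue at i+4, gives a large statistic with one degree of freedom. A set in which the two positions vary independently gives a statistic of zero. Passing `alphabet=CHEMICAL_CLASS` runs the same test on chemical classes instead of individual residues. A real analysis would also need multiple-testing correction and validation on held-out sequences, which this sketch omits.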
Pages: 1847-1857
Page count: 11