Measurements of protein sequence-structure correlations

Cited by: 36
Authors
Crooks, GE [1]
Wolfe, J [1]
Brenner, SE [1]
Affiliations
[1] Univ Calif Berkeley, Dept Plant & Microbial Biol, Berkeley, CA 94720 USA
Keywords
protein structure; contact potentials; mutual information; secondary structure; hydrophobicity
DOI
10.1002/prot.20262
Chinese Library Classification (CLC) codes
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
Correlations between protein structures and amino acid sequences are widely used for protein structure prediction. For example, secondary structure predictors generally use correlations between a secondary structure sequence and the corresponding primary structure sequence, whereas threading algorithms and similar tertiary structure predictors typically incorporate inter-residue contact potentials. To investigate the relative importance of these sequence-structure interactions, we measured the mutual information among the primary structure, secondary structure, and side-chain surface exposure, both for adjacent residues along the amino acid sequence and for tertiary structure contacts between residues distantly separated along the backbone. We found that local interactions along the amino acid chain are far more important than non-local contacts and that correlations between proximate amino acids are essentially uninformative. This suggests that knowledge-based contact potentials may be less important for structure prediction than is generally believed. © 2004 Wiley-Liss, Inc.
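The central quantity in the abstract, mutual information between paired discrete variables, can be illustrated with a minimal sketch. This is a plain plug-in (frequency-count) estimate in bits and is not necessarily the estimator the authors used. The function name `mutual_information` and the toy residue and secondary-structure strings are hypothetical and chosen only for illustration.

```python
from collections import Counter
from math import log2


def mutual_information(xs, ys):
    """Plug-in estimate of I(X;Y) in bits from paired discrete observations.

    Hypothetical helper for illustration; not taken from the paper.
    """
    if len(xs) != len(ys):
        raise ValueError("xs and ys must have the same length")
    n = len(xs)
    if n == 0:
        raise ValueError("need at least one observation")

    joint = Counter(zip(xs, ys))  # counts of (x, y) pairs
    count_x = Counter(xs)         # marginal counts of x
    count_y = Counter(ys)         # marginal counts of y

    mi = 0.0
    for (x, y), c in joint.items():
        # p(x,y) * log2( p(x,y) / (p(x) * p(y)) ), written with raw counts
        mi += (c / n) * log2(c * n / (count_x[x] * count_y[y]))
    return mi


# Toy data (assumed, not from the paper): one secondary-structure label per residue.
sequence = "AAVVLLGGPP"
structure = "HHHHHHCCCC"

# Residue identity versus secondary-structure state at the same position.
mi_seq_struct = mutual_information(sequence, structure)

# Correlation between adjacent residues along the chain (positions i and i+1).
mi_adjacent = mutual_information(sequence[:-1], sequence[1:])
```

A plug-in estimate of this kind is biased upward when the sample is small relative to the number of symbol pairs, so comparisons on real data would need far larger counts than this toy input provides.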
Pages: 804-810
Number of pages: 7