The effect of long-range interactions on the secondary structure formation of proteins

Cited by: 75
Authors
Kihara, D [1]
Affiliations
[1] Purdue Univ, Markey Ctr Struct Biol, Dept Biol Sci Comp Sci, Bindley Biosci Ctr, W Lafayette, IN 47907 USA
Keywords
secondary structure prediction; long-range interaction; residue contact order; beta-strand formation
DOI
10.1110/ps.051479505
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
The influence of long-range residue interactions on defining secondary structure in a protein has long been discussed and is often cited as the current limitation to accurate secondary structure prediction. There are several experimental examples where a local sequence alone is not sufficient to determine its secondary structure, but a comprehensive survey on a large data set has not yet been done. Interestingly, some earlier studies denied the negative effect of long-range interactions on secondary structure prediction accuracy. Here, we have introduced the residue contact order (RCO), which directly indicates the separation of contacting residues in terms of the position in the sequence, and examined the relationship between the RCO and the prediction accuracy. A large data set of 2777 nonhomologous proteins was used in our analysis. Unlike previous studies, we do find that prediction accuracy drops as residues have contacts with more distant residues. Moreover, this negative correlation between the RCO and the prediction accuracy was found not only for beta-strands, but also for alpha-helices. The prediction accuracy of beta-strands is lower if residues have a high RCO or a low RCO, which corresponds to the situation that a beta-sheet is formed by beta-strands from different chains in a protein complex. The reason why the current study draws the opposite conclusion from the previous studies is examined. The implication for protein folding is also discussed.
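To make the RCO concrete, the following is a minimal sketch. It assumes, since the abstract does not give the formula, that the RCO of residue i is the mean sequence separation |i - j| over all residues j lying within a distance cutoff of residue i. The function name `residue_contact_order`, the use of one coordinate per residue, the cutoff, and the minimum-separation filter are all illustrative assumptions rather than the paper's definitions.

```python
import math


def residue_contact_order(coords, cutoff=8.0, min_sep=1):
    """Return, for each residue, the mean sequence separation of its contacts.

    coords: list of (x, y, z) tuples, one representative atom per residue.
    cutoff (in the same units as coords) and min_sep are assumed values,
    not parameters taken from the paper.
    """
    n = len(coords)
    rco = []
    for i in range(n):
        # Sequence separations |i - j| of every residue j in contact with i.
        seps = [
            abs(i - j)
            for j in range(n)
            if abs(i - j) >= min_sep
            and math.dist(coords[i], coords[j]) <= cutoff
        ]
        # Residues with no contacts get 0.0 by convention in this sketch.
        rco.append(sum(seps) / len(seps) if seps else 0.0)
    return rco


# Toy example: four residues placed on the corners of a square, so the
# first and last residues touch despite being far apart in sequence.
toy = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (3.8, 3.8, 0.0), (0.0, 3.8, 0.0)]
values = residue_contact_order(toy, cutoff=4.0)
```

Under this assumed definition, a residue whose contacts are all sequence neighbours scores low, while a residue paired with a sequence-distant partner, as in many beta-sheets, scores high.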
Pages: 1955-1963
Number of pages: 9