Fold recognition with minimal gaps

被引:4
作者
Chen, W
Mirny, L
Shakhnovich, EI
机构
[1] Harvard Univ, Dept Chem & Biol Chem, Cambridge, MA 02138 USA
[2] Harvard Univ, Dept Biophys, Cambridge, MA 02138 USA
[3] MIT, Div Hlth Sci & Technol, Cambridge, MA 02139 USA
关键词
fold recognition; PSI-BLAST; random energy model; epsilon cutoff parameter;
D O I
10.1002/prot.10402
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Here we present a simplified form of threading that uses only a 20 X 20 two-body residue-based potential and restricted number of gaps. Despite its simplicity and transparency the Monte Carlo-based threading algorithm performs very well in a rigorous test of fold recognition. The results suggest that by simplifying and constraining the decoy space, one can achieve better fold recognition. Fold recognition results are compared with and supplemented by a PSI-BIAST search. The statistical significance of threading results is rigorously evaluated from statistics of extremes by comparison with optimal alignments of a large set of randomly shuffled sequences. The statistical theory, based on the Random Energy Model, yields a cumulative statistical parameter, F, that attests to the likelihood of correct fold recognition. A large E indicates a significant energy gap between the optimal alignment and decoy alignments and, consequently, a high probability that the fold is correctly recognized. For a particular number of gaps, the E parameter reaches its maximal value, and the fold is recognized. As the number of gaps further increases, the likelihood of correct fold recognition drops off. This is because the decoy space is small when gaps are restricted to a small number, but the native alignment is still well approximated, whereas unrestricted increase of the number of gaps leads to rapid growth of the number of decoys and their statistical dominance over the correct alignment. It is shown that best results are obtained when a combination of one-, two-, and three-gap threading is used. To this end, use of the E parameter is crucial for rigorous comparison of results across the different decoy spaces belonging to a different number of gaps.
引用
收藏
页码:531 / 543
页数:13
相关论文
共 40 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   A METHOD TO IDENTIFY PROTEIN SEQUENCES THAT FOLD INTO A KNOWN 3-DIMENSIONAL STRUCTURE [J].
BOWIE, JU ;
LUTHY, R ;
EISENBERG, D .
SCIENCE, 1991, 253 (5016) :164-170
[3]   STATISTICS OF SEQUENCE-STRUCTURE THREADING [J].
BRYANT, SH ;
ALTSCHUL, SF .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 1995, 5 (02) :236-244
[4]  
Bryant SH, 1996, PROTEINS, V26, P172
[5]   SPIN-GLASSES AND THE STATISTICAL-MECHANICS OF PROTEIN FOLDING [J].
BRYNGELSON, JD ;
WOLYNES, PG .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1987, 84 (21) :7524-7528
[6]   Free energy self-averaging in protein-sized random heteropolymers [J].
Chuang, J ;
Grosberg, AY ;
Kardar, M .
PHYSICAL REVIEW LETTERS, 2001, 87 (07) :78104-1
[7]   RANDOM-ENERGY MODEL - LIMIT OF A FAMILY OF DISORDERED MODELS [J].
DERRIDA, B .
PHYSICAL REVIEW LETTERS, 1980, 45 (02) :79-82
[8]  
Dunbrack RL, 1999, PROTEINS, P81
[9]   Protein structure: What is it possible to predict now? [J].
Finkelstein, AV .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 1997, 7 (01) :60-71
[10]   3D protein folds: Homologs against errors - a simple estimate based on the random energy model [J].
Finkelstein, AV .
PHYSICAL REVIEW LETTERS, 1998, 80 (21) :4823-4825