THE PROTEIN THREADING PROBLEM WITH SEQUENCE AMINO-ACID INTERACTION PREFERENCES IS NP-COMPLETE

被引:176
作者
LATHROP, RH
机构
[1] Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge
来源
PROTEIN ENGINEERING | 1994年 / 7卷 / 09期
关键词
CONTACT POTENTIALS; INVERSE PROTEIN FOLDING; NP-COMPLETE; PROTEIN STRUCTURE PREDICTION; PROTEIN THREADING; SEQUENCE-STRUCTURE ALIGNMENT;
D O I
10.1093/protein/7.9.1059
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
In recent protein structure prediction research there has been a great deal of interest in using amino acid interaction preferences (e.g. contact potentials or potentials of mean force) to align ('thread') a protein sequence to a known structural motif. An important open question is whether a polynomial time algorithm for finding the globally optimal threading is possible. We identify the two critical conditions governing this question: (i) variable-length gaps are admitted into the alignment, and (ii) interactions between amino acids from the sequence are admitted into the score function. We prove that if both these conditions are allowed then the protein threading decision problem (does there exist a threading with a score less than or equal to K?) is NP-complete (in the strong sense, i.e. is not merely a number problem) and the related problem of finding the globally optimal protein threading is NP-hard. Therefore, no polynomial time algorithm is possible (unless P = NP). This result augments existing proofs that the direct protein folding problem is NP-complete by providing the corresponding proof for the 'inverse' protein folding problem. It provides a theoretical basis for understanding algorithms currently in use and indicates that computational strategies from other NP-complete problems may be useful for predictive algorithms.
引用
收藏
页码:1059 / 1068
页数:10
相关论文
共 38 条
[1]  
[Anonymous], 1971, STOC 71, DOI DOI 10.1145/800157.805047
[2]   A METHOD TO IDENTIFY PROTEIN SEQUENCES THAT FOLD INTO A KNOWN 3-DIMENSIONAL STRUCTURE [J].
BOWIE, JU ;
LUTHY, R ;
EISENBERG, D .
SCIENCE, 1991, 253 (5016) :164-170
[3]  
BROOKS CL, 1990, PROTEINS THEORETICAL
[4]   AN EMPIRICAL ENERGY FUNCTION FOR THREADING PROTEIN-SEQUENCE THROUGH THE FOLDING MOTIF [J].
BRYANT, SH ;
LAWRENCE, CE .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 1993, 16 (01) :92-112
[5]   PROTEINS - 1000 FAMILIES FOR THE MOLECULAR BIOLOGIST [J].
CHOTHIA, C .
NATURE, 1992, 357 (6379) :543-544
[6]   AN EMPIRICAL-APPROACH TO PROTEIN CONFORMATION STABILITY AND FLEXIBILITY [J].
CREIGHTON, TE .
BIOPOLYMERS, 1983, 22 (01) :49-58
[7]   PREDICTION OF PROTEIN FOLDING FROM AMINO-ACID-SEQUENCE OVER DISCRETE CONFORMATION SPACES [J].
CRIPPEN, GM .
BIOCHEMISTRY, 1991, 30 (17) :4232-4237
[8]   THE DEAD-END ELIMINATION THEOREM AND ITS USE IN PROTEIN SIDE-CHAIN POSITIONING [J].
DESMET, J ;
DEMAEYER, M ;
HAZES, B ;
LASTERS, I .
NATURE, 1992, 356 (6369) :539-542
[9]   A SEARCH FOR THE MOST STABLE FOLDS OF PROTEIN CHAINS [J].
FINKELSTEIN, AV ;
REVA, BA .
NATURE, 1991, 351 (6326) :497-499
[10]   COMPLEXITY OF PROTEIN-FOLDING [J].
FRAENKEL, AS .
BULLETIN OF MATHEMATICAL BIOLOGY, 1993, 55 (06) :1199-1210