The prediction of protein contacts from multiple sequence alignments

被引:55
作者
Thomas, DJ
Casari, G
Sander, C
机构
[1] Europ. Molecular Biology Laboratory, 69012 Heidelberg
来源
PROTEIN ENGINEERING | 1996年 / 9卷 / 11期
关键词
correlated mutations; multiple sequence alignments; protein contact prediction;
D O I
10.1093/protein/9.11.941
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We have studied the question of how much extra predictive power the correlated mutational behaviour of pairs of amino acid residues separated along a sequence has concerning the likelihood of those residues being in contact in the folded protein. The mutational behaviour is deduced from multiple sequence alignments. Our findings are that there is, indeed, some valuable information available from this source and that it is sufficient to make a significant improvement in our ability to predict contacts, when compared with earlier methods that do not take into account the correlations between the mutations. This improvement is approximately twice as large as can be obtained by the more economical method of simply averaging pair preferences over the same sequence alignment. Even when using a method based on pair preferences, a further significant improvement can be made by penalizing more variable regions (on the reasonable assumption that invariant residues are relatively more likely to be in contact), though we have found no way of improving the pair preference method to the extent that it matches the method based on correlated behaviour, Our new method is thought to be the best data-based method of contact prediction developed so far, achieving, on average, an improvement over a random (i.e. information-free) prediction of a factor of five when the number of contacts predicted is chosen to match the number that actually occur.
引用
收藏
页码:941 / 948
页数:8
相关论文
共 16 条
[1]  
BAIROCH A, 1994, NUCLEIC ACIDS RES, V22, P3578
[2]   PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES [J].
BERNSTEIN, FC ;
KOETZLE, TF ;
WILLIAMS, GJB ;
MEYER, EF ;
BRICE, MD ;
RODGERS, JR ;
KENNARD, O ;
SHIMANOUCHI, T ;
TASUMI, M .
JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) :535-542
[3]   COVARIATION OF RESIDUES IN THE HOMEODOMAIN SEQUENCE FAMILY [J].
CLARKE, ND .
PROTEIN SCIENCE, 1995, 4 (11) :2269-2278
[4]   CORRELATED MUTATIONS AND RESIDUE CONTACTS IN PROTEINS [J].
GOBEL, U ;
SANDER, C ;
SCHNEIDER, R ;
VALENCIA, A .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1994, 18 (04) :309-317
[5]   IDENTIFICATION OF NATIVE PROTEIN FOLDS AMONGST A LARGE NUMBER OF INCORRECT MODELS - THE CALCULATION OF LOW-ENERGY CONFORMATIONS FROM POTENTIALS OF MEAN FORCE [J].
HENDLICH, M ;
LACKNER, P ;
WEITCKUS, S ;
FLOECKNER, H ;
FROSCHAUER, R ;
GOTTSBACHER, K ;
CASARI, G ;
SIPPL, MJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 216 (01) :167-180
[6]  
HOBOHM U, 1992, PROTEIN SCI, V1, P409
[7]   A NEW APPROACH TO PROTEIN FOLD RECOGNITION [J].
JONES, DT ;
TAYLOR, WR ;
THORNTON, JM .
NATURE, 1992, 358 (6381) :86-89
[8]   PAIRED NATURAL CYSTEINE MUTATION MAPPING - AID TO CONSTRAINING MODELS OF PROTEIN TERTIARY STRUCTURE [J].
KREISBERG, R ;
BUCHNER, V ;
ARAD, D .
PROTEIN SCIENCE, 1995, 4 (11) :2405-2410
[9]   ASSESSMENT OF PROTEIN MODELS WITH 3-DIMENSIONAL PROFILES [J].
LUTHY, R ;
BOWIE, JU ;
EISENBERG, D .
NATURE, 1992, 356 (6364) :83-85
[10]   ESTIMATION OF EFFECTIVE INTERRESIDUE CONTACT ENERGIES FROM PROTEIN CRYSTAL-STRUCTURES - QUASI-CHEMICAL APPROXIMATION [J].
MIYAZAWA, S ;
JERNIGAN, RL .
MACROMOLECULES, 1985, 18 (03) :534-552