Protein fold recognition by prediction-based threading

被引:200
作者
Rost, B [1 ]
Schneider, R [1 ]
Sander, C [1 ]
机构
[1] EBI, CAMBRIDGE CB10 1RQ, ENGLAND
关键词
protein structure prediction; threading; remote homology detection; fold recognition; secondary structure;
D O I
10.1006/jmbi.1997.1101
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
In fold recognition by threading one takes the amino acid sequence of a protein and evaluates how well it fits into one of the known three-dimensional (3D) protein structures. The quality of sequence-structure fit is typically evaluated using inter-residue potentials of mean force or other statistical parameters. Here, we present an alternative approach to evaluating sequence-structure fitness. Starting from the amino acid sequence we first predict secondary structure and solvent accessibility for each residue. We then thread the resulting one-dimensional (1D) profile of predicted structure assignments into each of the known 3D structures. The optimal threading for each sequence-structure pair is obtained using dynamic programming. The overall best sequence-structure pair constitutes the predicted 3D structure for the input sequence. The method is fine-tuned by adding information from direct sequence-sequence comparison and applying a series of empirical filters. Although the method relies on reduction of 3D information into 1D structure profiles, its accuracy is, surprisingly, not clearly inferior to methods based on evaluation of residue interactions in 3D. We therefore hypothesise that existing 1D-3D threading methods essentially do not capture more than the fitness of an amino acid sequence for a particular 1D succession of secondary structure segments and residue solvent accessibility. The prediction-based threading method on average finds any structurally homologous region at first rank in 29% of the cases (including sequence information). For the 22% first hits detected at highest scores, the expected accuracy rose to 75%. However, the task of detecting entire folds rather than homologous fragments was managed much better; 45 to 75% of the first hits correctly recognised the fold. (C) 1997 Academic Press Limited.
引用
收藏
页码:471 / 480
页数:10
相关论文
共 48 条
  • [1] RECOGNITION OF DISTANTLY RELATED PROTEINS THROUGH ENERGY CALCULATIONS
    ABAGYAN, R
    FRISHMAN, D
    ARGOS, P
    [J]. PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1994, 19 (02): : 132 - 140
  • [2] BAIROCH A, 1994, NUCLEIC ACIDS RES, V22, P3578
  • [3] The SWISS-PROT protein sequence data bank and its new supplement TREMBL
    Bairoch, A
    Apweiler, R
    [J]. NUCLEIC ACIDS RESEARCH, 1996, 24 (01) : 21 - 25
  • [4] PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES
    BERNSTEIN, FC
    KOETZLE, TF
    WILLIAMS, GJB
    MEYER, EF
    BRICE, MD
    RODGERS, JR
    KENNARD, O
    SHIMANOUCHI, T
    TASUMI, M
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) : 535 - 542
  • [5] A METHOD TO IDENTIFY PROTEIN SEQUENCES THAT FOLD INTO A KNOWN 3-DIMENSIONAL STRUCTURE
    BOWIE, JU
    LUTHY, R
    EISENBERG, D
    [J]. SCIENCE, 1991, 253 (5016) : 164 - 170
  • [6] IDENTIFICATION OF PROTEIN FOLDS - MATCHING HYDROPHOBICITY PATTERNS OF SEQUENCE SETS WITH SOLVENT ACCESSIBILITY PATTERNS OF KNOWN STRUCTURES
    BOWIE, JU
    CLARKE, ND
    PABO, CO
    SAUER, RT
    [J]. PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1990, 7 (03): : 257 - 264
  • [7] STATISTICS OF SEQUENCE-STRUCTURE THREADING
    BRYANT, SH
    ALTSCHUL, SF
    [J]. CURRENT OPINION IN STRUCTURAL BIOLOGY, 1995, 5 (02) : 236 - 244
  • [8] THE RELATION BETWEEN THE DIVERGENCE OF SEQUENCE AND STRUCTURE IN PROTEINS
    CHOTHIA, C
    LESK, AM
    [J]. EMBO JOURNAL, 1986, 5 (04) : 823 - 826
  • [9] PROTEINS - 1000 FAMILIES FOR THE MOLECULAR BIOLOGIST
    CHOTHIA, C
    [J]. NATURE, 1992, 357 (6379) : 543 - 544
  • [10] DOOLITTLE RF, 1986, URFS ORFS PRIMER HOW