Eigen THREADER: analogous protein fold recognition by efficient contact map threading

被引:39
作者
Buchan, Daniel W. A. [1 ]
Jones, David T. [1 ]
机构
[1] UCL, Dept Comp Sci, Gower St, London WC1E 6BT, England
基金
英国生物技术与生命科学研究理事会;
关键词
STRUCTURE PREDICTION; SEQUENCE; ALIGNMENT; COEVOLUTION;
D O I
10.1093/bioinformatics/btx217
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Protein fold recognition when appropriate, evolutionarily-related, structural templates can be identified is often trivial and may even be viewed as a solved problem. However in cases where no homologous structural templates can be detected, fold recognition is a notoriously difficult problem (Moult et al., 2014). Here we present EigenTHREADER, a novel fold recognition method capable of identifying folds where no homologous structures can be identified. EigenTHREADER takes a query amino acid sequence, generates a map of intra-residue contacts, and then searches a library of contact maps of known structures. To allow the contact maps to be compared, we use eigenvector decomposition to resolve the principal eigenvectors these can then be aligned using standard dynamic programming algorithms. The approach is similar to the AlEigen approach of Di Lena et al. (2010), but with improvements made both to speed and accuracy. With this search strategy, EigenTHREADER does not depend directly on sequence homology between the target protein and entries in the fold library to generate models. This in turn enables EigenTHREADER to correctly identify analogous folds where little or no sequence homology information is. Results: EigenTHREADER outperforms well-established fold recognition methods such as pGenTHREADER and HHSearch in terms of True Positive Rate in the difficult task of analogous fold recognition. This should allow template-based modelling to be extended to many new protein families that were previously intractable to homology based fold recognition methods.
引用
收藏
页码:2684 / 2690
页数:7
相关论文
共 31 条
  • [1] The Protein Data Bank
    Berman, HM
    Westbrook, J
    Feng, Z
    Gilliland, G
    Bhat, TN
    Weissig, H
    Shindyalov, IN
    Bourne, PE
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 235 - 242
  • [2] Creighton T., 1992, PROTEINS STRUCTURES
  • [3] Fast overlapping of protein contact maps by alignment of eigenvectors
    Di Lena, Pietro
    Fariselli, Piero
    Margara, Luciano
    Vassura, Marco
    Casadio, Rita
    [J]. BIOINFORMATICS, 2010, 26 (18) : 2250 - 2258
  • [4] BioShell-Threading: versatile Monte Carlo package for protein 3D threading
    Gniewek, Pawel
    Kolinski, Andrzej
    Kloczkowski, Andrzej
    Gront, Dominik
    [J]. BMC BIOINFORMATICS, 2014, 15
  • [5] GOLDMAN D, 1999, FOCS 99 P 40 ANN S F, P512, DOI DOI 10.1109/SFFCS.1999.814624
  • [6] MetaPSICOV: combining coevolution methods for accurate prediction of contacts and long range hydrogen bonding in proteins
    Jones, David T.
    Singh, Tanya
    Kosciolek, Tomasz
    Tetchner, Stuart
    [J]. BIOINFORMATICS, 2015, 31 (07) : 999 - 1006
  • [7] PSICOV: precise structural contact prediction using sparse inverse covariance estimation on large multiple sequence alignments
    Jones, David T.
    Buchan, Daniel W. A.
    Cozzetto, Domenico
    Pontil, Massimiliano
    [J]. BIOINFORMATICS, 2012, 28 (02) : 184 - 190
  • [8] A NEW APPROACH TO PROTEIN FOLD RECOGNITION
    JONES, DT
    TAYLOR, WR
    THORNTON, JM
    [J]. NATURE, 1992, 358 (6381) : 86 - 89
  • [9] FreeContact: fast and free software for protein contact prediction from residue co-evolution
    Kajan, Laszlo
    Hopf, Thomas A.
    Kalas, Matus
    Marks, Debora S.
    Rost, Burkhard
    [J]. BMC BIOINFORMATICS, 2014, 15
  • [10] Assessment of CASP11 contact-assisted predictions
    Kinch, Lisa N.
    Li, Wenlin
    Monastyrskyy, Bohdan
    Kryshtafovych, Andriy
    Grishin, Nick V.
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2016, 84 : 164 - 180