Fold recognition by combining sequence profiles derived from evolution and from depth-dependent structural alignment of fragments

被引:173
作者
Zhou, HY [1 ]
Zhou, YQ [1 ]
机构
[1] SUNY Buffalo, Howard Hughes Med Inst, Ctr Single Mol Biophys, Dept Physiol & Biophys, Buffalo, NY 14214 USA
关键词
fold recognition; protein threading; protein structure prediction; sequence profile;
D O I
10.1002/prot.20308
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Recognizing structural similarity without significant sequence identity has proved to be a challenging task. Sequence-based and structure-based methods as well as their combinations have been developed. Here, we propose a fold-recognition method that incorporates structural information without the need of sequence-to-structure threading. This is accomplished by generating sequence profiles from protein structural fragments. The structure-derived sequence profiles allow a simple integration with evolution-derived sequence profiles and secondary-structural information for an optimized alignment by efficient dynamic programming. The resulting method (called SP3) is found to make a statistically significant improvement in both sensitivity of fold recognition and accuracy of alignment over the method based on evolution-derived sequence profiles alone (SP) and the method based on evolution-derived sequence profile and secondary structure profile (SP2). SP3 was tested in SALIGN benchmark for alignment accuracy and Lindahl, PROSPECTOR 3.0, and LiveBench 8.0 benchmarks for remote-homology detection and model accuracy. Sp(3) is found to be the most sensitive and accurate single-method server in all benchmarks tested where other methods are available for comparison (although its results are statistically indistinguishable from the next best in some cases and the comparison is subjected to the limitation of time-dependent sequence and/or structural library used by different methods.). In LiveBench 8.0, its accuracy rivals some of the consensus methods such as ShotGunINBGU, Pmodeller3, Pcons4, and ROBETTA. (C) 2004 Wiley-Liss, Inc.
引用
收藏
页码:321 / 328
页数:8
相关论文
共 79 条
[1]   RECOGNITION OF DISTANTLY RELATED PROTEINS THROUGH ENERGY CALCULATIONS [J].
ABAGYAN, R ;
FRISHMAN, D ;
ARGOS, P .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1994, 19 (02) :132-140
[2]   Combining multiple structure and sequence alignments to improve sequence detection and alignment: Application to the SH2 domains of Janus kinases [J].
Al-Lazikani, B ;
Sheinerman, FB ;
Honig, B .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (26) :14796-14801
[3]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[4]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[5]   Domain combinations in archaeal, eubacterial and eukaryotic proteomes [J].
Apic, G ;
Gough, J ;
Teichmann, SA .
JOURNAL OF MOLECULAR BIOLOGY, 2001, 310 (02) :311-325
[6]   Score distributions for simultaneous matching to multiple motifs [J].
Bailey, TL ;
Gribskov, M .
JOURNAL OF COMPUTATIONAL BIOLOGY, 1997, 4 (01) :45-59
[7]   Pairwise sequence alignment below the twilight zone [J].
Blake, JD ;
Cohen, FE .
JOURNAL OF MOLECULAR BIOLOGY, 2001, 307 (02) :721-735
[8]   A METHOD TO IDENTIFY PROTEIN SEQUENCES THAT FOLD INTO A KNOWN 3-DIMENSIONAL STRUCTURE [J].
BOWIE, JU ;
LUTHY, R ;
EISENBERG, D .
SCIENCE, 1991, 253 (5016) :164-170
[9]   AN EMPIRICAL ENERGY FUNCTION FOR THREADING PROTEIN-SEQUENCE THROUGH THE FOLDING MOTIF [J].
BRYANT, SH ;
LAWRENCE, CE .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 1993, 16 (01) :92-112
[10]   LiveBench-1: Continuous benchmarking of protein structure prediction servers [J].
Bujnicki, JM ;
Elofsson, A ;
Fischer, D ;
Rychlewski, L .
PROTEIN SCIENCE, 2001, 10 (02) :352-361