Sequence comparison and protein structure prediction

被引:127
作者
Dunbrack, Roland L., Jr. [1 ]
机构
[1] Fox Chase Canc Ctr, Inst Canc Res, Philadelphia, PA 19111 USA
基金
美国国家卫生研究院;
关键词
D O I
10.1016/j.sbi.2006.05.006
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Sequence comparison is a major step in the prediction of protein structure from existing templates in the Protein Data Bank. The identification of potentially remote homologues to be used as templates for modeling target sequences of unknown structure and their accurate alignment remain challenges, despite many years of study. The most recent advances have been in combining as many sources of information as possible - including amino acid variation in the form of profiles or hidden Markov models for both the target and template families, known and predicted secondary structures of the template and target, respectively, the combination of structure alignment for distant homologues and sequence alignment for close homologues to build better profiles, and the anchoring of certain regions of the alignment based on existing biological data. Newer technologies have been applied to the problem, including the use of support vector machines to tackle the fold classification problem for a target sequence and the alignment of hidden Markov models. Finally, using the consensus of many fold recognition methods, whether based on profile-profile alignments, threading or other approaches, continues to be one of the most successful strategies for both recognition and alignment of remote homologues. Although there is still room for improvement in identification and alignment methods, additional progress may come from model building and refinement methods that can compensate for large structural changes between remotely related targets and templates, as well as for regions of misalignment.
引用
收藏
页码:374 / 384
页数:11
相关论文
共 108 条
[21]   Detecting distant homology with Meta-BASIC [J].
Ginalski, K ;
von Grotthuss, M ;
Grishin, NV ;
Rychlewski, L .
NUCLEIC ACIDS RESEARCH, 2004, 32 :W576-W581
[22]   3D-Jury: a simple approach to improve protein structure predictions [J].
Ginalski, K ;
Elofsson, A ;
Fischer, D ;
Rychlewski, L .
BIOINFORMATICS, 2003, 19 (08) :1015-1018
[23]   Frequency of gaps observed in a structurally aligned protein pair database suggests a simple gap penalty function [J].
Goonesekere, NCW ;
Lee, B .
NUCLEIC ACIDS RESEARCH, 2004, 32 (09) :2838-2843
[24]   Fold change in evolution of protein structures [J].
Grishin, NV .
JOURNAL OF STRUCTURAL BIOLOGY, 2001, 134 (2-3) :167-185
[25]   Fold recognition by combining profile-profile alignment and support vector machine [J].
Han, SJ ;
Lee, BC ;
Yu, ST ;
Jeong, CS ;
Lee, S ;
Kim, D .
BIOINFORMATICS, 2005, 21 (11) :2667-2673
[26]   IDENTIFICATION OF NATIVE PROTEIN FOLDS AMONGST A LARGE NUMBER OF INCORRECT MODELS - THE CALCULATION OF LOW-ENERGY CONFORMATIONS FROM POTENTIALS OF MEAN FORCE [J].
HENDLICH, M ;
LACKNER, P ;
WEITCKUS, S ;
FLOECKNER, H ;
FROSCHAUER, R ;
GOTTSBACHER, K ;
CASARI, G ;
SIPPL, MJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 216 (01) :167-180
[27]   Efficient remote homology detection using local structure [J].
Hou, Y ;
Hsu, W ;
Lee, ML ;
Bystroff, C .
BIOINFORMATICS, 2003, 19 (17) :2294-2301
[28]   Remote homolog detection using local sequence-structure correlations [J].
Hou, YN ;
Hsu, W ;
Lee, ML ;
Bystroff, C .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2004, 57 (03) :518-530
[29]   Improved pairwise alignments of proteins in the Twilight Zone using local structure predictions [J].
Huang, YM ;
Bystroff, C .
BIOINFORMATICS, 2006, 22 (04) :413-422
[30]   A discriminative framework for detecting remote protein homologies [J].
Jaakkola, T ;
Diekhans, M ;
Haussler, D .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2000, 7 (1-2) :95-114