On the role of structural information in remote homology detection and sequence alignment: New methods using hybrid sequence profiles

被引:62
作者
Tang, CL [1 ]
Xie, L [1 ]
Koh, IYY [1 ]
Posy, S [1 ]
Alexov, E [1 ]
Honig, B [1 ]
机构
[1] Columbia Univ, Howard Hughes Med Inst, Dept Biochem & Mol Biophys, New York, NY 10032 USA
关键词
multiple structure alignment; profile-profile alignments; hybrid profile; sequence alignment; homolog detection;
D O I
10.1016/j.jmb.2003.10.025
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Structural alignments often reveal relationships between proteins that cannot be detected using sequence alignment alone. However, profile search methods based entirely on structural alignments alone have not been found to be effective in finding remote homologs. Here, we explore the role of structural information in remote homolog detection and sequence alignment. To this end, we develop a series of hybrid multidimensional alignment profiles that combine sequence, secondary and tertiary structure information into hybrid profiles. Sequence-based profiles are profiles whose position-specific scoring matrix is derived from sequence alignment alone; structure-based profiles are those derived from multiple structure alignments. We compare pure sequence-based profiles to pure structure-based profiles, as well as to hybrid profiles that use combined sequence-and-structure-based profiles, where sequence-based profiles are used in loop/motif regions and structural information is used in core structural regions. All of the hybrid methods offer significant improvement over simple profile-to-profile alignment. We demonstrate that both sequence-based and structure-based profiles contribute to remote homology detection and alignment accuracy, and that each contains some unique information. We discuss the implications of these results for further improvements in amino acid sequence and structural analysis. (C) 2003 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1043 / 1062
页数:20
相关论文
共 82 条
[61]   Structure-derived substitution matrices for alignment of distantly related sequences [J].
Prlic, A ;
Domingues, FS ;
Sippl, MJ .
PROTEIN ENGINEERING, 2000, 13 (08) :545-550
[62]  
Reddy BVB, 2001, PROTEINS, V42, P148, DOI 10.1002/1097-0134(20010201)42:2<148::AID-PROT20>3.0.CO
[63]  
2-R
[64]   A 3D-1D substitution matrix for protein fold recognition that includes predicted secondary structure of the sequence [J].
Rice, DW ;
Eisenberg, D .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 267 (04) :1026-1038
[65]   PREDICTION OF PROTEIN SECONDARY STRUCTURE AT BETTER THAN 70-PERCENT ACCURACY [J].
ROST, B ;
SANDER, C .
JOURNAL OF MOLECULAR BIOLOGY, 1993, 232 (02) :584-599
[66]   Protein fold recognition by prediction-based threading [J].
Rost, B ;
Schneider, R ;
Sander, C .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 270 (03) :471-480
[67]  
Rost B, 1996, METHOD ENZYMOL, V266, P525
[68]   STRUCTURAL FEATURES CAN BE UNCONSERVED IN PROTEINS WITH SIMILAR FOLDS - AN ANALYSIS OF SIDE-CHAIN TO SIDE-CHAIN CONTACTS SECONDARY STRUCTURE AND ACCESSIBILITY [J].
RUSSELL, RB ;
BARTON, GJ .
JOURNAL OF MOLECULAR BIOLOGY, 1994, 244 (03) :332-350
[69]   Comparison of sequence profiles. Strategies for structural predictions using sequence information [J].
Rychlewski, L ;
Jaroszewski, L ;
Li, WZ ;
Godzik, A .
PROTEIN SCIENCE, 2000, 9 (02) :232-241
[70]   COMPASS: A tool for comparison of multiple protein alignments with assessment of statistical significance [J].
Sadreyev, R ;
Grishin, N .
JOURNAL OF MOLECULAR BIOLOGY, 2003, 326 (01) :317-336