Quasi-consensus-based comparison of profile hidden Markov models for protein sequences

被引:11
作者
Kahsay, RY
Wang, GL
Gao, G
Liao, L [1 ]
Dunbrack, R
机构
[1] Delaware Biotechnol Inst, Newark, DE 19715 USA
[2] Fox Chase Canc Ctr, Philadelphia, PA 19111 USA
[3] Univ Delaware, Dept Comp & Informat Sci, Newark, DE 19716 USA
基金
美国国家卫生研究院;
关键词
D O I
10.1093/bioinformatics/bti374
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
A simple approach for the sensitive detection of distant relationships among protein families and for sequence-structure alignment via comparison of hidden Markov models based on their quasi-consensus sequences is presented. Using a previously published benchmark dataset, the approach is demonstrated to give better homology detection and yield alignments with improved accuracy in comparison to an existing state-of-the-art dynamic programming profile-profile comparison method. This method also runs significantly faster and is therefore suitable for a server covering the rapidly increasing structure database. A server based on this method is available at http://liao.cis.udel.edu/website/servers/modmod.
引用
收藏
页码:2287 / 2293
页数:7
相关论文
共 23 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkp985, 10.1093/nar/gkr1065, 10.1093/nar/gkh121]
[3]  
Durbin R., 1998, Biological sequence analysis: Probabilistic models of proteins and nucleic acids
[4]   Profile hidden Markov models [J].
Eddy, SR .
BIOINFORMATICS, 1998, 14 (09) :755-763
[5]   A comparison of scoring functions for protein sequence profile alignment [J].
Edgar, RC ;
Sjölander, K .
BIOINFORMATICS, 2004, 20 (08) :1301-1308
[6]   SUPERFAMILY: HMMs representing all proteins of known structure. SCOP sequence searches, alignments and genome assignments [J].
Gough, J ;
Chothia, C .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :268-272
[7]   Use of receiver operating characteristic (ROC) analysis to evaluate sequence matching [J].
Gribskov, M ;
Robinson, NL .
COMPUTERS & CHEMISTRY, 1996, 20 (01) :25-33
[8]   PROFILE ANALYSIS - DETECTION OF DISTANTLY RELATED PROTEINS [J].
GRIBSKOV, M ;
MCLACHLAN, AD ;
EISENBERG, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1987, 84 (13) :4355-4358
[9]   DALI - A NETWORK TOOL FOR PROTEIN-STRUCTURE COMPARISON [J].
HOLM, L ;
SANDER, C .
TRENDS IN BIOCHEMICAL SCIENCES, 1995, 20 (11) :478-480
[10]   Hidden Markov models for detecting remote protein homologies [J].
Karplus, K ;
Barrett, C ;
Hughey, R .
BIOINFORMATICS, 1998, 14 (10) :846-856