Wurst: a protein threading server with a structural scoring function, sequence profiles and optimized substitution matrices

被引:38
作者
Torda, AE
Procter, JB
Huber, T
机构
[1] Univ Hamburg, Zentrum Bioinformat, D-20146 Hamburg, Germany
[2] Univ Queensland, Dept Math, Brisbane, Qld 4072, Australia
[3] Univ Queensland, Dept Biochem, Brisbane, Qld 4072, Australia
关键词
D O I
10.1093/nar/gkh357
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Wurst is a protein threading program with an emphasis on high quality sequence to structure alignments (http://www.zbh.uni-hamburg.de/wurst). Submitted sequences are aligned to each of about 3000 templates with a conventional dynamic programming algorithm, but using a score function with sophisticated structure and sequence terms. The structure terms are a log-odds probability of sequence to structure fragment compatibility, obtained from a Bayesian classification procedure. A simplex optimization was used to optimize the sequence-based terms for the goal of alignment and model quality and to balance the sequence and structural contributions against each other. Both sequence and structural terms operate with sequence profiles.
引用
收藏
页码:W532 / W535
页数:4
相关论文
共 38 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
[Anonymous], 1978, Atlas of protein sequence and structure
[3]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[4]   A Shannon entropy-based filter detects high-quality profile-profile alignments in searches for remote homologues [J].
Capriotti, E ;
Fariselli, P ;
Rossi, I ;
Casadio, R .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2004, 54 (02) :351-360
[5]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[6]   Profile hidden Markov models [J].
Eddy, SR .
BIOINFORMATICS, 1998, 14 (09) :755-763
[7]   Hidden Markov models [J].
Eddy, SR .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 1996, 6 (03) :361-365
[8]   A study on protein sequence alignment quality [J].
Elofsson, A .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2002, 46 (03) :330-339
[9]   TOPOLOGY FINGERPRINT APPROACH TO THE INVERSE PROTEIN FOLDING PROBLEM [J].
GODZIK, A ;
KOLINSKI, A ;
SKOLNICK, J .
JOURNAL OF MOLECULAR BIOLOGY, 1992, 227 (01) :227-238
[10]  
GODZIK A, 2003, STRUCTURAL BIOINFORM, V44, P525