FISH: family identification of sequence homologues using structure anchored hidden Markov models

Cited by: 5
Authors
Tangrot, Jeanette
Wang, Lixiao
Kagstrom, Bo
Sauer, Uwe H.
Affiliations
[1] Umea Univ, UCMP, Umea Ctr Mol Pathogenesis, S-90187 Umea, Sweden
[2] Umea Univ, Dept Comp Sci, S-90187 Umea, Sweden
[3] Umea Univ, High Performance Comp Ctr N, S-90187 Umea, Sweden
DOI
10.1093/nar/gkl330
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010; 081704
Abstract
The FISH server is highly accurate in identifying the family membership of domains in a query protein sequence, even when sequence identity to known homologues is very low. A performance test using SCOP sequences and an E-value cut-off of 0.1 showed that 99.3% of the top hits are to the correct family saHMM. Matches to a query sequence provide the user not only with an annotation of the identified domains, and hence a hint as to their function, but also with probable 2D and 3D structures, as well as pairwise and multiple sequence alignments to homologues with low sequence identity. In addition, the FISH server allows users to upload and search their own protein sequence collection or to query public protein sequence databases with individual saHMMs. The FISH server can be accessed at http://babel.ucmp.umu.se/fish/.
Pages: W10-W14
Page count: 5
Related papers
9 items in total
[1]  
Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI 10.1093/nar/gkh121
[2]   The ASTRAL Compendium in 2004 [J].
Chandonia, JM ;
Hon, G ;
Walker, NS ;
Lo Conte, L ;
Koehl, P ;
Levitt, M ;
Brenner, SE .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D189-D192
[3]   Profile hidden Markov models [J].
Eddy, SR .
BIOINFORMATICS, 1998, 14 (09) :755-763
[4]   SMART 5: domains in the context of genomes and networks [J].
Letunic, Ivica ;
Copley, Richard R. ;
Pils, Birgit ;
Pinkert, Stefan ;
Schultz, Joerg ;
Bork, Peer .
NUCLEIC ACIDS RESEARCH, 2006, 34 :D257-D260
[5]   The SUPERFAMILY database in 2004: additions and improvements [J].
Madera, M ;
Vogel, C ;
Kummerfeld, SK ;
Chothia, C ;
Gough, J .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D235-D239
[6]   CD-Search: protein domain annotations on the fly [J].
Marchler-Bauer, A ;
Bryant, SH .
NUCLEIC ACIDS RESEARCH, 2004, 32 :W327-W331
[7]  
MURZIN AG, 1995, J MOL BIOL, V247, P536, DOI 10.1016/S0022-2836(05)80134-2
[8]   Twilight zone of protein sequence alignments [J].
Rost, B .
PROTEIN ENGINEERING, 1999, 12 (02) :85-94
[9]   Multiple protein-sequence alignment from tertiary structure comparison - assignment of global and residue confidence levels [J].
RUSSELL, RB ;
BARTON, GJ .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 1992, 14 (02) :309-323