SAWTED: Structure Assignment With Text Description - Enhanced detection, of remote homologues with automated SWISS-PROT annotation comparisons

被引:47
作者
MacCallum, RM [1 ]
Kelley, LA [1 ]
Sternberg, MJE [1 ]
机构
[1] Imperial Canc Res Fund, Biomolec Modelling Lab, London WC2A 3PX, England
关键词
D O I
10.1093/bioinformatics/16.2.125
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Sequence database search methods often identify putative sub-threshold hits of known function or structure for a given query sequence. It is widespread practice to filter these hits by hand using knowledge of function and other factors; to the expert, some hits may appear more sensible than others. SAWTED (Structure Assignment With Text Description) is an automated solution to this post-filtering problem which will be applicable to large scale genome assignments. Results: A standard document comparison algorithm is applied to text descriptions extracted from SWISS-PROT annotations. The added value of SAWTED in combination with PSI-BLAST has been shown with a benchmark of difficult remote homologues taken from the SCOP structure database.
引用
收藏
页码:125 / 129
页数:5
相关论文
共 14 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
ALTSCHUL SF, 1990, J MOL BIOL, V215, P403, DOI 10.1006/jmbi.1990.9999
[3]   Automatic extraction of keywords from scientific text: application to the knowledge domain of protein families [J].
Andrade, MA ;
Valencia, A .
BIOINFORMATICS, 1998, 14 (07) :600-607
[4]  
[Anonymous], METHOD ENZYMOL
[5]   THE SWISS-PROT PROTEIN-SEQUENCE DATA-BANK [J].
BAIROCH, A ;
BOECKMANN, B .
NUCLEIC ACIDS RESEARCH, 1991, 19 :2247-2248
[6]   Assessing sequence comparison methods with reliable structurally identified distant evolutionary relationships [J].
Brenner, SE ;
Chothia, C ;
Hubbard, TJP .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (11) :6073-6078
[7]   MView: a web-compatible database search or multiple alignment viewer [J].
Brown, NP ;
Leroy, C ;
Sander, C .
BIOINFORMATICS, 1998, 14 (04) :380-381
[8]   Protein annotation: detective work for function prediction [J].
Doerks, T ;
Bairoch, A ;
Bork, P .
TRENDS IN GENETICS, 1998, 14 (06) :248-250
[9]  
Fischer D, 1999, PROTEINS, P209
[10]   Heme packing motifs revealed by the crystal structure of the tetra-heme cytochrome c554 from Nitrosomonas europaea [J].
Iverson, TM ;
Arciero, DM ;
Hsu, BT ;
Logan, MSP ;
Hooper, AB ;
Rees, DC .
NATURE STRUCTURAL BIOLOGY, 1998, 5 (11) :1005-1012