DOUTfinder - identification of distant domain outliers using subsignificant sequence similarity

被引:6
作者
Novatchkova, Maria
Schneider, Georg
Fritz, Richard
Eisenhaber, Frank
Schleiffer, Alexander
机构
[1] Res Inst Mol Pathol, A-1030 Vienna, Austria
[2] Med Univ Vienna, Inst Virol, A-1095 Vienna, Austria
关键词
D O I
10.1093/nar/gkl332
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
DOUTfinder is a web-based tool facilitating protein domain detection among related protein sequences in the twilight zone of sequence similarity. The sequence set required for this analysis can be provided by the user or will be collected using PSI-BLAST if a single sequence is given as an input. The obtained sequence family is analyzed for known Pfam and SMART domains, and the thereby identified subsignificant domain similarities are evaluated further. Domains with several subthreshold hits in the query set are ranked based on a sum-score function and likely homologous domains are suggested according to established cut-offs. By providing a post-filtering procedure for subsignificant domain hits DOUTfinder allows the detection of non-trivial domain relationships and can thereby lead to new insights into the function and evolution of distantly related sequence families. DOUTfinder is available at http://mendel.imp.ac.at/dout/.
引用
收藏
页码:W214 / W218
页数:5
相关论文
共 18 条
[1]  
Altschul SE, 1997, THEORETICAL AND COMPUTATIONAL METHODS IN GENOME RESEARCH, P1
[2]  
Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkp985, 10.1093/nar/gkr1065, 10.1093/nar/gkh121]
[3]   The ASTRAL Compendium in 2004 [J].
Chandonia, JM ;
Hon, G ;
Walker, NS ;
Lo Conte, L ;
Koehl, P ;
Levitt, M ;
Brenner, SE .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D189-D192
[4]   Enhanced protein domain discovery using taxonomy [J].
Coin, L ;
Bateman, A ;
Durbin, R .
BMC BIOINFORMATICS, 2004, 5 (1)
[5]   Protein domain analysis in the era of complete genomes [J].
Copley, RR ;
Doerks, T ;
Letunic, I ;
Bork, P .
FEBS LETTERS, 2002, 513 (01) :129-134
[6]   Pfam:: clans, web tools and services [J].
Finn, Robert D. ;
Mistry, Jaina ;
Schuster-Bockler, Benjamin ;
Griffiths-Jones, Sam ;
Hollich, Volker ;
Lassmann, Timo ;
Moxon, Simon ;
Marshall, Mhairi ;
Khanna, Ajay ;
Durbin, Richard ;
Eddy, Sean R. ;
Sonnhammer, Erik L. L. ;
Bateman, Alex .
NUCLEIC ACIDS RESEARCH, 2006, 34 :D247-D251
[7]   SMART 5: domains in the context of genomes and networks [J].
Letunic, Ivica ;
Copley, Richard R. ;
Pils, Birgit ;
Pinkert, Stefan ;
Schultz, Joerg ;
Bork, Peer .
NUCLEIC ACIDS RESEARCH, 2006, 34 :D257-D260
[8]   Clustering of highly homologous sequences to reduce the size of large protein databases [J].
Li, WZ ;
Jaroszewski, L ;
Godzik, A .
BIOINFORMATICS, 2001, 17 (03) :282-283
[9]  
Lupas A, 1996, METHOD ENZYMOL, V266, P513
[10]   CDD: a conserved domain database for protein classification [J].
Marchler-Bauer, A ;
Anderson, JB ;
Cherukuri, PF ;
DeWweese-Scott, C ;
Geer, LY ;
Gwadz, M ;
He, SQ ;
Hurwitz, DI ;
Jackson, JD ;
Ke, ZX ;
Lanczycki, CJ ;
Liebert, CA ;
Liu, CL ;
Lu, F ;
Marchler, GH ;
Mullokandov, M ;
Shoemaker, BA ;
Simonyan, V ;
Song, JS ;
Thiessen, PA ;
Yamashita, RA ;
Yin, JJ ;
Zhang, DC ;
Bryant, SH .
NUCLEIC ACIDS RESEARCH, 2005, 33 :D192-D196