Computer-assisted protein domain boundary prediction using the Dom-Pred server

被引:50
作者
Bryson, Kevin
Cozzetto, Domenico
Jones, David T.
机构
[1] UCL, Dept Comp Sci, London WC1E 6BT, England
[2] Univ Roma La Sapienza, Dept Biochem Sci Rossi Fanelli, I-00185 Rome, Italy
关键词
D O I
10.2174/138920307780363415
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Domain prediction from sequence is a particularly challenging task, and currently, a large variety of different methodologies are employed to tackle the task. Here we try to classify these diverse approaches into a number of broad categories. Completely automatic domain prediction from sequence alone is currently fraught with problems, but this should not be so surprising since human experts currently have significant disagreement on domain assignment even when given the structures. It can be argued that we should only test the domain prediction methods on benchmark data that human experts agree upon and this is the approach we take in this paper. Even for the data sets on which human experts agree, automatic structure-based domain assignment still cannot always agree, and so again it is still unlikely that domain prediction methods will reliably obtain correct results completely automatically. We make the argument that computer-assisted domain prediction is a more achievable goal. With this aim in mind, we present the DomPred server. This server provides the user with the results from two completely different categories of method (DPS and DomSSEA). In this paper, each method is individually benchmarked against one of the latest domain prediction benchmarks to provide information about their respective reliabilities. A variety of different benchmark scores are employed since the accuracy of a domain prediction method depends critically on what types of results one wishes to obtain (single/multi-domain classification, domain number, residue linker positions, etc.). Also both of these methods, implemented within the DomPred server, can suggest alternative domain predictions, allowing the user to make the final decision based on these results and applying their own background knowledge to the problem. The DomPred server is available from the URL.
引用
收藏
页码:181 / 188
页数:8
相关论文
共 32 条
[11]   SnapDRAGON: a method to delineate protein structural domains from sequence data [J].
George, RA ;
Heringa, J .
JOURNAL OF MOLECULAR BIOLOGY, 2002, 316 (03) :839-851
[12]   Protein domain identification and improved sequence similarity searching using PSI-BLAST [J].
George, RA ;
Heringa, J .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2002, 48 (04) :672-681
[13]   Partitioning protein structures into domains: Why is it so difficult? [J].
Holland, Timothy A. ;
Veretnik, Stella ;
Shindyalov, Ilya N. ;
Bourne, Philip E. .
JOURNAL OF MOLECULAR BIOLOGY, 2006, 361 (03) :562-590
[14]   FFAS03: a server for profile-profile sequence alignments [J].
Jaroszewski, L ;
Rychlewski, L ;
Li, ZW ;
Li, WZ ;
Godzik, A .
NUCLEIC ACIDS RESEARCH, 2005, 33 :W284-W288
[15]   Protein secondary structure prediction based on position-specific scoring matrices [J].
Jones, DT .
JOURNAL OF MOLECULAR BIOLOGY, 1999, 292 (02) :195-202
[16]   Automated prediction of domain boundaries in CASP6 targets using Ginzu and RosettaDOM [J].
Kim, DE ;
Chivian, D ;
Malmström, L ;
Baker, D .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 61 :193-200
[17]   Relative rates of gene fusion and fission in multi-domain proteins [J].
Kummerfeld, SK ;
Teichmann, SA .
TRENDS IN GENETICS, 2005, 21 (01) :25-30
[18]   SMART 5: domains in the context of genomes and networks [J].
Letunic, Ivica ;
Copley, Richard R. ;
Pils, Birgit ;
Pinkert, Stefan ;
Schultz, Joerg ;
Bork, Peer .
NUCLEIC ACIDS RESEARCH, 2006, 34 :D257-D260
[19]   Sequence-based prediction of protein domains [J].
Liu, JF ;
Rost, B .
NUCLEIC ACIDS RESEARCH, 2004, 32 (12) :3522-3530
[20]   CHOP proteins into structural domain-like fragments [J].
Liu, JF ;
Rost, B .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2004, 55 (03) :678-688