Computer-assisted protein domain boundary prediction using the DomPred server

Cited by: 50
Authors
Bryson, Kevin
Cozzetto, Domenico
Jones, David T.
Affiliations
[1] UCL, Dept Comp Sci, London WC1E 6BT, England
[2] Univ Roma La Sapienza, Dept Biochem Sci Rossi Fanelli, I-00185 Rome, Italy
DOI
10.2174/138920307780363415
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology];
Subject classification codes
071010 ; 081704 ;
Abstract
Predicting domains from sequence is a particularly challenging task, and a wide variety of methodologies are currently employed to tackle it. Here we classify these diverse approaches into a number of broad categories. Completely automatic domain prediction from sequence alone is fraught with problems, which is not surprising given that human experts disagree significantly on domain assignment even when given the structures. It can be argued that domain prediction methods should be tested only on benchmark data on which human experts agree, and this is the approach we take in this paper. Even for these data sets, automatic structure-based domain assignment methods do not always agree, so it remains unlikely that domain prediction methods will reliably obtain correct results completely automatically. We argue that computer-assisted domain prediction is a more achievable goal. With this aim in mind, we present the DomPred server, which provides the user with results from two completely different categories of method (DPS and DomSSEA). Each method is individually benchmarked against one of the latest domain prediction benchmarks to provide information about its reliability. A variety of benchmark scores are employed, since the accuracy of a domain prediction method depends critically on the type of result one wishes to obtain (single/multi-domain classification, domain number, residue linker positions, etc.). Both methods, as implemented within the DomPred server, can also suggest alternative domain predictions, allowing users to make the final decision based on these results and their own background knowledge of the problem. The DomPred server is available from the URL.
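The abstract notes that measured accuracy depends on which kind of result is scored. The following is a minimal illustrative sketch of that point, not the benchmark procedure used in the paper. The function name `score_prediction`, the representation of a prediction as a list of boundary residue positions, the assumption that domain number equals boundary count plus one (continuous domains only), and the tolerance of 20 residues are all assumptions made for illustration.

```python
from typing import Dict, List, Optional


def score_prediction(
    true_boundaries: List[int],
    predicted_boundaries: List[int],
    tolerance: int = 20,  # assumed window in residues; not taken from the paper
) -> Dict[str, Optional[object]]:
    """Score one protein under three different criteria (hypothetical helper)."""
    # Criterion 1: single- vs multi-domain classification.
    # A protein with no boundaries is treated as single-domain.
    class_correct = bool(true_boundaries) == bool(predicted_boundaries)

    # Criterion 2: domain number, assuming continuous domains so that
    # number of domains = number of boundaries + 1.
    number_correct = len(true_boundaries) == len(predicted_boundaries)

    # Criterion 3: boundary positions. Each predicted boundary may match at
    # most one true boundary lying within +/- tolerance residues.
    unmatched = sorted(true_boundaries)
    hits = 0
    for p in sorted(predicted_boundaries):
        close = [t for t in unmatched if abs(t - p) <= tolerance]
        if close:
            nearest = min(close, key=lambda t: abs(t - p))
            unmatched.remove(nearest)
            hits += 1

    precision = hits / len(predicted_boundaries) if predicted_boundaries else None
    recall = hits / len(true_boundaries) if true_boundaries else None
    return {
        "class_correct": class_correct,
        "domain_number_correct": number_correct,
        "boundary_hits": hits,
        "boundary_precision": precision,
        "boundary_recall": recall,
    }


# A three-domain protein where only one of two predicted boundaries is close.
example = score_prediction([120, 250], [110, 300])
```

In this example the prediction is right about the multi-domain class and the domain number, yet places only one of the two boundaries within the assumed window, so the same prediction looks good or mediocre depending on the score chosen.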
Pages: 181-188
Page count: 8