Loop-Length-Dependent SVM Prediction of Domain Linkers for High-Throughput Structural Proteomics

Cited by: 35
Authors
Ebina, Teppei [2]
Toh, Hiroyuki [1]
Kuroda, Yutaka [2]
Affiliations
[1] Kyushu Univ, Med Inst Bioregulat, Div Bioinformat, Higashi Ku, Fukuoka 8128582, Japan
[2] Tokyo Univ Agr & Technol, Dept Biotechnol & Life Sci, Koganei, Tokyo 1848588, Japan
Funding
Japan Society for the Promotion of Science
Keywords
support vector machine; high throughput protein dissection; structural domains; proteomics; protein secondary structure; boundary prediction; identification; regions; assignment
DOI
10.1002/bip.21105
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
The prediction of structural domains in novel protein sequences is becoming of practical importance. One important area of application is the development of computer-aided techniques for identifying, at a low cost, novel protein domain targets for large-scale functional and structural proteomics. Here, we report a loop-length-dependent support vector machine (SVM) prediction of domain linkers, which are loops separating two structural domains. (DLP-SVM is freely available at: http://www.tuat.ac.jp/~domserv/cgi-bin/DLP-SVM.cgi.) We constructed three loop-length-dependent SVM predictors of domain linkers (SVM-All, SVM-Long and SVM-Short), and also built SVM-Joint, which combines the results of SVM-Short and SVM-Long into a single consolidated prediction. The performance of SVM-Joint was, in most aspects, the highest, with a sensitivity of 59.7% and a specificity of 43.6%, which indicated that the specificity and the sensitivity were improved by over 2 and 3%, respectively, when loop-length-dependent characteristics were taken into account. Furthermore, the sensitivity and specificity of SVM-Joint were, respectively, 37.6 and 17.4% higher than those of a random guess, and also superior to those of previously reported domain linker predictors. These results indicate that SVMs can be used to predict domain linkers, and that loop-length-dependent characteristics are useful for improving SVM prediction performance. © 2008 Wiley Periodicals, Inc. Biopolymers (Pept Sci) 92: 1-8, 2009.
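The arithmetic behind the random-guess comparison can be sketched as follows. This is only an illustration, and it rests on one assumption that the abstract does not state: that "higher than" means an absolute difference in percentage points rather than a relative improvement. The function name `implied_random_baseline` is hypothetical and is not part of DLP-SVM.

```python
# Figures quoted in the abstract, in percent.
SVM_JOINT = {"sensitivity": 59.7, "specificity": 43.6}
GAIN_OVER_RANDOM = {"sensitivity": 37.6, "specificity": 17.4}


def implied_random_baseline(reported, gain):
    """Back out the random-guess value for each metric.

    Assumption (not stated in the abstract): the gains are absolute
    percentage-point differences, so baseline = reported - gain.
    """
    return {metric: round(reported[metric] - gain[metric], 1)
            for metric in reported}


baseline = implied_random_baseline(SVM_JOINT, GAIN_OVER_RANDOM)
for metric, value in baseline.items():
    print(f"{metric}: random guess ~ {value}%")
```

Under that reading, the random-guess baseline would sit near 22.1% sensitivity and 26.2% specificity; if the gains were instead relative, the baselines would differ.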
Pages: 1-8
Number of pages: 8