Cascleave: towards more accurate prediction of caspase substrate cleavage sites

被引:137
作者
Song, Jiangning [1 ,2 ]
Tan, Hao [1 ,3 ]
Shen, Hongbin [4 ]
Mahmood, Khalid [1 ,5 ]
Boyd, Sarah E. [3 ]
Webb, Geoffrey I. [3 ]
Akutsu, Tatsuya [2 ]
Whisstock, James C. [1 ,5 ]
机构
[1] Monash Univ, Dept Biochem & Mol Biol, Melbourne, Vic 3800, Australia
[2] Kyoto Univ, Inst Chem Res, Bioinformat Ctr, Kyoto 6110011, Japan
[3] Monash Univ, Fac Informat Technol, Melbourne, Vic 3800, Australia
[4] Shanghai Jiao Tong Univ, Inst Image Proc & Pattern Recognit, Shanghai 200240, Peoples R China
[5] Monash Univ, ARC Ctr Excellence Struct & Funct Microbial Genom, Melbourne, Vic 3800, Australia
基金
中国国家自然科学基金; 澳大利亚研究理事会; 日本学术振兴会; 英国医学研究理事会;
关键词
SUPPORT VECTOR REGRESSION; SVM-BASED PREDICTION; SECONDARY STRUCTURE; PROTEIN SEQUENCES; AMINO-ACID; DATABASE; IDENTIFICATION; APOPTOSIS; SERVER; DNA;
D O I
10.1093/bioinformatics/btq043
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The caspase family of cysteine proteases play essential roles in key biological processes such as programmed cell death, differentiation, proliferation, necrosis and inflammation. The complete repertoire of caspase substrates remains to be fully characterized. Accordingly, systematic computational screening studies of caspase substrate cleavage sites may provide insight into the substrate specificity of caspases and further facilitating the discovery of putative novel substrates. Results: In this article we develop an approach (termed Cascleave) to predict both classical (i.e. following a P-1 Asp) and non-typical caspase cleavage sites. When using local sequence-derived profiles, Cascleave successfully predicted 82.2% of the known substrate cleavage sites, with a Matthews correlation coefficient (MCC) of 0.667. We found that prediction performance could be further improved by incorporating information such as predicted solvent accessibility and whether a cleavage sequence lies in a region that is most likely natively unstructured. Novel bi-profile Bayesian signatures were found to significantly improve the prediction performance and yielded the best performance with an overall accuracy of 87.6% and a MCC of 0.747, which is higher accuracy than published methods that essentially rely on amino acid sequence alone. It is anticipated that Cascleave will be a powerful tool for predicting novel substrate cleavage sites of caspases and shedding new insights on the unknown caspase-substrate interactivity relationship.
引用
收藏
页码:752 / 760
页数:9
相关论文
共 54 条
[1]   Prediction of guide strand of microRNAs from its sequence and secondary structure [J].
Ahmed, Firoz ;
Ansari, Hifzur Rahman ;
Raghava, Gajendra P. S. .
BMC BIOINFORMATICS, 2009, 10
[2]  
[Anonymous], 1999, Advances in kernel methods: Support vector learning
[3]  
[Anonymous], 2000, NATURE STAT LEARNING, DOI DOI 10.1007/978-1-4757-3264-1
[4]   GraBCas: a bioinformatics tool for score-based prediction of caspase- and granzyme B-cleavage sites in protein sequences [J].
Backes, C ;
Kuentzer, J ;
Lenhof, HP ;
Comtesse, N ;
Meese, E .
NUCLEIC ACIDS RESEARCH, 2005, 33 :W208-W213
[5]   The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000 [J].
Bairoch, A ;
Apweiler, R .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :45-48
[6]  
BROCKINGTON I, 1998, ARCH WOMEN MENT HLTH, V1, P3
[7]   Identification of novel DNA repair proteins via primary sequence, secondary structure, and homology [J].
Brown, J. B. ;
Akutsu, Tatsuya .
BMC BIOINFORMATICS, 2009, 10
[8]   SCRATCH: a protein structure and structural feature prediction server [J].
Cheng, J ;
Randall, AZ ;
Sweredoski, MJ ;
Baldi, P .
NUCLEIC ACIDS RESEARCH, 2005, 33 :W72-W76
[9]   Caspase-14 reveals its secrets [J].
Denecker, Geertrui ;
Ovaere, Petra ;
Vandenabeele, Peter ;
Declercq, Wim .
JOURNAL OF CELL BIOLOGY, 2008, 180 (03) :451-458
[10]   Global mapping of the topography and magnitude of proteolytic events in apoptosis [J].
Dix, Melissa M. ;
Simon, Gabriel M. ;
Cravatt, Benjamin F. .
CELL, 2008, 134 (04) :679-691