Support vector machines for predicting HIV protease cleavage sites in protein

被引:134
作者
Cai, YD
Liu, XJ
Xu, XB
Chou, KC
机构
[1] Chinese Acad Sci, Shanghai Res Ctr Biotechnol, Shanghai 200233, Peoples R China
[2] Univ Edinburgh, Inst Cell Anim & Populat Biol, Edinburgh EH9 3JT, Midlothian, Scotland
[3] Cardiff Univ, Dept Comp Sci, Coll Cardiff, Cardiff CF2 3XF, S Glam, Wales
[4] Upjohn Co, Upjohn Labs, Comp Aided Drug Discovery, Kalamazoo, MI 49001 USA
关键词
HIV protease; support vector machine; cleavage sites; self-consistency; jackknife test;
D O I
10.1002/jcc.10017
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Knowledge of the polyprotein cleavage sites by HIV protease will refine our understanding of its specificity, and the information thus acquired is useful for designing specific and efficient HIV protease inhibitors. The pace in searching for the proper inhibitors of HIV protease will be greatly expedited if one can find an accurate, robust, and rapid method for predicting the cleavage sites in proteins by HIV protease. In this article, a Support Vector Machine is applied to predict the cleavability of oligopeptides by proteases with multiple and extended specificity subsites. We selected HIV-1 protease as the subject of the study. Two hundred ninety-nine oligopeptides were chosen for the training set, while the other 63 oligopeptides were taken as a test set. Because of its high rate of self-consistency (299/299 = 100%), a good result in the jackknife test (286/299 = 95%) and correct prediction rate (55/63 = 87%), it is expected that the Support Vector Machine method can be referred to as a useful assistant technique for finding effective inhibitors of HIV protease, which is one of the targets in designing potential drugs against AIDS. The principle of the Support Vector Machine method can also be applied to analyzing the specificity of other multisubsite enzymes. (C) 2002 John Wiley Sons, Inc.
引用
收藏
页码:267 / 274
页数:8
相关论文
共 32 条
[1]  
[Anonymous], 1999, INT C MACH LEARN ICM
[2]  
BURBIDGE R, 2000, P AISB 00 S ART INT, P1
[3]   Artificial neural network model for predicting HIV protease cleavage sites in protein [J].
Cai, YD ;
Chou, KC .
ADVANCES IN ENGINEERING SOFTWARE, 1998, 29 (02) :119-128
[4]   Is it a paradox or misinterpretation? [J].
Cai, YD .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 2001, 43 (03) :336-338
[5]   A FORMULATION FOR CORRELATING PROPERTIES OF PEPTIDES AND ITS APPLICATION TO PREDICTING HUMAN-IMMUNODEFICIENCY-VIRUS PROTEASE-CLEAVABLE SITES IN PROTEINS [J].
CHOU, JJ .
BIOPOLYMERS, 1993, 33 (09) :1405-1414
[6]   PREDICTING CLEAVABILITY OF PEPTIDE SEQUENCES BY HIV PROTEASE VIA CORRELATION-ANGLE APPROACH [J].
CHOU, JJ .
JOURNAL OF PROTEIN CHEMISTRY, 1993, 12 (03) :291-302
[7]  
Chou KC, 1996, PROTEINS, V24, P51, DOI 10.1002/(SICI)1097-0134(199601)24:1&lt
[8]  
51::AID-PROT4&gt
[9]  
3.0.CO
[10]  
2-R