Specificity rule discovery in HIV-1 protease cleavage site analysis

被引:15
作者
Kim, Hyeoncheol [1 ]
Zhang, Yiying [2 ]
Heo, Yong-Seok [3 ]
Oh, Heung-Bum [4 ,5 ]
Chen, Su-Shing [2 ]
机构
[1] Korea Univ, Dept Comp Sci Educ, Seoul 136701, South Korea
[2] Univ Florida, Gainesville, FL 32611 USA
[3] Konkuk Univ, Dept Chem, Seoul 143701, South Korea
[4] Asan Med Ctr, Dept Lab Med, Seoul, South Korea
[5] Univ Ulsan, Seoul, South Korea
关键词
HIV-1 cleavage site prediction rule discovery;
D O I
10.1016/j.compbiolchem.2007.09.006
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Several machine learning algorithms have recently been applied to modeling the specificity of HIV-1 protease. The problem is challenging because of the three issues as follows: (1) datasets with high dimensionality and small number of samples could misguide classification modeling and its interpretation; (2) symbolic interpretation is desirable because it provides us insight to the specificity in the form of human-understandable rules, and thus helps us to design effective HIV inhibitors; (3) the interpretation should take into account complexity or dependency between positions in sequences. Therefore, it is neccessary to investigate multivariate and feature-selective methods to model the specificity and to extract rules from the model. We have tested extensively various machine learning methods, and we have found that the combination of neural networks and decompositional approach can generate a set of effective rules. By validation to experimental results for the HIV-1 protease, the specificity rules outperform the ones generated by frequency-based, univariate or black-box methods. (C) 2007 Elsevier Ltd. All rights reserved.
引用
收藏
页码:72 / 79
页数:8
相关论文
共 41 条
[1]   Survey and critique of techniques for extracting rules from trained artificial neural networks [J].
Andrews, R ;
Diederich, J ;
Tickle, AB .
KNOWLEDGE-BASED SYSTEMS, 1995, 8 (06) :373-389
[2]  
Beck HP, 2002, ADV HUM PER, V2, P37, DOI 10.1016/S1479-3601(02)02005-2
[3]   Identification of efficiently cleaved substrates for HIV-1 protease using a phage display library and use in inhibitor development [J].
Beck, ZQ ;
Hervio, L ;
Dawson, PE ;
Elder, JH ;
Madison, EL .
VIROLOGY, 2000, 274 (02) :391-401
[4]   Molecular basis for the relative substrate specificity of human immunodeficiency virus type 1 and feline immunodeficiency virus proteases [J].
Beck, ZQ ;
Lin, PC ;
Elder, JH .
JOURNAL OF VIROLOGY, 2001, 75 (19) :9458-9469
[5]   Resistance to human immunodeficiency virus type 1 protease inhibitors [J].
Boden, D ;
Markowitz, M .
ANTIMICROBIAL AGENTS AND CHEMOTHERAPY, 1998, 42 (11) :2775-2783
[6]   HIV-1 protease: mechanism and drug discovery [J].
Brik, A ;
Wong, CH .
ORGANIC & BIOMOLECULAR CHEMISTRY, 2003, 1 (01) :5-14
[7]   Artificial neural network model for predicting HIV protease cleavage sites in protein [J].
Cai, YD ;
Chou, KC .
ADVANCES IN ENGINEERING SOFTWARE, 1998, 29 (02) :119-128
[8]   Support vector machines for predicting HIV protease cleavage sites in protein [J].
Cai, YD ;
Liu, XJ ;
Xu, XB ;
Chou, KC .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 2002, 23 (02) :267-274
[9]   Positive selection detection in 40,000 human immunodeficiency virus (HIV) type 1 sequences automatically identifies drug resistance and positive fitness mutations in HIV protease and reverse transcriptase [J].
Chen, LM ;
Perlina, A ;
Lee, CJ .
JOURNAL OF VIROLOGY, 2004, 78 (07) :3722-3732
[10]   Prediction of human immunodeficiency virus protease cleavage sites in proteins [J].
Chou, KC .
ANALYTICAL BIOCHEMISTRY, 1996, 233 (01) :1-14