Development of linear, ensemble, and nonlinear models for the prediction and interpretation of the biological activity of a set of PDGFR inhibitors

被引:71
作者
Guha, R [1 ]
Jurs, PC [1 ]
机构
[1] Penn State Univ, Dept Chem, University Pk, PA 16802 USA
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2004年 / 44卷 / 06期
关键词
D O I
10.1021/ci049849f
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
A QSAR modeling Study has been done with a set of 79 piperazyinylquinazoline analogues which exhibit PDGFR inhibition. Linear regression and nonlinear computational neural network models were developed. The regression model was developed with a focus on interpretative ability using a PLS technique. However, it also exhibits a good predictive ability after outlier removal. The nonlinear CNN model had superior predictive ability compared to the linear model with a training,, set error of 0.22 log(IC50) units (R-2 = 0.93) and a prediction set error of 0.32 log(IC50) units (R-2 = 0.61). A random forest model was also developed to provide an alternate measure of descriptor importance. This approach ranks descriptors, and its results confirm the importance of specific descriptors as characterized by the PLS technique. In addition the neural network model contains the two most important descriptors indicated by the random forest model.
引用
收藏
页码:2179 / 2189
页数:11
相关论文
共 53 条
[1]  
[Anonymous], PHYS CHEM PROPERTIES
[2]   HIGHLY DISCRIMINATING DISTANCE-BASED TOPOLOGICAL INDEX [J].
BALABAN, AT .
CHEMICAL PHYSICS LETTERS, 1982, 89 (05) :399-404
[3]   Synthesis and tyrosine kinase inhibitory activity of a series of 2-amino-8H-pyrido[2,3-d]pyrimidines:: Identification of potent, selective platelet-derived growth factor receptor tyrosine kinase inhibitors [J].
Boschelli, DH ;
Wu, ZP ;
Klutchko, SR ;
Showalter, HDH ;
Hamby, JM ;
Lu, GH ;
Major, TC ;
Dahring, TK ;
Batley, B ;
Panek, RL ;
Keiser, J ;
Hartl, BG ;
Kraker, AJ ;
Klohs, WD ;
Roberts, BJ ;
Patmore, S ;
Elliott, WL ;
Steinkampf, R ;
Bradford, LA ;
Hallak, H ;
Doherty, AM .
JOURNAL OF MEDICINAL CHEMISTRY, 1998, 41 (22) :4365-4377
[4]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[5]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[6]  
Breiman L., 2017, Classification And Regression Trees, DOI [10.1201/9781315139470, DOI 10.1201/9781315139470]
[7]   ATOM PAIRS AS MOLECULAR-FEATURES IN STRUCTURE ACTIVITY STUDIES - DEFINITION AND APPLICATIONS [J].
CARHART, RE ;
SMITH, DH ;
VENKATARAGHAVAN, R .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1985, 25 (02) :64-73
[8]   CROSS-VALIDATION, BOOTSTRAPPING, AND PARTIAL LEAST-SQUARES COMPARED WITH MULTIPLE-REGRESSION IN CONVENTIONAL QSAR STUDIES [J].
CRAMER, RD ;
BUNCE, JD ;
PATTERSON, DE ;
FRANK, IE .
QUANTITATIVE STRUCTURE-ACTIVITY RELATIONSHIPS, 1988, 7 (01) :18-25
[9]   COMPARATIVE MOLECULAR-FIELD ANALYSIS (COMFA) .1. EFFECT OF SHAPE ON BINDING OF STEROIDS TO CARRIER PROTEINS [J].
CRAMER, RD ;
PATTERSON, DE ;
BUNCE, JD .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 1988, 110 (18) :5959-5967
[10]  
DELANO WL, PYMOL MOL GRAPHICS G