Combinatorial QSAR modeling of P-glycoprotein substrates

被引:104
作者
Lima, Patricia de Cerqueira
Golbraikh, Alexander
Oloff, Scott
Xiao, Yunde
Tropsha, Alexander
机构
[1] Univ N Carolina, Sch Pharm, Div Nat & Med Chem, Lab Mol Modeling, Chapel Hill, NC 27599 USA
[2] Targacept Inc, Winston Salem, NC 27101 USA
关键词
D O I
10.1021/ci0504317
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Quantitative structure-activity ( property) relationship (QSAR/QSPR) models are typically generated with a single modeling technique using one type of molecular descriptors. Recently, we have begun to explore a combinatorial QSAR approach which employs various combinations of optimization methods and descriptor types and includes rigorous and consistent model validation (Kovatcheva, A.; Golbraikh, A.; Oloff, S.; Xiao, Y.; Zheng, W.; Wolschann, P.; Buchbauer, G.; Tropsha, A. Combinatorial QSAR of Ambergris Fragrance Compounds. J. Chem. Inf. Comput. Sci. 2004, 44, 582-95). Herein, we have applied this approach to a data set of 195 diverse substrates and nonsubstrates of P-glycoprotein (P-gp) that plays a crucial role in drug resistance. Modeling methods included k-nearest neighbors classification, decision tree, binary QSAR, and support vector machines (SVM). Descriptor sets included molecular connectivity indices, atom pair (AP) descriptors, VolSurf descriptors, and molecular operation environment descriptors. Each descriptor type was used with every QSAR modeling technique; so, in total, 16 combinations of techniques and descriptor types have been considered. Although all combinations resulted in models with a high correct classification rate for the training set (CCRtrain), not all of them had high classification accuracy for the test set (CCRtest). Thus, predictive models have been generated only for some combinations of the methods and descriptor types, and the best models were obtained using SVM classification with either AP or VolSurf descriptors; they were characterized by CCRtrain = 0.94 and 0.88 and CCRtest = 0.81 and 0.81, respectively. The combinatorial QSAR approach identified models with higher predictive accuracy than those reported previously for the same data set. We suggest that, in the absence of any universally applicable "one-for-all" QSAR methodology, the combinatorial QSAR approach should become the standard practice in QSPR/ QSAR modeling.
引用
收藏
页码:1245 / 1254
页数:10
相关论文
共 53 条
[1]  
[Anonymous], DAYLIGHT THEORY FING
[2]   CHEMICAL GRAPHS .34. 5 NEW TOPOLOGICAL INDEXES FOR THE BRANCHING OF TREE-LIKE GRAPHS [J].
BALABAN, AT .
THEORETICA CHIMICA ACTA, 1979, 53 (04) :355-375
[3]   HIGHLY DISCRIMINATING DISTANCE-BASED TOPOLOGICAL INDEX [J].
BALABAN, AT .
CHEMICAL PHYSICS LETTERS, 1982, 89 (05) :399-404
[4]   The role of multidrug, transporters in drug availability, metabolism and toxicity [J].
Bodó, A ;
Bakos, E ;
Szeri, F ;
Váradi, A ;
Sarkadi, B .
TOXICOLOGY LETTERS, 2003, 140 :133-143
[5]   ISOMER DISCRIMINATION BY TOPOLOGICAL INFORMATION APPROACH [J].
BONCHEV, D ;
MEKENYAN, O ;
TRINAJSTIC, N .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 1981, 2 (02) :127-148
[6]   A rapid computational method for lead evolution:: Description and application to α1-adrenergic antagonists [J].
Bradley, EK ;
Beroza, P ;
Penzotti, JE ;
Grootenhuis, PDJ ;
Spellmeyer, DC ;
Miller, JL .
JOURNAL OF MEDICINAL CHEMISTRY, 2000, 43 (14) :2770-2774
[7]  
Breiman L., 1998, CLASSIFICATION REGRE
[8]   ATOM PAIRS AS MOLECULAR-FEATURES IN STRUCTURE ACTIVITY STUDIES - DEFINITION AND APPLICATIONS [J].
CARHART, RE ;
SMITH, DH ;
VENKATARAGHAVAN, R .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1985, 25 (02) :64-73
[9]   Molecular fields in quantitative structure-permeation relationships: the VolSurf approach [J].
Cruciani, C ;
Crivori, P ;
Carrupt, PA ;
Testa, B .
JOURNAL OF MOLECULAR STRUCTURE-THEOCHEM, 2000, 503 (1-2) :17-30
[10]   Characterisation of the brain multidrug resistance protein (BMDP/ABCG2/BCRP) expressed at the blood-brain barrier [J].
Eisenblätter, T ;
Hüwel, S ;
Galla, HJ .
BRAIN RESEARCH, 2003, 971 (02) :221-231