Use of classification regression tree in predicting oral absorption in humans

Cited by: 36
Authors
Bai, JPF
Utis, A
Crippen, G
He, HD
Fischer, V
Tullman, R
Yin, HQ
Hsu, CP
Jiang, L
Hwang, KK
Affiliations
[1] ZyxBio LLC, Hudson, OH 44236 USA
[2] ZyxBio LLC, Cleveland, OH 44106 USA
[3] Univ Michigan, Coll Pharm, Ann Arbor, MI 48109 USA
[4] Novartis Pharmaceut, E Hanover, NJ USA
[5] Johnson & Johnson Pharmaceut Res & Dev LLC, Bridgewater, NJ USA
[6] Aventis Pharmaceut, Bridgewater, NJ USA
Source
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2004 / Vol. 44 / Issue 06
Keywords
DOI
10.1021/ci040023n
Chinese Library Classification number
O6 [Chemistry];
Discipline classification code
0703 ;
Abstract
The purpose of this study is to explore the use of classification regression trees (CART) in predicting the fraction of dose absorbed in humans in the dose-independent range. Because results from clinical formulations in humans were used to train the model, a hypothetical state in which drug molecules are already dissolved in the intestinal fluid was adopted; molecular attributes affecting dissolution were therefore not considered in the model. As a result, the model projects the highest achievable fraction of dose absorbed, providing a reference point for manipulating formulations or solid states to optimize oral clinical efficacy. A set of approximately 1260 structures with their human oral pharmacokinetic data, including bioavailability and/or absorption and/or radio-labeled studies, was used, with 899 compounds as the training set and 362 as the test set. The numerical range of the fraction of dose absorbed, 0 to 1, was divided into 6 classes, each with a width of approximately 0.16. A set of 28 structural descriptors was used to model oral absorption without considering active transport; a separate branch was then created to model oral absorption involving active transport. The average absolute error (AAE) of the training set was 0.12, and the AAEs of five test sets ranged from 0.17 to 0.2. In terms of classification, two test sets of unpublished, proprietary compounds showed 79% to 86% correct prediction when a predicted value falling within +/- one class of the observed value was counted as correct. Overall, the computational errors from all the test sets of diverse structures were similar and reasonably acceptable. Compared with artificial membranes for ranking drug absorption potential, prediction by the CART model is considered fast and reasonably accurate for accelerating drug discovery.
The accuracy of CART computations can be improved continuously by expanding the chemical space of the training set. In addition, the statistical errors associated with the individual decision paths derived from the training set can be calculated and used to decide whether to accept an individual computation for any test set.
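The two evaluation measures named in the abstract, the AAE and the rate of predictions falling within +/- one class, can be illustrated with a minimal sketch. It assumes six equal-width classes of width 1/6 (the abstract says only "approximately 0.16", so the exact class boundaries used in the paper are not known here), and the function names and sample values are hypothetical, chosen for illustration rather than taken from the study.

```python
N_CLASSES = 6  # assumption: six equal-width classes over [0, 1]


def fa_class(fa, n_classes=N_CLASSES):
    """Map a fraction of dose absorbed in [0, 1] to a class index 0..n_classes-1."""
    if not 0.0 <= fa <= 1.0:
        raise ValueError("fraction absorbed must lie in [0, 1]")
    # fa == 1.0 would give index n_classes, so clamp it into the top class.
    return min(int(fa * n_classes), n_classes - 1)


def average_absolute_error(observed, predicted):
    """Mean of |observed - predicted| over paired values."""
    if len(observed) != len(predicted) or not observed:
        raise ValueError("need two non-empty sequences of equal length")
    return sum(abs(o - p) for o, p in zip(observed, predicted)) / len(observed)


def within_one_class_rate(observed, predicted):
    """Share of predictions whose class is at most one class away from the observed class."""
    if len(observed) != len(predicted) or not observed:
        raise ValueError("need two non-empty sequences of equal length")
    hits = sum(
        abs(fa_class(o) - fa_class(p)) <= 1 for o, p in zip(observed, predicted)
    )
    return hits / len(observed)


if __name__ == "__main__":
    # Hypothetical values for illustration only; not data from the study.
    observed = [0.05, 0.40, 0.95, 0.70]
    predicted = [0.10, 0.55, 0.50, 0.75]
    print("AAE:", average_absolute_error(observed, predicted))
    print("within +/- one class:", within_one_class_rate(observed, predicted))
```

Counting a prediction as correct when it lands in an adjacent class is a looser criterion than exact class agreement, which is why the abstract reports it alongside the AAE rather than in place of it.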
Pages: 2061-2069
Number of pages: 9