Classification of metabolites with kernel-partial least squares (K-PLS)

被引:20
作者
Embrechts, Mark J. [1 ]
Ekins, Sean
机构
[1] Rensselaer Polytech Inst, Dept Decis Sci & Engn Syst, Troy, NY USA
[2] GeneGo Inc, St Joseph, MO USA
[3] Univ Maryland, Dept Pharmaceut Sci, Baltimore, MD 20742 USA
关键词
D O I
10.1124/dmd.106.013185
中图分类号
R9 [药学];
学科分类号
1007 ;
摘要
Numerous experimental and computational approaches have been developed to predict human drug metabolism. Since databases of human drug metabolism information are widely available, these can be used to train computational algorithms and generate predictive approaches. In turn, they may be used to assist in the identification of possible metabolites from a large number of molecules in drug discovery based on molecular structure alone. In the current study we have used a commercially available database (MetaDrug) and extracted a fraction of the human drug metabolism data. These data were used along with augmented atom descriptors in a predictive machine learning model, kernel-partial least squares (K-PLS). A total of 317 molecules, including parent drugs and their primary and secondary (sequential) metabolites, were used to build these models corresponding to individual metabolism rules, representing the formation of discrete metabolites, e. g., N-dealkylation. Each model was internally validated to assess the capability to classify other molecules that were left out. Using receiver operator curve statistics models for N-dealkylation, O-dealkylation, aromatic hydroxylation, aliphatic hydroxylation, O-glucuronidation, and O-sulfation gave area under the curve values from 0.75 to 0.84 and were able to predict between 61 and 79% active molecules upon leave-one-out testing. This preliminary study indicates that K-PLS and possibly other similar machine learning methods (such as support vector machines) can be applied to predicting human drug metabolite formation in a classification manner. Improvements can be achieved using considerably larger datasets-that contain more positive examples for the less frequently occurring metabolite rules, as well as the external evaluation of novel molecules.
引用
收藏
页码:325 / 327
页数:3
相关论文
共 18 条
[1]  
[Anonymous], 2000, SUPPORT VECTOR MACHI
[2]  
[Anonymous], ADV LEARNING THEORY
[3]   Kohonen maps for prediction of binding to human cytochrome P450 3A4 [J].
Balakin, KV ;
Ekins, S ;
Bugrim, A ;
Ivanenkov, YA ;
Korolev, D ;
Nikolsky, YV ;
Skorenko, AV ;
Ivashchenko, AA ;
Savchuk, NP ;
Nikolskaya, T .
DRUG METABOLISM AND DISPOSITION, 2004, 32 (10) :1183-1189
[4]   A new statistical approach to predicting aromatic hydroxylation sites. Comparison with model-based approaches [J].
Borodina, Y ;
Rudik, A ;
Filimonov, D ;
Kharchevnikova, N ;
Dmitriev, A ;
Blinova, V ;
Porolkov, V .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (06) :1998-2009
[5]   New methods in predictive metabolism [J].
Boyer, S ;
Zamora, I .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2002, 16 (5-6) :403-413
[6]   Cytochrome P450 in silico: An integrative modeling approach [J].
de Graaf, C ;
Vermeulen, NPE ;
Feenstra, KA .
JOURNAL OF MEDICINAL CHEMISTRY, 2005, 48 (08) :2725-2755
[7]   Designing better drugs: predicting cytochrome P450 metabolism [J].
de Groot, Marcel J. .
DRUG DISCOVERY TODAY, 2006, 11 (13-14) :601-606
[8]   A combined approach to drug metabolism and toxicity assessment [J].
Ekins, S ;
Andreyev, S ;
Ryabov, A ;
Kirillov, E ;
Rakhmatulin, EA ;
Sorokina, S ;
Bugrim, A ;
Nikolskaya, T .
DRUG METABOLISM AND DISPOSITION, 2006, 34 (03) :495-503
[9]  
Ekins S, 2001, DRUG METAB DISPOS, V29, P936
[10]   Techniques: Application of systems biology to absorption, distribution, metabolism, excretion and toxicity [J].
Ekins, S ;
Nikolsky, Y ;
Nikolskaya, T .
TRENDS IN PHARMACOLOGICAL SCIENCES, 2005, 26 (04) :202-209