Multi-space classification for predicting GPCR-ligands

被引:21
作者
Givehchi, Alireza [1 ,2 ]
Schneider, Gisbert [1 ]
机构
[1] Goethe Univ Frankfurt, Inst Organ Chem & Chem Biol, D-60439 Frankfurt, Germany
[2] Univ Klinikum Munster, Klin & Poliklin Neurochirurg, Klin & Poliklin Neurol, D-48129 Munster, Germany
关键词
descriptor selection; drug-likeness; genetic algorithm; molecular descriptor; neural network; receiver operating characteristic curve ROC;
D O I
10.1007/s11030-005-6293-4
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
A classification of molecules depends on the descriptor set which is used to represent the compounds, and each descriptor could be regarded as one perception of a molecule. In this study we show that a combination of several classifiers that are grounded on separate descriptor sets can be superior to a single classifier that was built using all available descriptors. The task of predicting ligands of G-protein coupled receptors (GPCR) served as an example application. The perceptron, multilayer neural networks, and radial basis function (RBF) networks were employed for prediction. We developed classifiers with and without descriptor selection. Prediction accuracy was assessed by the area under the receiver operating characteristic (ROC) curve. In the case with descriptor selection both the selection and the rank order of the descriptors depended on the type and topology of the neural networks. We demonstrate that the overall prediction accuracy of the system can be improved by joining neural network classifiers of different type and topology using a "jury network" that is trained to evaluate the predictions from the individual classifiers. Seventy-one percent correct prediction of GPCR ligands was obtained.
引用
收藏
页码:371 / 383
页数:13
相关论文
共 65 条
[1]   Can we learn to distinguish between "drug-like" and "nondrug-like" molecules? [J].
Ajay ;
Walters, WP ;
Murcko, MA .
JOURNAL OF MEDICINAL CHEMISTRY, 1998, 41 (18) :3314-3324
[2]  
ANDERSON JA, 1990, NEUROCOMPUTING, V2
[3]   HIGHLY DISCRIMINATING DISTANCE-BASED TOPOLOGICAL INDEX [J].
BALABAN, AT .
CHEMICAL PHYSICS LETTERS, 1982, 89 (05) :399-404
[4]   Property-based design of GPCR-targeted library [J].
Balakin, KV ;
Tkachenko, SE ;
Lang, SA ;
Okun, I ;
Ivashchenko, AA ;
Savchuk, NP .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2002, 42 (06) :1332-1342
[5]  
Biganzoli E, 1998, STAT MED, V17, P1169, DOI 10.1002/(SICI)1097-0258(19980530)17:10<1169::AID-SIM796>3.3.CO
[6]  
2-4
[7]  
Bishop C. M., 1996, Neural networks for pattern recognition
[8]   Statistical methods for ranking differentially expressed genes [J].
Broberg, P .
GENOME BIOLOGY, 2003, 4 (06)
[9]  
Broomhead D. S., 1988, Complex Systems, V2, P321
[10]   SVM-based feature selection for characterization of focused compound collections [J].
Byvatov, E ;
Schneider, G .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (03) :993-999