Unified QSAR approach to antimicrobials.: Part 3:: First multi-tasking QSAR model for Input-Coded prediction, structural back-projection, and complex networks clustering of antiprotozoal compounds

被引:202
作者
Prado-Prado, Francisco J. [1 ,2 ]
Gonzalez-Diaz, Humberto [1 ,3 ]
Martinez de la Vega, Octavio [2 ]
Ubeira, Florencio M. [1 ]
Chou, Kuo-Chen [3 ]
机构
[1] Univ Santiago Compostela, Unit Bioinformat & Connect Anal UBICA, Dept Microbiol & Parasitol, Fac Pharm,Inst Ind Pharm, Santiago De Compostela 15782, Spain
[2] LANGEBIO, CINVESTAV, Dept Bioinformat, Irapuato 62936500, Mexico
[3] Gordon Life Sci Inst, San Diego, CA 92130 USA
关键词
QSAR; multi-tasking learning; machine learning; complex networks; antiparasite drugs; Markov Chain Model; malaria; Plasmodium; Leishmania; Toxoplasma; Trypanosoma; Trichomonas;
D O I
10.1016/j.bmc.2008.04.068
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Several pathogen parasite species show different susceptibilities to different antiparasite drugs. Unfortunately, almost all structure-based methods are one-task or one-target Quantitative Structure-Activity Relationships (ot-QSAR) that predict the biological activity of drugs against only one parasite species. Consequently, multi-tasking learning to predict drugs activity against different species by a single model (mt-QSAR) is vitally important. In the two previous works of the present series we reported two single mt-QSAR models in order to predict the antimicrobial activity against different fungal (Bioorg. Med. Chem. 2006, 14, 5973-5980) or bacterial species (Bioorg. Med. Chem. 2007, 15, 897-902). These mt-QSARs offer a good opportunity (unpractical with ot-QSAR) to construct drug-drug similarity Complex Networks and to map the contribution of sub-structures to function for multiple species. These possibilities were unattended in our previous works. In the present work, we continue this series toward other important direction of chemotherapy (antiparasite drugs) with the development of an mt-QSAR for more than 500 drugs tested in the literature against different parasites. The data were processed by Linear Discriminant Analysis (LDA) classifying drugs as active or non-active against the different tested parasite species. The model correctly classifies 212 out of 244 (87.0%) cases in training series and 207 out of 243 compounds (85.4%) in external validation series. In order to illustrate the performance of the QSAR for the selection of active drugs we carried out an additional virtual screening of antiparasite compounds not used in training or predicting series; the model recognized 97 out of 114 (85.1%) of them. We also give the procedures to construct back-projection maps and to calculate sub-structures contribution to the biological activity. Finally, we used the outputs of the QSAR to construct, by the first time, a multi-species Complex Networks of antiparasite drugs. The network predicted has 380 nodes (compounds), 634 edges (pairs of compounds with similar activity). This network allows us to cluster different compounds and identify on average three known compounds similar to a new query compound according to their pro. le of biological activity. This is the first attempt to calculate probabilities of antiparasitic action of drugs against different parasites. (C) 2008 Elsevier Ltd. All rights reserved.
引用
收藏
页码:5871 / 5880
页数:10
相关论文
共 73 条
[1]  
ALVAREZGINARTE YM, 2007, J COMPUT CHEM
[2]  
[Anonymous], 2007, INSIDE NRC
[3]   Network theory -: The emergence of the creative enterprise [J].
Barabási, AL .
SCIENCE, 2005, 308 (5722) :639-641
[4]   Characterization and comparison of Escherichia coli transfer RNAs by graph theory based on secondary structure [J].
Bermúdez, CI ;
Daza, EE ;
Andrade, E .
JOURNAL OF THEORETICAL BIOLOGY, 1999, 197 (02) :193-205
[5]   Complex networks: Structure and dynamics [J].
Boccaletti, S. ;
Latora, V. ;
Moreno, Y. ;
Chavez, M. ;
Hwang, D. -U. .
PHYSICS REPORTS-REVIEW SECTION OF PHYSICS LETTERS, 2006, 424 (4-5) :175-308
[6]   TOMOCOMD-CARDD descriptors-based virtual screening of tyrosinase inhibitors:: Evaluation of different classification model combinations using bond-based linear indices [J].
Casahola-Martin, Gerardo M. ;
Marrero-Ponce, Yovani ;
Khan, Mahmud Tareq Hassan ;
Ather, Arjumand ;
Sultan, Sadia ;
Torrens, Francisco ;
Rotondo, Richard .
BIOORGANIC & MEDICINAL CHEMISTRY, 2007, 15 (03) :1483-1503
[7]   Dragon method for finding novel tyrosinase inhibitors:: Biosilico identification and experimental in vitro assays [J].
Casanola-Martin, Gerardo M. ;
Marrero-Ponce, Yovani ;
Khan, Mahmud Tareq Hassan ;
Ather, Arjumand ;
Khan, Khalid M. ;
Torrens, Francisco ;
Rotondo, Richard .
EUROPEAN JOURNAL OF MEDICINAL CHEMISTRY, 2007, 42 (11-12) :1370-1381
[8]   Atom-based 3D-chiral quadratic indices. Part 2: Prediction of the corticosteroid-binding globulin binding affinity of the 31 benchmark steroids data set [J].
Castillo-Garit, JA ;
Marrero-Ponce, Y ;
Torrens, F .
BIOORGANIC & MEDICINAL CHEMISTRY, 2006, 14 (07) :2398-2408
[9]   Atom-based stochastic and non-stochastic 3D-chiral bilinear indices and their applications to central chirality codification [J].
Castillo-Garit, Juan A. ;
Marrero-Ponce, Yovani ;
Torrens, Francisco ;
Rotondo, Richard .
JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2007, 26 (01) :32-47
[10]   Structural bioinformatics and its impact to biomedical science [J].
Chou, KC .
CURRENT MEDICINAL CHEMISTRY, 2004, 11 (16) :2105-2134