Unified QSAR approach to antimicrobials. 4. Multi-target QSAR modeling and comparative multi-distance study of the giant components of antiviral drug-drug complex networks

被引:116
作者
Prado-Prado, Francisco J. [1 ,2 ,3 ]
Martinez de la Vega, Octavio [2 ]
Uriarte, Eugenio [3 ]
Ubeira, Florencio M. [1 ]
Chou, Kuo-Chen [4 ]
Gonzalez-Diaz, Humberto [1 ,3 ,4 ]
机构
[1] Univ Santiago de Compostela, Fac Pharm, Dept Microbiol & Parasitol, Santiago De Compostela 15782, Spain
[2] CINVESTAV, LANGEBIO, Dept Bioinformat, Irapuato 62936500, Mexico
[3] Univ Santiago de Compostela, Fac Pharm, Dept Organ Chem, Inst Ind Pharm,UBICA, Santiago De Compostela 15782, Spain
[4] Gordon Life Sci Inst, San Diego, CA 92130 USA
关键词
QSAR; Multi-target learning; Machine learning; Complex Networks; Antimicrobial drugs; Antiviral drugs; Markov Chain Model; Giant Component; COMPUTATIONAL CHEMISTRY APPROACH; CONNECTIVITY INDEXES; IN-SILICO; TYROSINASE INHIBITORS; MOLECULAR DESCRIPTORS; BILINEAR INDEXES; DISCOVERY; QSPR/QSAR; TASK;
D O I
10.1016/j.bmc.2008.11.075
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
One limitation of almost all antiviral Quantitative Structure-Activity Relationships (QSAR) models is that they predict the biological activity of drugs against only one species of virus. Consequently, the development of multi-tasking QSAR models (mt-QSAR) to predict drugs activity against different species of virus is of the major vitally important. These mt-QSARs offer also a good opportunity to construct drug-drug Complex Networks (CNs) that can be used to explore large and complex drug-viral species databases. It is known that in very large CNs we can use the Giant Component (GC) as a representative sub-set of nodes (drugs) and but the drug-drug similarity function selected may strongly determines the final network obtained. In the three previous works of the present series we reported mt-QSAR models to predict the antimicrobial activity against different fungi [Gonzalez-Diaz, H.; Prado-Prado, F. J.; Santana, L.; Uriarte, E. Bioorg. Med. Chem. 2006, 14, 5973], bacteria [Prado-Prado, F. J.; Gonzalez-Diaz, H.; Santana, L.; Uriarte E. Bioorg. Med. Chem. 2007, 15, 897] or parasite species [Prado-Prado, F. J.; Gonzalez-Diaz, H.; Martinez de la Vega, O.; Ubeira, F. M.; Chou K. C. Bioorg. Med. Chem. 2008, 16, 5871]. However, including these works, we do not found any report of mt-QSAR models for antivirals drug, or a comparative study of the different GC extracted from drug-drug CNs based on different similarity functions. In this work, we used Linear Discriminant Analysis (LDA) to. fit a mt-QSAR model that classify 600 drugs as active or nonactive against the 41 different tested species of virus. The model correctly classifies 143 of 169 active compounds (specificity = 84.62%) and 119 of 139 non-active compounds (sensitivity = 85.61%) and presents overall training accuracy of 85.1% (262 of 308 cases). Validation of the model was carried out by means of external predicting series, classifying the model 466 of 514, 90.7% of compounds. In order to illustrate the performance of the model in practice, we develop a virtual screening recognizing the model as active 92.7%, 102 of 110 antivirus compounds. These compounds were never use in training or predicting series. Next, we obtained and compared the topology of the CNs and their respective GCs based on Euclidean, Manhattan, Chebychey, Pearson and other similarity measures. The GC of the Manhattan network showed the more interesting features for drug-drug similarity search. We also give the procedure for the construction of Back-Projection Maps for the contribution of each drug sub-structure to the antiviral activity against different species. (C) 2008 Elsevier Ltd. All rights reserved.
引用
收藏
页码:569 / 575
页数:7
相关论文
共 37 条
[1]   Prevalence of HIV-2 and HIV-1 group O infections among new HIV diagnoses in France:: 2003-2006 [J].
Barin, Francis ;
Cazein, Francoise ;
Lot, Florence ;
Pillonel, Josiane ;
Brunet, Sylvie ;
Thierry, Damien ;
Damond, Florence ;
Brun-Vezinet, Francoise ;
Desenclos, Jean-Claude ;
Semaille, Caroline .
AIDS, 2007, 21 (17) :2351-2353
[2]  
Bornholdt S., 2003, HDB GRAPHS COMPLEX N
[3]   TOMOCOMD-CARDD descriptors-based virtual screening of tyrosinase inhibitors:: Evaluation of different classification model combinations using bond-based linear indices [J].
Casahola-Martin, Gerardo M. ;
Marrero-Ponce, Yovani ;
Khan, Mahmud Tareq Hassan ;
Ather, Arjumand ;
Sultan, Sadia ;
Torrens, Francisco ;
Rotondo, Richard .
BIOORGANIC & MEDICINAL CHEMISTRY, 2007, 15 (03) :1483-1503
[4]   Atom-based stochastic and non-stochastic 3D-chiral bilinear indices and their applications to central chirality codification [J].
Castillo-Garit, Juan A. ;
Marrero-Ponce, Yovani ;
Torrens, Francisco ;
Rotondo, Richard .
JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2007, 26 (01) :32-47
[5]  
DAMOS D, 1982, AVIAT SPACE ENVIR MD, V53, P1177
[6]   Collaborative filtering on a family of biological targets [J].
Erhan, D ;
L'Heureux, PJ ;
Yue, SY ;
Bengio, Y .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (02) :626-635
[7]   Novel local (fragment-based) topological molecular descriptors for QSPR/QSAR and molecular design [J].
Estrada, E ;
Molina, E .
JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2001, 20 (01) :54-64
[8]   3D connectivity indices in QSPR/QSAR studies [J].
Estrada, E ;
Molina, E .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2001, 41 (03) :791-797
[9]   Design, synthesis and photobiological properties of 3,4-cyclopentenepsoralens [J].
Gia, O ;
Magno, SM ;
Diaz, HG ;
Quezada, E ;
Santana, L ;
Uriarte, E ;
Via, LD .
BIOORGANIC & MEDICINAL CHEMISTRY, 2005, 13 (03) :809-817
[10]   Unified QSAR and network-based computational chemistry approach to antimicrobials, part 1:: Multispecies activity models for antifungals [J].
Gonzalez-Diaz, Humberto ;
Prado-Prado, Francisco J. .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 2008, 29 (04) :656-667