Consensus queries in ligand-based virtual screening experiments

被引:13
作者
Berenger, Francois [1 ,2 ]
Oanh Vu [1 ]
Meiler, Jens [1 ]
机构
[1] Kyushu Univ, Med Inst Bioregulat, Div Syst Cohort, Fukuoka, Japan
[2] Kyushu Univ, Med Inst Bioregulat, Divs Syst Cohort, Fukuoka, Japan
关键词
Similarity search; Several bioactives; Consensus query; Ligand-based virtual screening (LBVS); Chemical fingerprint; Potency scaling; MACCS; ECFP4; MOLPRINT2D; Tanimoto score; MOLECULAR SIMILARITY; DRUG DISCOVERY; CHEMICAL SPACE; 2D;
D O I
10.1186/s13321-017-0248-5
中图分类号
O6 [化学];
学科分类号
070301 [无机化学];
摘要
Background: In ligand-based virtual screening experiments, a known active ligand is used in similarity searches to find putative active compounds for the same protein target. When there are several known active molecules, screening using all of them is more powerful than screening using a single ligand. A consensus query can be created by either screening serially with different ligands before merging the obtained similarity scores, or by combining the molecular descriptors (i.e. chemical fingerprints) of those ligands. Results: We report on the discriminative power and speed of several consensus methods, on two datasets only made of experimentally verified molecules. The two datasets contain a total of 19 protein targets, 3776 known active and similar to 2 x 10(6) inactive molecules. Three chemical fingerprints are investigated: MACCS 166 bits, ECFP4 2048 bits and an unfolded version of MOLPRINT2D. Four different consensus policies and five consensus sizes were benchmarked. Conclusions: The best consensus method is to rank candidate molecules using the maximum score obtained by each candidate molecule versus all known actives. When the number of actives used is small, the same screening performance can be approached by a consensus fingerprint. However, if the computational exploration of the chemical space is limited by speed (i.e. throughput), a consensus fingerprint allows to outperform this consensus of scores.
引用
收藏
页数:13
相关论文
共 37 条
[1]
Low Data Drug Discovery with One-Shot Learning [J].
Altae-Tran, Han ;
Ramsundar, Bharath ;
Pappu, Aneesh S. ;
Pande, Vijay .
ACS CENTRAL SCIENCE, 2017, 3 (04) :283-293
[2]
[Anonymous], RDKit: Open-source cheminformatics
[3]
Similarity searching of chemical databases using atom environment descriptors (MOLPRINT 2D): Evaluation of performance [J].
Bender, A ;
Mussa, HY ;
Glen, RC ;
Reiling, S .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (05) :1708-1718
[4]
Molecular similarity: a key technique in molecular informatics [J].
Bender, A ;
Glen, RC .
ORGANIC & BIOMOLECULAR CHEMISTRY, 2004, 2 (22) :3204-3218
[5]
Molecular similarity searching using atom environments, information-based feature selection, and a naive Bayesian classifier [J].
Bender, A ;
Mussa, HY ;
Glen, RC ;
Reiling, S .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (01) :170-178
[6]
PAR: a PARallel and distributed job crusher [J].
Berenger, Francois ;
Coti, Camille ;
Zhang, Kam Y. J. .
BIOINFORMATICS, 2010, 26 (22) :2918-2919
[7]
Benchmarking Ligand-Based Virtual High-Throughput Screening with the PubChem Database [J].
Butkiewicz, Mariusz ;
Lowe, Edward W., Jr. ;
Mueller, Ralf ;
Mendenhall, Jeffrey L. ;
Teixeira, Pedro L. ;
Weaver, C. David ;
Meiler, Jens .
MOLECULES, 2013, 18 (01) :735-756
[8]
A "minimal disruption" skeleton experiment: seamless map & reduce embedding in OCaml [J].
Danelutto, M. ;
Di Cosmo, R. .
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2012, 2012, 9 :1837-1846
[9]
The power metric: a new statistically robust enrichment-type metric for virtual screening applications with early recovery capability [J].
Dias Lopes, Julio Cesar ;
dos Santos, Fabio Mendes ;
Martins-Jose, Andrelly ;
Augustyns, Koen ;
De Winter, Hans .
JOURNAL OF CHEMINFORMATICS, 2017, 9
[10]
POT-DMC: A virtual screening method for the identification of potent hits [J].
Godden, JW ;
Stahura, FL ;
Bajorath, J .
JOURNAL OF MEDICINAL CHEMISTRY, 2004, 47 (23) :5608-5611