Condorcet and borda count fusion method for ligand-based virtual screening

被引:20
作者
Ahmed, Ali [1 ,2 ]
Saeed, Faisal [1 ]
Salim, Naomie [1 ]
Abdo, Ammar [3 ]
机构
[1] Univ Teknol Malaysia, Fac Comp, Soft Comp Res Grp, Skudai 81310, Malaysia
[2] Karary Univ, Fac Engn, Khartoum 12304, Sudan
[3] Hodeidah Univ, Dept Comp Sci, Hodeidah, Yemen
来源
JOURNAL OF CHEMINFORMATICS | 2014年 / 6卷
关键词
Similarity searching; Virtual screening; Similarity coefficients; Data fusion; SIMILARITY; COMBINATION; PERFORMANCE; DOCKING; SETS;
D O I
10.1186/1758-2946-6-19
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Background: It is known that any individual similarity measure will not always give the best recall of active molecule structure for all types of activity classes. Recently, the effectiveness of ligand-based virtual screening approaches can be enhanced by using data fusion. Data fusion can be implemented using two different approaches: group fusion and similarity fusion. Similarity fusion involves searching using multiple similarity measures. The similarity scores, or ranking, for each similarity measure are combined to obtain the final ranking of the compounds in the database. Results: The Condorcet fusion method was examined. This approach combines the outputs of similarity searches from eleven association and distance similarity coefficients, and then the winner measure for each class of molecules, based on Condorcet fusion, was chosen to be the best method of searching. The recall of retrieved active molecules at top 5% and significant test are used to evaluate our proposed method. The MDL drug data report (MDDR), maximum unbiased validation (MUV) and Directory of Useful Decoys (DUD) data sets were used for experiments and were represented by 2D fingerprints. Conclusions: Simulated virtual screening experiments with the standard two data sets show that the use of Condorcet fusion provides a very simple way of improving the ligand-based virtual screening, especially when the active molecules being sought have a lowest degree of structural heterogeneity. However, the effectiveness of the Condorcet fusion was increased slightly when structural sets of high diversity activities were being sought.
引用
收藏
页数:10
相关论文
共 41 条
[31]   Virtual screening workflow development guided by the "receiver operating characteristic" curve approach. Application to high-throughput docking on metabotropic glutamate receptor subtype 4 [J].
Triballeau, N ;
Acher, F ;
Brabet, I ;
Pin, JP ;
Bertrand, HO .
JOURNAL OF MEDICINAL CHEMISTRY, 2005, 48 (07) :2534-2547
[32]   Evaluating virtual screening methods: Good and bad metrics for the "early recognition" problem [J].
Truchon, Jean-Francois ;
Bayly, Christopher I. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2007, 47 (02) :488-508
[33]   Virtual screening - an overview [J].
Walters, WP ;
Stahl, MT ;
Murcko, MA .
DRUG DISCOVERY TODAY, 1998, 3 (04) :160-178
[34]   Chemical similarity searching [J].
Willett, P ;
Barnard, JM ;
Downs, GM .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1998, 38 (06) :983-996
[35]  
Willett P., 2013, BIOTECHNOL J, V5
[36]  
Willett P, 2009, ANNU REV INFORM SCI, V43, P3
[37]   Enhancing the effectiveness of ligand-based virtual screening using data fusion [J].
Willett, Peter .
QSAR & COMBINATORIAL SCIENCE, 2006, 25 (12) :1143-1152
[40]   A statistical framework to evaluate virtual screening [J].
Zhao, Wei ;
Hevener, Kirk E. ;
White, Stephen W. ;
Lee, Richard E. ;
Boyett, James M. .
BMC BIOINFORMATICS, 2009, 10