Enhancing the effectiveness of ligand-based virtual screening using data fusion

Cited by: 56
Authors
Willett, Peter
Affiliations
[1] Univ Sheffield, Krebs Inst Biomol Sci, Sheffield S1 4DP, S Yorkshire, England
[2] Univ Sheffield, Dept Informat Studies, Sheffield S1 4DP, S Yorkshire, England
Source
QSAR & COMBINATORIAL SCIENCE | 2006 / Volume 25 / Issue 12
Keywords
consensus scoring; data fusion; fusion rule; ligand docking; machine learning; scoring function; similarity searching; similarity-based virtual screening; structure-based virtual screening
DOI
10.1002/qsar.200610084
Chinese Library Classification (CLC) number
R914 [Medicinal Chemistry]
Discipline classification code
100701
Abstract
Data fusion is being increasingly used to combine the outputs of different types of sensors. This paper reviews the application of the approach to ligand-based virtual screening, where the sensors to be combined are functions that score molecules in a database on their likelihood of exhibiting some required biological activity. Much of the literature to date involves the combination of multiple similarity searches, although there is also an increasing interest in the combination of multiple machine-learning techniques. Both approaches are reviewed here, focusing on the extent to which fusion can improve the effectiveness of searching when compared with a single screening mechanism, and on the reasons that have been suggested for the observed performance enhancement.
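As a rough illustration of what combining multiple similarity searches can look like, the sketch below fuses several score lists over the same database with two simple fusion rules: summing ranks and taking the maximum score. This is a minimal sketch under stated assumptions, not the procedure of any particular study covered by the review; the function names, rule names, molecule identifiers, and scores are all hypothetical.

```python
from typing import Dict, List

# Hypothetical representation: molecule id -> similarity score (higher is better).
Scores = Dict[str, float]


def to_ranks(scores: Scores) -> Dict[str, int]:
    """Convert scores to ranks (1 = best); ties are broken by molecule id."""
    ordered = sorted(scores, key=lambda mol: (-scores[mol], mol))
    return {mol: position for position, mol in enumerate(ordered, start=1)}


def fuse(score_lists: List[Scores], rule: str = "sum_rank") -> List[str]:
    """Return molecule ids ordered best-first by the chosen fusion rule.

    Assumes every search scored exactly the same set of molecules.
    """
    if not score_lists:
        raise ValueError("at least one score list is required")
    molecules = set(score_lists[0])
    if any(set(scores) != molecules for scores in score_lists):
        raise ValueError("all searches must score the same molecules")

    if rule == "sum_rank":
        # Lower summed rank means the searches agree the molecule ranks highly.
        rank_lists = [to_ranks(scores) for scores in score_lists]
        fused = {mol: sum(ranks[mol] for ranks in rank_lists) for mol in molecules}
        return sorted(molecules, key=lambda mol: (fused[mol], mol))
    if rule == "max_score":
        # Keep the best score any single search assigned to the molecule.
        # Only meaningful if the searches use comparable score scales.
        fused_max = {mol: max(scores[mol] for scores in score_lists) for mol in molecules}
        return sorted(molecules, key=lambda mol: (-fused_max[mol], mol))
    raise ValueError(f"unknown fusion rule: {rule}")


# Made-up scores from two hypothetical similarity searches.
search_a = {"m1": 0.9, "m2": 0.2, "m3": 0.6, "m4": 0.5}
search_b = {"m1": 0.3, "m2": 0.95, "m3": 0.7, "m4": 0.4}

print(fuse([search_a, search_b], "sum_rank"))   # ['m3', 'm1', 'm2', 'm4']
print(fuse([search_a, search_b], "max_score"))  # ['m2', 'm1', 'm3', 'm4']
```

The two rules can disagree: the rank sum favours m3, which both searches place second, while the maximum score favours m2, which one search places first and the other last.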
Pages: 1143-1152
Page count: 10