Lead hopping using SVM and 3D pharmacophore fingerprints

被引:48
作者
Saeh, JC
Lyne, PD
Takasaki, BK
Cosgrove, DA
机构
[1] AstraZeneca R&D Boston, Canc Discovery, Waltham, MA 02451 USA
[2] AstraZeneca R&D Bostin, Global Sci & Informat, Waltham, MA 02451 USA
[3] AstraZeneca, Canc Discovery, Macclesfield SK10 4TG, Cheshire, England
关键词
D O I
10.1021/ci049732r
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
The combination of 3D pharmacophore fingerprints and the support vector machine classification algorithm has been used to generate robust models that are able to classify compounds as active or inactive in a number of G-protein-coupled receptor assays. The models have been tested against progressively more challenging validation sets where steps are taken to ensure that compounds in the validation set are chemically and structurally distinct from the training set. In the most challenging example, we simulate a lead-hopping experiment by excluding an entire class of compounds (defined by a core substructure) from the training set. The left-out active compounds comprised approximately 40% of the actives. The model trained on the remaining compounds is able to recall 75% of the actives from the "new" lead series while correctly classifying > 99% of the 5000 inactives included in the validation set.
引用
收藏
页码:1122 / 1133
页数:12
相关论文
共 48 条
[1]   High-throughput docking for lead generation [J].
Abagyan, R ;
Totrov, M .
CURRENT OPINION IN CHEMICAL BIOLOGY, 2001, 5 (04) :375-382
[2]   Efficient generation, storage, and manipulation of fully flexible pharmacophore multiplets and their use in 3-D similarity searching [J].
Abrahamian, E ;
Fox, PC ;
Nærum, L ;
Christensen, IT ;
Thogersen, H ;
Clark, RD .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2003, 43 (02) :458-468
[3]   Toward general methods of targeted library design: Topomer shape similarity searching with diverse structures as queries [J].
Andrews, KM ;
Cramer, RD .
JOURNAL OF MEDICINAL CHEMISTRY, 2000, 43 (09) :1723-1740
[4]  
[Anonymous], 1998, Encyclopedia of Biostatistics
[5]  
[Anonymous], 1994, SIGIR
[6]   Integration of virtual and high-throughput screening [J].
Bajorath, F .
NATURE REVIEWS DRUG DISCOVERY, 2002, 1 (11) :882-894
[7]  
Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
[8]   Assessing the performance of OMEGA with respect to retrieving bioactive conformations [J].
Boström, J ;
Greenwood, JR ;
Gottfries, J .
JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2003, 21 (05) :449-462
[9]   Use of structure Activity data to compare structure-based clustering methods and descriptors for use in compound selection [J].
Brown, RD ;
Martin, YC .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1996, 36 (03) :572-584
[10]  
BURGES CJC, 1996, IMPLIFIED SUPPORT VE