Chemical similarity searches using latent semantic structural indexing (LaSSI) and comparison to TOPOSIM

被引:20
作者
Hull, RD [1 ]
Fluder, EM [1 ]
Singh, SB [1 ]
Nachbar, RB [1 ]
Kearsley, SK [1 ]
Sheridan, RP [1 ]
机构
[1] Merck Res Labs, Dept Mol Syst, Rahway, NJ 07065 USA
关键词
D O I
10.1021/jm000392k
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Similarity searches based on chemical descriptors have proven extremely useful in aiding large-scale drug screening. Here we present results of similarity searching using Latent Semantic Structure Indexing (LaSSI). LaSSI uses a singular value decomposition on chemical descriptors to project molecules into a h-dimensional descriptor space, where h is the number of retained singular values. The effect of the projection is that certain descriptors are emphasized over others and some descriptors may count as partially equivalent to others. We compare LaSSI searches to searches done with TOPOSIM, our standard in-house method, which uses the Dice similarity definition. Standard descriptor-based methods such as TOPOSIM count all descriptors equally and treat all descriptors as independent. For this work we use atom pairs and topological torsions as examples of chemical descriptors. Using objective criteria to determine how effective one similarity method is versus another in selecting active compounds from a large database, we find for a series of 16 drug-like probes that LaSSI is as good as or better than TOPOSIM in selecting active compounds from the MDDR database, if the user is allowed to treat h as an adjustable parameter. Typically, LaSSI selects very different sets of actives than does TOPOSIM, so it can find classes of actives that TOPOSIM would miss.
引用
收藏
页码:1185 / 1191
页数:7
相关论文
共 9 条
[1]   ATOM PAIRS AS MOLECULAR-FEATURES IN STRUCTURE ACTIVITY STUDIES - DEFINITION AND APPLICATIONS [J].
CARHART, RE ;
SMITH, DH ;
VENKATARAGHAVAN, R .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1985, 25 (02) :64-73
[2]   Four association coefficients for relating molecular similarity measures [J].
Cheng, C ;
Maggiora, G ;
Lajiness, M ;
Johnson, M .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1996, 36 (04) :909-915
[3]   Latent semantic structure indexing (LaSSI) for defining chemical similarity [J].
Hull, RD ;
Singh, SB ;
Nachbar, RB ;
Sheridan, RP ;
Kearsley, SK ;
Fluder, EM .
JOURNAL OF MEDICINAL CHEMISTRY, 2001, 44 (08) :1177-1184
[4]  
Johnson M., 1990, CONCEPTS APPL MOL SI
[5]   Chemical similarity using physiochemical property descriptors [J].
Kearsley, SK ;
Sallamack, S ;
Fluder, EM ;
Andose, JD ;
Mosley, RT ;
Sheridan, RP .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1996, 36 (01) :118-127
[6]  
*MOL DES LTD, MDL DRUG DAT REP
[7]   TOPOLOGICAL TORSION - A NEW MOLECULAR DESCRIPTOR FOR SAR APPLICATIONS - COMPARISON WITH OTHER DESCRIPTORS [J].
NILAKANTAN, R ;
BAUMAN, N ;
DIXON, JS ;
VENKATARAGHAVAN, R .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1987, 27 (02) :82-85
[8]  
SINGH SB, IN PRESS J MED CHEM
[9]   Chemical similarity searching [J].
Willett, P ;
Barnard, JM ;
Downs, GM .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1998, 38 (06) :983-996