Mining molecular fragments: Finding relevant substructures of molecules

被引:186
作者
Borgelt, C [1 ]
Berthold, MR [1 ]
机构
[1] Univ Magdeburg, Sch Comp Sci, D-39106 Magdeburg, Germany
来源
2002 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS | 2002年
关键词
D O I
10.1109/ICDM.2002.1183885
中图分类号
TP18 [人工智能理论];
学科分类号
081104 [模式识别与智能系统]; 0812 [计算机科学与技术]; 0835 [软件工程]; 1405 [智能科学与技术];
摘要
We present an algorithm to find fragments in a set of molecules that help to discriminate between different classes of for instance, activity in a drug discovery context. Instead of carrying out a brute-force search, our method generates fragments by embedding them in all appropriate molecules in parallel and prunes the search tree based on a local order of the atoms and bonds, which results in substantially faster search by eliminating the need for frequent, computationally expensive reembeddings and by suppressing redundant search. We prove the usefulness of our algorithm by demonstrating the discovery of activity-related groups of chemical compounds in the well-known National Cancer Institute's HIV-screening dataset.
引用
收藏
页码:51 / 58
页数:8
相关论文
共 10 条
[1]
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[2]
BORGELT C, 2002, P 14 C COMP STAT COM
[3]
Clark Robert D., 2001, P337
[4]
DESPHANDE M, 2002, P WORKSH DAT MIN BIO, P11
[5]
Hipp J, 1998, LECT NOTES ARTIF INT, V1510, P74
[6]
KRAMER S, 2001, P 7 ACM SIGKDD INT C, P136
[7]
Heuristics for similarity searching of chemical graphs using a maximum common edge subgraph algorithm [J].
Raymond, JW ;
Gardiner, EJ ;
Willett, P .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2002, 42 (02) :305-316
[8]
TOPOLOGICAL PHARMACOPHORES NEW METHODS AND THEIR APPLICATION TO A SET OF ANTIMALARIALS .1. THE METHODS LOGANA AND LOCON [J].
STREICH, WJ ;
FRANKE, R .
QUANTITATIVE STRUCTURE-ACTIVITY RELATIONSHIPS, 1985, 4 (01) :13-18
[9]
NEW SOLUBLE-FORMAZAN ASSAY FOR HIV-1 CYTOPATHIC EFFECTS - APPLICATION TO HIGH-FLUX SCREENING OF SYNTHETIC AND NATURAL-PRODUCTS FOR AIDS-ANTIVIRAL ACTIVITY [J].
WEISLOW, OS ;
KISER, R ;
FINE, DL ;
BADER, J ;
SHOEMAKER, RH ;
BOYD, MR .
JNCI-JOURNAL OF THE NATIONAL CANCER INSTITUTE, 1989, 81 (08) :577-586
[10]
Zaki M. J., 1997, Proceedings of the Third International Conference on Knowledge Discovery and Data Mining, P283