Build-Up Algorithm for Atomic Correspondence between Chemical Structures

被引:55
作者
Kawabata, Takeshi [1 ,2 ]
机构
[1] Osaka Univ, Inst Prot Sci, Suita, Osaka 5650871, Japan
[2] Nara Inst Sci & Technol, Grad Sch Informat Sci, Nara 6300192, Japan
基金
日本学术振兴会;
关键词
PROTEIN-STRUCTURE; COMMON; SEARCH; GRAPHS; SUBGRAPHS; ALIGNMENT; ACCURACY; CLIQUES;
D O I
10.1021/ci2001023
中图分类号
R914 [药物化学];
学科分类号
100705 [微生物与生化药学];
摘要
Determining a one-to-one atom correspondence between two chemical compounds is important to measure molecular similarities and to find compounds with similar biological activities. This calculation can be formalized as the maximum common substructure (MCS) problem, which is well-studied and has been shown to be NP-complete. Although many rigorous and heuristic algorithms have been developed, none of these algorithms is sufficiently fast and accurate. We developed a new program, called "kcombu" using a build-up algorithm, which is a type of the greedy heuristic algorithms. The program can search connected and disconnected MCSs as well as topologically constrained disconnected MCS (TD-MCS), which is introduced in this study. To evaluate the performance of our program, we prepared two correct standards: the exact correspondences generated by the maximum clique algorithms and the 3D correspondences obtained from superimposed 3D structure of the molecules in a complex 3D structure with the same protein. For the five sets of molecules taken from the protein structure database, the agreement value between the build-up and the exact correspondences for the connected MCS is sufficiently high, but the computation time of the build-up algorithm is much smaller than that of the exact algorithm. The comparison between the build-up and the 3D correspondences shows that the TD-MCS has the best agreement value among the other types of MCS. Additionally, we observed a strong correlation between the molecular similarity and the agreement with the correct and 3D correspondences; more similar molecule pairs are more correctly matched. Molecular pairs with more than 40% Tanimoto similarities can be correctly matched for more than half of the atoms with the 3D correspondences.
引用
收藏
页码:1775 / 1787
页数:13
相关论文
共 35 条
[1]
SUBSTRUCTURE SEARCHING METHODS - OLD AND NEW [J].
BARNARD, JM .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1993, 33 (04) :532-538
[2]
PZIM: A Method for Similarity Searching Using Atom Environments and 2D Alignment [J].
Berglund, Anders E. ;
Head, Richard D. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2010, 50 (10) :1790-1795
[3]
The worldwide Protein Data Bank (wwPDB): ensuring a single, uniform archive of PDB data [J].
Berman, Helen ;
Henrick, Kim ;
Nakamura, Haruki ;
Markley, John L. .
NUCLEIC ACIDS RESEARCH, 2007, 35 :D301-D303
[4]
FINDING ALL CLIQUES OF AN UNDIRECTED GRAPH [H] [J].
BRON, C ;
KERBOSCH, J .
COMMUNICATIONS OF THE ACM, 1973, 16 (09) :575-577
[5]
BROSTROM J, 2006, J MED CHEM, V49, P6716
[6]
MATCHING 2-DIMENSIONAL CHEMICAL GRAPHS USING GENETIC ALGORITHMS [J].
BROWN, RD ;
JONES, G ;
WILLETT, P ;
GLEN, RC .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1994, 34 (01) :63-70
[7]
A maximum common substructure-based algorithm for searching and predicting drug-like compounds [J].
Cao, Yiqun ;
Jiang, Tao ;
Girke, Thomas .
BIOINFORMATICS, 2008, 24 (13) :I366-I374
[8]
An algorithm for reporting maximal c-cliques [J].
Cazals, F ;
Karande, C .
THEORETICAL COMPUTER SCIENCE, 2005, 349 (03) :484-490
[9]
MCSS - A NEW ALGORITHM FOR PERCEPTION OF MAXIMAL COMMON SUBSTRUCTURES AND ITS APPLICATION TO NMR SPECTRAL STUDIES .1. THE ALGORITHM [J].
CHEN, LG ;
ROBIEN, W .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1992, 32 (05) :501-506
[10]
Geometric accuracy of three-dimensional molecular overlays [J].
Chen, Qi ;
Higgs, Richard E. ;
Vieth, Michal .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (05) :1996-2002