Common subgraph isomorphism detection by backtracking search

被引:68
作者
Krissinel, EB [1 ]
Henrick, K [1 ]
机构
[1] Europena Bioinformat Inst, Cambridge CB10 1SD, England
关键词
common subgraphs; backtracking algorithm;
D O I
10.1002/spe.588
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Graph theory offers a convenient and highly attractive approach to various tasks of pattern recognition. Provided there is a graph representation of the object in question (e.g. a chemical structure or protein fold), the recognition procedure is reduced to the problem of common subgraph isomorphism (CSI). Complexity of this problem shows combinatorial dependence on the size of input graphs, which in many practical cases makes the approach computationally intractable. Among the optimal algorithms for CSI, the leading place in practice belongs to algorithms based on maximal clique detection in the association graph. Backtracking algorithms for CSI, first developed two decades ago, are rarely used. We propose an improved backtracking algorithm for CSI, which differs from its predecessors by better search strategy and is therefore more efficient. We found that the new algorithm outperforms the traditional maximal clique approach by orders of magnitude in computational time. Copyright (C) 2004 John Wiley Sons, Ltd.
引用
收藏
页码:591 / 607
页数:17
相关论文
共 34 条
[1]  
Arita M., 2000, Journal of Japanese Society for Artificial Intelligence, V15, P703
[2]   THE CCP4 SUITE - PROGRAMS FOR PROTEIN CRYSTALLOGRAPHY [J].
BAILEY, S .
ACTA CRYSTALLOGRAPHICA SECTION D-BIOLOGICAL CRYSTALLOGRAPHY, 1994, 50 :760-763
[3]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[4]   E-MSD: the European Bioinformatics Institute Macromolecular Structure Database [J].
Boutselakis, H ;
Dimitropoulos, D ;
Fillon, J ;
Golovin, A ;
Henrick, K ;
Hussain, A ;
Ionides, J ;
John, M ;
Keller, PA ;
Krissinel, E ;
McNeil, P ;
Naim, A ;
Newman, R ;
Oldfield, T ;
Pineda, J ;
Rachedi, A ;
Copeland, J ;
Sitnov, A ;
Sobhany, S ;
Suarez-Uruena, A ;
Swaminathan, J ;
Tagari, M ;
Tate, J ;
Tromm, S ;
Velankar, S ;
Vranken, W .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :458-462
[5]   FINDING ALL CLIQUES OF AN UNDIRECTED GRAPH [H] [J].
BRON, C ;
KERBOSCH, J .
COMMUNICATIONS OF THE ACM, 1973, 16 (09) :575-577
[6]   Representation and searching of carbohydrate structures using graph-theoretic techniques [J].
Bruno, IJ ;
Kemp, NM ;
Artymiuk, PJ ;
Willett, P .
CARBOHYDRATE RESEARCH, 1997, 304 (01) :61-67
[7]  
BUNKE H, 2002, LECT NOTES COMPUTER, V2396, P123, DOI DOI 10.1007/3-540-70659-3_12
[8]  
BUNKE H, 2001, LNCS, V2059, P11
[9]   APPLICATION OF THE MAXIMAL COMMON SUBSTRUCTURE ALGORITHM TO AUTOMATIC INTERPRETATION OF C-13-NMR SPECTRA [J].
CHEN, LG ;
ROBIEN, W .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1994, 34 (04) :934-941
[10]   MOLECULAR-STRUCTURE COMPARISON PROGRAM FOR IDENTIFICATION OF MAXIMAL COMMON SUBSTRUCTURES [J].
CONE, MM ;
VENKATARAGHAVAN, R ;
MCLAFFERTY, FW .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 1977, 99 (23) :7668-7671