The signature molecular descriptor. 4. Canonizing molecules using extended valence sequences

被引:66
作者
Faulon, JL
Collins, MJ
Carr, RD
机构
[1] Sandia Natl Labs, Dept Computat Biol, Livermore, CA 94551 USA
[2] Sandia Natl Labs, Dept Cryptog, Livermore, CA 94551 USA
[3] Sandia Natl Labs, Dept Discrete Algorithms & Math, Livermore, CA 94551 USA
[4] Univ New Mexico, Dept Comp Sci, Albuquerque, NM 87131 USA
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2004年 / 44卷 / 02期
关键词
D O I
10.1021/ci0341823
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
We present a new algorithm to canonize molecular graphs using the signature molecular descriptor introduced in the previous papers of this series. While developed specifically for molecular structures, the algorithm can be used for any Graph and is not limited to acyclic graphs, planar graphs, bounded valence, or bounded Genus Graphs, for which polynomial time algorithms exist. The algorithm is tested with benzenoid hydrocarbons and a database of 126 705 organic compounds. The algorithm's performances are compared against Brendan Mc Kay's Nauty algorithm, which is believed to be the fastest graph canonization algorithm for General Graphs, with five series of graphs each comprising up to 30 000 vertices: 2D meshes (pericondensed benzenoids), 3D cages (fullerenes and nanotubes), 3D meshes (crystal lattices), 4D cages, and power law Graphs (protein and gene networks). The algorithm can be downloaded as an open source code at http://www.cs.sandia.gov/similar tojfaulon/QSAR.
引用
收藏
页码:427 / 436
页数:10
相关论文
共 17 条
[1]  
*ACC, 2003, CER SOFTW
[2]   HIGHLY DISCRIMINATING DISTANCE-BASED TOPOLOGICAL INDEX [J].
BALABAN, AT .
CHEMICAL PHYSICS LETTERS, 1982, 89 (05) :399-404
[3]  
CHURCHWELL CJ, IN PRESS J MOL GRAPH
[4]  
Dinitz, 1996, CRC HDB COMBINATORIA
[5]   The signature molecular descriptor. 1. Using extended valence sequences in QSAR and QSPR studies [J].
Faulon, JL ;
Visco, DP ;
Pophale, RS .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2003, 43 (03) :707-720
[6]   The signature molecular descriptor. 2. Enumerating molecules from their extended valence sequences [J].
Faulon, JL ;
Churchwell, CJ ;
Visco, DP .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2003, 43 (03) :721-734
[7]   Isomorphism, automorphism partitioning, and canonical labeling can be solved in polynomial-time for molecular graphs [J].
Faulon, JL .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1998, 38 (03) :432-444
[8]   Unique labels for compounds [J].
Freemantle, M .
CHEMICAL & ENGINEERING NEWS, 2002, 80 (48) :33-35
[9]  
Hopcroft John E., 1972, IBM RES S SERIES, P131, DOI DOI 10.1007/978-1-4684-2001-2_13
[10]   The large-scale organization of metabolic networks [J].
Jeong, H ;
Tombor, B ;
Albert, R ;
Oltvai, ZN ;
Barabási, AL .
NATURE, 2000, 407 (6804) :651-654