The signature molecular descriptor. 2. Enumerating molecules from their extended valence sequences

被引:116
作者
Faulon, JL
Churchwell, CJ
Visco, DP
机构
[1] Sandia Natl Labs, Livermore, CA 94551 USA
[2] Tennessee Technol Univ, Dept Chem Engn, Cookeville, TN 38505 USA
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2003年 / 43卷 / 03期
关键词
ACTIVITY RELATIONSHIP MODELS; INFORMATION-TRANSFER; TOPOLOGICAL INDEXES; GENETIC ALGORITHM; GRAPHS; DESIGN; DEGENERACY; FAMILIES; CONSTRUCTION; DATABASE;
D O I
10.1021/ci020346o
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
We present a new algorithm that enumerates molecular structures matching a predefined extended valence sequence or signature. The algorithm can construct molecular structures composed of about 50 non-hydrogen atoms in CPU seconds time scale. The algorithm is run to produce all molecular structures matching the binding affinities (IC50) of some HIV-1 protease inhibitors. The algorithm is also used to compute the degeneracy, or the number of molecular structures, corresponding to a given signature. Signature degeneracy is systematically studied for varying signature heights on four molecular series, alkanes, alcohols, fullerene-type structures, and peptides. Signature degeneracy is compared with similar results obtained with popular topological indices (TIs). As a general rule, we find that signature degeneracy decreases as the signature height increases. We also find that alkanes, alcohols, and fullerene-type structures comprising n non-hydrogen atoms are uniquely characterized by signatures of height n/4, while peptides up to 4000 amino acids can be singled out with signatures of heights as small as 2 and 3.
引用
收藏
页码:721 / 734
页数:14
相关论文
共 37 条
[1]   Mass spectrometry in proteomics [J].
Aebersold, R ;
Goodlett, DR .
CHEMICAL REVIEWS, 2001, 101 (02) :269-295
[2]  
[Anonymous], MOLCONN Z
[3]   CHEMICAL GRAPHS - LOOKING BACK AND GLIMPSING AHEAD [J].
BALABAN, AT .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1995, 35 (03) :339-350
[4]  
BALABAN AT, 1994, REV ROUM CHIM, V39, P245
[5]   LOCAL VERSUS GLOBAL (IE ATOMIC VERSUS MOLECULAR) NUMERICAL MODELING OF MOLECULAR GRAPHS [J].
BALABAN, AT .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1994, 34 (02) :398-402
[6]   ORDERLY ALGORITHMS FOR GENERATING RESTRICTED CLASSES OF GRAPHS [J].
COLBOURN, CJ ;
READ, RC .
JOURNAL OF GRAPH THEORY, 1979, 3 (02) :187-195
[7]   CTI - A NOVEL CHARGE-RELATED TOPOLOGICAL INDEX WITH LOW DEGENERACY [J].
DEMIREV, PA ;
DYULGEROV, AS ;
BANGOV, IP .
JOURNAL OF MATHEMATICAL CHEMISTRY, 1991, 8 (04) :367-382
[8]   GRAPH-THEORETICAL INVARIANT OF RANDIC REVISITED [J].
ESTRADA, E .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1995, 35 (06) :1022-1025
[9]   The signature molecular descriptor. 1. Using extended valence sequences in QSAR and QSPR studies [J].
Faulon, JL ;
Visco, DP ;
Pophale, RS .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2003, 43 (03) :707-720
[10]   ON USING GRAPH-EQUIVALENT CLASSES FOR THE STRUCTURE ELUCIDATION OF LARGE MOLECULES [J].
FAULON, JL .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1992, 32 (04) :338-348