AUTOMATED CONFORMATIONAL-ANALYSIS FROM CRYSTALLOGRAPHIC DATA .2. SYMMETRY-MODIFIED JARVIS-PATRICK AND COMPLETE-LINKAGE CLUSTERING ALGORITHMS FOR 3-DIMENSIONAL PATTERN-RECOGNITION

被引:41
作者
ALLEN, FH [1 ]
DOYLE, MJ [1 ]
TAYLOR, R [1 ]
机构
[1] ICI AGROCHEM,JEALOTTS HILL RES STN,BRACKNELL RG12 6EY,BERKS,ENGLAND
来源
ACTA CRYSTALLOGRAPHICA SECTION B-STRUCTURAL SCIENCE | 1991年 / 47卷
关键词
D O I
10.1107/S0108768190010369
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The complete-linkage and Jarvis-Patrick clustering algorithms are used to identify discrete conformational subgroups for a chemical fragment from crystal structure data. Fragment conformations are defined by N(t) torsion angles for N(f) occurrences of the fragment in the Cambridge Structural Database. Both algorithms are extensively modified to handle 2D topological symmetry of the fragment and the 3D conformational enantiomers which occur in crystal structures. The modified procedures ensure that symmetry equivalents of a given conformation are optimally superimposed in a unique cluster. A modified single-linkage algorithm, based on cluster centroids, is used to place the discrete clusters in a single asymmetric unit of conformational space. Principal-component analysis provides a graphical representation of the clustering process. The complete-linkage and Jarvis-Patrick algorithms may be preferable to single-linkage cluster analysis [Allen, Doyle & Taylor (1991). Acta Cryst. B47, 29-40] since they minimize the effects of 'chaining' i.e. the linkage of major clusters through a chain of outlying observations. Both of the new algorithms have been tested using a trial data set of 222 six-membered carbocycles of known conformational complexity. The new algorithms are judged to provide a more effective conformational breakdown of the trial data set (in chemical terms) than that obtained with the single-linkage method alone.
引用
收藏
页码:41 / 49
页数:9
相关论文
共 8 条
[1]   SYSTEMATIC ANALYSIS OF STRUCTURAL DATA AS A RESEARCH TECHNIQUE IN ORGANIC-CHEMISTRY [J].
ALLEN, FH ;
KENNARD, O ;
TAYLOR, R .
ACCOUNTS OF CHEMICAL RESEARCH, 1983, 16 (05) :146-153
[2]   AUTOMATED CONFORMATIONAL-ANALYSIS FROM CRYSTALLOGRAPHIC DATA .1. A SYMMETRY-MODIFIED SINGLE-LINKAGE CLUSTERING-ALGORITHM FOR 3-DIMENSIONAL PATTERN-RECOGNITION [J].
ALLEN, FH ;
DOYLE, MJ ;
TAYLOR, R .
ACTA CRYSTALLOGRAPHICA SECTION B-STRUCTURAL SCIENCE, 1991, 47 :29-&
[3]   AUTOMATED CONFORMATIONAL-ANALYSIS FROM CRYSTALLOGRAPHIC DATA .3. 3-DIMENSIONAL PATTERN-RECOGNITION WITHIN THE CAMBRIDGE STRUCTURAL DATABASE SYSTEM - IMPLEMENTATION AND PRACTICAL EXAMPLES [J].
ALLEN, FH ;
DOYLE, MJ ;
TAYLOR, R .
ACTA CRYSTALLOGRAPHICA SECTION B-STRUCTURAL SCIENCE, 1991, 47 :50-61
[4]  
ALLEN FH, 1991, UNPUB ACTA CRYST B, V47
[5]   CONFORMATION OF 6-MEMBERED RINGS [J].
BOEYENS, JCA .
JOURNAL OF CRYSTAL AND MOLECULAR STRUCTURE, 1978, 8 (06) :317-320
[6]  
Everitt B., 1980, CLUSTER ANAL
[7]   CLUSTERING USING A SIMILARITY MEASURE BASED ON SHARED NEAR NEIGHBORS [J].
JARVIS, RA ;
PATRICK, EA .
IEEE TRANSACTIONS ON COMPUTERS, 1973, C-22 (11) :1025-1034
[8]   IMPLEMENTATION OF NON-HIERARCHICAL CLUSTER-ANALYSIS METHODS IN CHEMICAL INFORMATION-SYSTEMS - SELECTION OF COMPOUNDS FOR BIOLOGICAL TESTING AND CLUSTERING OF SUBSTRUCTURE SEARCH OUTPUT [J].
WILLETT, P ;
WINTERMAN, V ;
BAWDEN, D .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1986, 26 (03) :109-118