Correlation analysis between watersolubility, octanol-water partition coefficient and melting point based on clustering

被引:12
作者
Nouwen, J [1 ]
Hansen, B [1 ]
机构
[1] Commiss European Communities, Joint Res Ctr, Ispra Estab, European Chem Bur TP 280, I-21020 Ispra, Italy
来源
QUANTITATIVE STRUCTURE-ACTIVITY RELATIONSHIPS | 1996年 / 15卷 / 01期
关键词
QSAR; clustering; bootstrap; EINECS; Jarvis-Patrick; Tanimoto coefficient; logWS; logK(ow); mp;
D O I
10.1002/qsar.19960150105
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
By means of clustering according to structure similarity one can manage easily large databases containing several Lens of thousands of chemicals. A new methodology in which statistical criteria are used to decide the optimal threshold in the clustering process is proposed. Clustering defined the several chemical classes that were present in our training set. All the clusters except one showed a correlation of logWS with logK(ow) and melting point. The latter covers all chemicals with melting points below room temperature, resulting in a logWS-logK(ow) relationship. This cluster showed also a weak correlation. probably due to the insufficient number of available screens. Such a limited number of screens permits that rather different chemicals share the same cluster. Using statistical criteria, our approach resulted in 3 QSARs with reasonably good predictive capabilities, The models resulting from the smaller clusters are characterised by high correlation coefficients, describing the cluster itself very well. Due to our stringent bootstrap criterion, they perform close to random models as indicated by the statistical characteristics. Some clusters showed rather low correlations. The well behaved models proved their usefulness through external validation. The logWS-values calculated using our QSARs agreed within 1 log-unit with these reported in the literature.
引用
收藏
页码:17 / 30
页数:14
相关论文
共 29 条
[1]  
*ACS, 1981, CAS ONL SCREEN DICT
[2]  
*AM I CHEM ENG, 1989, DIPPR DAT COMP PUR C
[3]  
*BER UMW ALTST, 1986, UMW ALT STOFF AUSW S
[4]  
*BER UMW ALTST, 1988, UMW ALT STOFF AUSW S, V2
[6]  
BRUGGEMAN R, 1991, ABSCHATZUNG PHYSIKOC
[7]  
COCCINI T, 1992, QUANTITATIVE STRUCTU, P5
[8]  
*COMM EUR COMM, DGXIA2 COMM EUR COMM
[9]  
DANNENFELSER RM, 1989, ARIZONA DATABASE
[10]  
DAUBERT TE, 1985, PUBLICATION AM I C X, V112