Predicting the genotoxicity of thiophene derivatives from molecular structure

被引:41
作者
Mosier, PD [2 ]
Jurs, PC
Custer, LL
Durham, SK
Pearl, GM
机构
[1] Bristol Myers Squibb Co, Princeton, NJ 08543 USA
[2] Penn State Univ, Dept Chem, University Pk, PA 16802 USA
关键词
D O I
10.1021/tx020104i
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
We report several binary classification models that directly link the genetic toxicity of a series of 140 thiophene derivatives with information derived from the compounds' molecular structure. Genetic toxicity was measured using an SOS Chromotest. IMAX (maximal SOS induction factor) values were recorded for each of the 140 compounds both in the presence and in the absence of S9 rat liver homogenate. Compounds were classified as genotoxic if IMAX greater than or equal to1.5 in either test or nongenotoxic if IMAX < 1.5 for both tests. The molecular structures were represented by numerical descriptors that encoded the topological, geometric, electronic, and polar surface area properties of the thiophene derivatives. The classification models used were linear discriminant analysis (LDA), k-nearest neighbor classification (k-NN), and the probabilistic neural network (PNN). These were used in conjunction with either a genetic algorithm or a generalized simulated annealing to find optimal subsets of descriptors for each classifier. The quality of the resulting models was determined by the number of misclassified compounds, with preference given to models that produced fewer false negative classifications. Model sizes ranged from seven descriptors for LDA to three descriptors for k-NN and PNN. Very good classification results were obtained with all three classifiers. Classification rates for the LDA, k-NN, and PNN models were 80, 85, and 85%, respectively, for the prediction set compounds. Additionally, a consensus model was generated that incorporated all three of the basic model types. This consensus model correctly predicted the genotoxicity of 95% of the prediction set compounds.
引用
收藏
页码:721 / 732
页数:12
相关论文
共 69 条
[61]   A SIMPLE METHOD FOR THE REPRESENTATION, QUANTIFICATION, AND COMPARISON OF THE VOLUMES AND SHAPES OF CHEMICAL-COMPOUNDS [J].
STOUCH, TR ;
JURS, PC .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1986, 26 (01) :4-12
[62]  
Stuper A.J., 1979, Computer-Assisted Studies of Chemical Structure and Biological Function
[63]   The SOS response:: Recent insights into umuDC-dependent mutagenesis and DNA damage tolerance [J].
Sutton, MD ;
Smith, BT ;
Godoy, VG ;
Walker, GC .
ANNUAL REVIEW OF GENETICS, 2000, 34 :479-497
[64]  
Todeschini R., 2000, METHODS PRINCIPLES M, DOI 10.1002/9783527613106
[65]   CHANCE FACTORS IN STUDIES OF QUANTITATIVE STRUCTURE-ACTIVITY-RELATIONSHIPS [J].
TOPLISS, JG ;
EDWARDS, RP .
JOURNAL OF MEDICINAL CHEMISTRY, 1979, 22 (10) :1238-1244
[66]  
Wold S., 1995, QSAR: Chemometric methods in molecular design: Methods and principles in medicinal chemistry, P195
[67]   DEVELOPMENT OF STRUCTURE-ACTIVITY RELATIONSHIP RULES FOR PREDICTING CARCINOGENIC POTENTIAL OF CHEMICALS [J].
WOO, YT ;
LAI, DY ;
ARGUS, MF ;
ARCOS, JC .
TOXICOLOGY LETTERS, 1995, 79 (1-3) :219-228
[68]   COMPUTER-ASSISTED STRUCTURE-ACTIVITY STUDIES OF CHEMICAL CARCINOGENS - POLYCYCLIC AROMATIC HYDROCARBON DATA SET [J].
YUAN, M ;
JURS, PC .
TOXICOLOGY AND APPLIED PHARMACOLOGY, 1980, 52 (02) :294-312
[69]   COMPUTER-ASSISTED STRUCTURE-ACTIVITY STUDIES OF CHEMICAL CARCINOGENS - AROMATIC-AMINES [J].
YUTA, K ;
JURS, PC .
JOURNAL OF MEDICINAL CHEMISTRY, 1981, 24 (03) :241-251