The importance of scaling in data mining for toxicity prediction

被引:35
作者
Mazzatorta, P
Benfenati, E
Neagu, D
Gini, G
机构
[1] Ist Ric Farmacol Mario Negri, I-20157 Milan, Italy
[2] Politecn Milan, Dipartimento Elettron & Informazione, I-20133 Milan, Italy
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2002年 / 42卷 / 05期
关键词
D O I
10.1021/ci025520n
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
While mining a data set of 554 chemicals in order to extract information on their toxicity value, we faced the problem of scaling all the data. There are numerous different approaches to this procedure, and in most cases the choice greatly influences the results. The aim of this paper is 2-fold. First,we propose a universal scaling procedure for acute toxicity in fish according to the Directive 92/32/EEC. Second, we look at how expert preprocessing of the data effects the performance of qualitative structure-activity relationship (QSAR) approach to toxicity prediction.
引用
收藏
页码:1250 / 1255
页数:6
相关论文
共 28 条
[1]  
[Anonymous], INTELL DATA ANAL
[2]   A Bayesian neural network method for adverse drug reaction signal generation [J].
Bate, A ;
Lindquist, M ;
Edwards, IR ;
Olsson, S ;
Orre, R ;
Lansner, A ;
De Freitas, RM .
EUROPEAN JOURNAL OF CLINICAL PHARMACOLOGY, 1998, 54 (04) :315-321
[3]  
BECRAFT R, 1991, P INT JOINT C ART IN, P832
[4]   Factors influencing predictive models for toxicology [J].
Benfenati, E ;
Piclin, N ;
Roncaglioni, A ;
Varì, MR .
SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2001, 12 (06) :593-603
[5]  
BENFENATI E, 1999, GIR SEM MOL SIM 5 7
[6]  
Bishop C. M., 1995, NEURAL NETWORKS PATT
[7]   Structure-odor relationships: Using neural networks in the estimation of camphoraceous or fruity odors and olfactory thresholds of aliphatic alcohols [J].
Chastrette, M ;
Cretin, D ;
ElAidi, C .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1996, 36 (01) :108-113
[8]   An interior trust region approach for nonlinear minimization subject to bounds [J].
Coleman, TF ;
Li, YY .
SIAM JOURNAL ON OPTIMIZATION, 1996, 6 (02) :418-445
[9]  
Coleman TF., 1994, Mathematical Programming, V67, P189, DOI [DOI 10.1007/BF01582221, 10.1007/BF01582221]
[10]   THE INFLUENCE OF DATA PREPROCESSING ON THE ROBUSTNESS AND PARSIMONY OF MULTIVARIATE CALIBRATION MODELS [J].
DENOORD, OE .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1994, 23 (01) :65-70