SAR modeling of unbalanced data sets

被引:11
作者
Rosenkranz, HS [1 ]
Cunningham, AR [1 ]
机构
[1] Univ Pittsburgh, Grad Sch Publ Hlth, Dept Environm & Occupat Hlth, Pittsburgh, PA 15261 USA
关键词
unbalanced data; SAR; case/multicase; optimum models;
D O I
10.1080/10629360108032916
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The increased acceptance of SAR approaches to hazard identification has led us to investigate methods to improve the predictive performance of SAR models. In the present study we demonstrate that although on theoretical grounds the ratio of active to inactive chemicals in the learning set should be unity, SAR models can "tolerate" an unbalanced range in ratios from 3 : 1 (i,e., 75% actives) to 1 : 2 (i.e., 33% actives) and still perform adequately. On the other hand SAR models derived from learning sets with ratios in excess of 4 : 1 (80% actives), even when corrected for the initial ratio do not perform satisfactorily.
引用
收藏
页码:267 / 274
页数:8
相关论文
共 21 条
[11]  
Macina OT, 1999, CARCINOGENICITY, P227
[12]   The practice of structure activity relationships (SAR) in toxicology [J].
McKinney, JD ;
Richard, A ;
Waller, C ;
Newman, MC ;
Gerberick, F .
TOXICOLOGICAL SCIENCES, 2000, 56 (01) :8-17
[13]   SALMONELLA MUTAGENICITY TESTS .2. RESULTS FROM THE TESTING OF 270 CHEMICALS [J].
MORTELMANS, K ;
HAWORTH, S ;
LAWLOR, T ;
SPECK, W ;
TAINER, B ;
ZEIGER, E .
ENVIRONMENTAL MUTAGENESIS, 1986, 8 :1-119
[14]  
NRC: National Research Council, 1994, SCI JUDGM RISK ASS C
[15]   Development, characterization and application of predictive-toxicology models [J].
Rosenkranz, HS ;
Cunningham, AR ;
Zhang, YP ;
Claycamp, HG ;
Macina, OT ;
Sussman, NB ;
Grant, SG ;
Klopman, G .
SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 1999, 10 (2-3) :277-298
[16]  
Takihi N, 1993, Qual Assur, V2, P255
[17]   AN APPROACH FOR EVALUATING AND INCREASING THE INFORMATIONAL CONTENT OF MUTAGENICITY AND CLASTOGENICITY DATA-BASES [J].
TAKIHI, N ;
ZHANG, YP ;
KLOPMAN, G ;
ROSENKRANZ, HS .
MUTAGENESIS, 1993, 8 (03) :257-264
[18]  
ZEIGER E, 1987, CANCER RES, V47, P1287
[19]   SALMONELLA MUTAGENICITY TESTS .4. RESULTS FROM THE TESTING OF 300 CHEMICALS [J].
ZEIGER, E ;
ANDERSON, B ;
HAWORTH, S ;
LAWLOR, T ;
MORTELMANS, K .
ENVIRONMENTAL AND MOLECULAR MUTAGENESIS, 1988, 11 :1-158
[20]   SALMONELLA MUTAGENICITY TESTS .3. RESULTS FROM THE TESTING OF 255 CHEMICALS [J].
ZEIGER, E ;
ANDERSON, B ;
HAWORTH, S ;
LAWLOR, T ;
MORTELMANS, K ;
SPECK, W .
ENVIRONMENTAL MUTAGENESIS, 1987, 9 :1-109