Predictive toxicology: Benchmarking molecular descriptors and statistical methods

被引:62
作者
Feng, J
Lurati, L
Ouyang, H
Robinson, T
Wang, YY
Yuan, SL
Young, SS
机构
[1] Natl Inst Stat Sci, Raleigh, NC 27607 USA
[2] Univ N Carolina, Chapel Hill, NC 27599 USA
[3] Brown Univ, Providence, RI 02912 USA
[4] Univ Toledo, Toledo, OH 43606 USA
[5] N Carolina State Univ, Raleigh, NC 27695 USA
[6] Univ Waterloo, Waterloo, ON N2L 3G1, Canada
[7] Auburn Univ, Auburn, AL 36849 USA
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2003年 / 43卷 / 05期
关键词
D O I
10.1021/ci034032s
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The development of drugs depends on finding compounds that have beneficial effects with a minimum of toxic effects. The measurement of toxic effects is typically time-consuming and expensive, so there is a need to be able to predict toxic effects from the compound structure. Predicting toxic effects is expected to be challenging because there are usually multiple toxic mechanisms involved. In this paper, combinations of different chemical descriptors and popular statistical methods were applied to the problem of predictive toxicology. Four data sets were collected and cleaned, and four different sets of chemical descriptors were calculated for the compounds in each of the four data sets. Three statistical methods (recursive partitioning, neural networks, and partial least squares) were used to attempt to link chemical descriptors to the response. Good predictions were achieved in the two smaller data sets; we found for large data sets that the results were less effective, indicating that new chemical descriptors or statistical methods are needed. All of the methods and descriptors worked to a degree, but our work hints that certain descriptors work better with specific statistical methods so there is a need for better understanding and for continued methods development.
引用
收藏
页码:1463 / 1470
页数:8
相关论文
共 13 条
[1]   MOLECULAR-IDENTIFICATION NUMBER FOR SUBSTRUCTURE SEARCHES [J].
BURDEN, FR .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1989, 29 (03) :225-227
[2]   A chemically intuitive molecular index based on the eigenvalues of a modified adjacency matrix [J].
Burden, FR .
QUANTITATIVE STRUCTURE-ACTIVITY RELATIONSHIPS, 1997, 16 (04) :309-314
[3]   A quantitative structure-activity relationships model for the acute toxicity of substituted benzenes to Tetrahymena pyriformis using Bayesian-regularized neural networks [J].
Burden, FR ;
Winkler, DA .
CHEMICAL RESEARCH IN TOXICOLOGY, 2000, 13 (06) :436-440
[4]  
Dearden JC, 1997, ATLA-ALTERN LAB ANIM, V25, P223
[5]   Computer systems for the prediction of toxicity: an update [J].
Greene, N .
ADVANCED DRUG DELIVERY REVIEWS, 2002, 54 (03) :417-431
[6]  
Hastie T, 2008, The elements of statistical learning, Vsecond, DOI DOI 10.1007/978-0-387-21606-5
[7]   Metric validation and the receptor-relevant subspace concept [J].
Pearlman, RS ;
Smith, KM .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1999, 39 (01) :28-35
[8]  
Ripley B. D., 1996, Pattern Recognition and Neural Networks
[9]   Analysis of a large structure/biological activity data set using recursive partitioning [J].
Rusinko, A ;
Farmen, MW ;
Lambert, CG ;
Brown, PL ;
Young, SS .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1999, 39 (06) :1017-1026
[10]   Evaluation and use of BCUT descriptors in QSAR and QSPR studies [J].
Stanton, DT .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1999, 39 (01) :11-20