QSAR applicability domain estimation by projection of the training set in descriptor space: A review

被引:535
作者
Jaworska, J [1 ]
Nikolova-Jeliazkova, N
Aldenberg, T
机构
[1] Procter & Gamble Co, Eurocor, Strombeek Bever, Belgium
[2] Bulgarian Acad Sci, Inst Parallel Proc, Sofia, Bulgaria
[3] RIVM, Bilthoven, Netherlands
来源
ATLA-ALTERNATIVES TO LABORATORY ANIMALS | 2005年 / 33卷 / 05期
关键词
applicability domain; multivariate interpolation; QSAR;
D O I
10.1177/026119290503300508
中图分类号
R-3 [医学研究方法]; R3 [基础医学];
学科分类号
1001 ;
摘要
As the use of Quantitative Structure Activity Relationship (QSAR) models for chemical management increases, the reliability of the predictions from such models is a matter of growing concern. The OECD QSAR Validation Principles recommend that a model should be used within its applicability domain (AD). The Setubal Workshop report provided conceptual guidance on defining a (Q)SAR AD, but it is difficult to use directly. The practical application of the AD concept requires an operational definition that permits the design of an automatic (computerised), quantitative procedure to determine a model's AD. An attempt is made to address this need, and methods and criteria for estimating AD through training set interpolation in descriptor space are reviewed. It is proposed that response space should be included in the training set representation. Thus, training set chemicals are points in n-dimensional descriptor space and m-dimensional model response space. Four major approaches for estimating interpolation regions in a multivariate space are reviewed and compared: range, distance, geometrical, and probability density distribution.
引用
收藏
页码:445 / 459
页数:15
相关论文
共 30 条
[1]  
[Anonymous], 2000, CLASSICAL MODERN REG
[2]   Molecular similarity: a key technique in molecular informatics [J].
Bender, A ;
Glen, RC .
ORGANIC & BIOMOLECULAR CHEMISTRY, 2004, 2 (22) :3204-3218
[3]   Monte Carlo estimation of Bayesian credible and HPD intervals [J].
Chen, MH ;
Shao, QM .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 1999, 8 (01) :69-92
[4]   A QSAR INVESTIGATION OF THE ROLE OF HYDROPHOBICITY IN REGULATING MUTAGENICITY IN THE AMES TEST .1. MUTAGENICITY OF AROMATIC AND HETEROAROMATIC AMINES IN SALMONELLA-TYPHIMURIUM TA98 AND TA100 [J].
DEBNATH, AK ;
DEBNATH, G ;
SHUSTERMAN, AJ ;
HANSCH, C .
ENVIRONMENTAL AND MOLECULAR MUTAGENESIS, 1992, 19 (01) :37-52
[5]  
*ECETOC, 2003, 89 ECETOC QSAR TF
[6]   Methods for reliability and uncertainty assessment and for applicability evaluations of classification- and regression-based QSARs [J].
Eriksson, L ;
Jaworska, J ;
Worth, AP ;
Cronin, MTD ;
McDowell, RM ;
Gramatica, P .
ENVIRONMENTAL HEALTH PERSPECTIVES, 2003, 111 (10) :1361-1375
[7]  
Flannery B.P., 1992, NUMERICAL RECIPES C
[8]   EXPLORATORY PROJECTION PURSUIT [J].
FRIEDMAN, JH .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1987, 82 (397) :249-266
[9]  
Fukunaga K., 1990, INTRO STAT PATTERN R
[10]   Transformation of mutagenic aromatic amines into non-mutagenic species by alkyl substituents Part I.: Alkylation ortho to the amino function [J].
Glende, C ;
Schmitt, H ;
Erdinger, L ;
Engelhardt, G ;
Boche, G .
MUTATION RESEARCH-GENETIC TOXICOLOGY AND ENVIRONMENTAL MUTAGENESIS, 2001, 498 (1-2) :19-37