Structure-activity relationships derived by machine learning: The use of atoms and their bond connectivities to predict mutagenicity by inductive logic programming

被引:161
作者
King, RD [1 ]
Muggleton, SH [1 ]
Srinivasan, A [1 ]
Sternberg, MJE [1 ]
机构
[1] UNIV OXFORD, COMP LAB, OXFORD OX1 3QD, ENGLAND
关键词
D O I
10.1073/pnas.93.1.438
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
We present a general approach to forming structure-activity relationships (SARs). This approach is based on representing chemical structure by atoms and their bond connectivities in combination with the inductive logic programming (ILP) algorithm PROGOL. Existing SAR methods describe chemical structure by using attributes which are general properties of an object, It is not possible to map chemical structure directly to attribute-based descriptions, as such descriptions have no internal organization, A more natural and general way to describe chemical structure is to use a relational description, where the internal construction of the description maps that of the object described. Our atom and bond connectivities representation is a relational description, ILP algorithms can form SARs with relational descriptions, We have tested the relational approach by investigating the SARs of 230 aromatic and heteroaromatic nitro compounds, These compounds had been split previously into two subsets, 188 compounds that were amenable to regression and 42 that were not, For the 188 compounds, a SBR was found that was as accurate as the best statistical or neural network-generated SARs, The PROGOL SAR has the advantages that it did not need the use of any indicator variables handcrafted by an expert, and the generated rules were easily comprehensible. For the 42 compounds, PROGOL formed a SAR that was significantly (P < 0.025) more accurate than linear regression, quadratic regression, and back-propagation. This SAR is based on an automatically generated structural alert for mutagenicity.
引用
收藏
页码:438 / 442
页数:5
相关论文
共 26 条
[1]   SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation [J].
Blewitt, Marnie E. ;
Gendrel, Anne-Valerie ;
Pang, Zhenyi ;
Sparrow, Duncan B. ;
Whitelaw, Nadia ;
Craig, Jeffrey M. ;
Apedaile, Anwyn ;
Hilton, Douglas J. ;
Dunwoodie, Sally L. ;
Brockdorff, Neil ;
Kay, Graham F. ;
Whitelaw, Emma .
NATURE GENETICS, 2008, 40 (05) :663-669
[2]   COMPARATIVE MOLECULAR-FIELD ANALYSIS (COMFA) .1. EFFECT OF SHAPE ON BINDING OF STEROIDS TO CARRIER PROTEINS [J].
CRAMER, RD ;
PATTERSON, DE ;
BUNCE, JD .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 1988, 110 (18) :5959-5967
[3]   STRUCTURE ACTIVITY RELATIONSHIP OF MUTAGENIC AROMATIC AND HETEROAROMATIC NITRO-COMPOUNDS - CORRELATION WITH MOLECULAR-ORBITAL ENERGIES AND HYDROPHOBICITY [J].
DEBNATH, AK ;
DECOMPADRE, RLL ;
DEBNATH, G ;
SHUSTERMAN, AJ ;
HANSCH, C .
JOURNAL OF MEDICINAL CHEMISTRY, 1991, 34 (02) :786-797
[4]  
DeLong Howard, 1970, PROFILE MATH LOGIC
[5]   A STATISTICAL VIEW OF SOME CHEMOMETRICS REGRESSION TOOLS [J].
FRANK, IE ;
FRIEDMAN, JH .
TECHNOMETRICS, 1993, 35 (02) :109-135
[6]   CORRELATION OF BIOLOGICAL ACTIVITY OF PHENOXYACETIC ACIDS WITH HAMMETT SUBSTITUENT CONSTANTS AND PARTITION COEFFICIENTS [J].
HANSCH, C ;
MALONEY, PP ;
FUJITA, T .
NATURE, 1962, 194 (4824) :178-&
[7]   QUANTITATIVE STRUCTURE-ACTIVITY-RELATIONSHIPS BY NEURAL NETWORKS AND INDUCTIVE LOGIC PROGRAMMING .2. THE INHIBITION OF DIHYDROFOLATE-REDUCTASE BY TRIAZINES [J].
HIRST, JD ;
KING, RD ;
STERNBERG, MJE .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 1994, 8 (04) :421-432
[8]   QUANTITATIVE STRUCTURE-ACTIVITY-RELATIONSHIPS BY NEURAL NETWORKS AND INDUCTIVE LOGIC PROGRAMMING .1. THE INHIBITION OF DIHYDROFOLATE-REDUCTASE BY PYRIMIDINES [J].
HIRST, JD ;
KING, RD ;
STERNBERG, MJE .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 1994, 8 (04) :405-420
[10]   STATLOG - COMPARISON OF CLASSIFICATION ALGORITHMS ON LARGE REAL-WORLD PROBLEMS [J].
KING, RD ;
FENG, C ;
SUTHERLAND, A .
APPLIED ARTIFICIAL INTELLIGENCE, 1995, 9 (03) :289-333