The discovery of indicator variables for QSAR using inductive logic programming

被引:19
作者
King, RD [1 ]
Srinivasan, A
机构
[1] Univ Wales, Dept Comp Sci, Aberystwyth SY23 3DB, Ceredigion, Wales
[2] Univ Oxford, Comp Lab, Oxford OX1 3QD, England
基金
英国工程与自然科学研究理事会;
关键词
artificial intelligence; machine learning; regression;
D O I
10.1023/A:1007967728701
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
A central problem in forming accurate regression equations in QSAR studies is the selection of appropriate descriptors for the compounds under study. We describe a novel procedure for using inductive logic programming (ILP) to discover new indicator variables (attributes) for QSAR problems, and show that these improve the accuracy of the derived regression equations. ILP techniques have previously been shown to work well on drug design problems where there is a large structural component or where clear comprehensible rules are required. However, ILP techniques have had the disadvantage of only being able to make qualitative predictions e.g. active, inactive) and not to predict real numbers (regression). We unify ILP and linear regression techniques to give a QSAR method that has the strength of ILP at describing steric structure, with the familiarity and power of linear regression. We evaluated the utility of this new QSAR technique by examining the prediction of biological activity with and without the addition of new structural indicator variables formed by ILP. In three out of five datasets examined the addition of ILP variables produced statistically better results (P<0.01) over the original description. The new ILP variables did not increase the overall complexity of the derived QSAR equations and added insight into possible mechanisms of action. We conclude that ILP can aid in the process of drug design.
引用
收藏
页码:571 / 580
页数:10
相关论文
共 40 条
[1]   APPLICATIONS OF NEURAL NETWORKS IN QUANTITATIVE STRUCTURE-ACTIVITY-RELATIONSHIPS OF DIHYDROFOLATE-REDUCTASE INHIBITORS [J].
ANDREA, TA ;
KALAYEH, H .
JOURNAL OF MEDICINAL CHEMISTRY, 1991, 34 (09) :2824-2836
[2]  
BAHLER D, 1993, INTELLIGENT SYSTEMS
[3]   CRYSTALLOGRAPHIC INVESTIGATION OF THE COOPERATIVE INTERACTION BETWEEN TRIMETHOPRIM, REDUCED COFACTOR AND DIHYDROFOLATE-REDUCTASE [J].
CHAMPNESS, JN ;
STAMMERS, DK ;
BEDDELL, CR .
FEBS LETTERS, 1986, 199 (01) :61-67
[4]   COMPARATIVE MOLECULAR-FIELD ANALYSIS (COMFA) .1. EFFECT OF SHAPE ON BINDING OF STEROIDS TO CARRIER PROTEINS [J].
CRAMER, RD ;
PATTERSON, DE ;
BUNCE, JD .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 1988, 110 (18) :5959-5967
[5]   THE USE OF THE GRID PROGRAM IN THE 3-D QSAR ANALYSIS OF A SERIES OF CALCIUM-CHANNEL AGONISTS [J].
DAVIS, AM ;
GENSMANTEL, NP ;
JOHANSSON, E ;
MARRIOTT, DP .
JOURNAL OF MEDICINAL CHEMISTRY, 1994, 37 (07) :963-972
[6]   STRUCTURE ACTIVITY RELATIONSHIP OF MUTAGENIC AROMATIC AND HETEROAROMATIC NITRO-COMPOUNDS - CORRELATION WITH MOLECULAR-ORBITAL ENERGIES AND HYDROPHOBICITY [J].
DEBNATH, AK ;
DECOMPADRE, RLL ;
DEBNATH, G ;
SHUSTERMAN, AJ ;
HANSCH, C .
JOURNAL OF MEDICINAL CHEMISTRY, 1991, 34 (02) :786-797
[7]  
DeLong Howard, 1970, PROFILE MATH LOGIC
[8]   A STATISTICAL VIEW OF SOME CHEMOMETRICS REGRESSION TOOLS [J].
FRANK, IE ;
FRIEDMAN, JH .
TECHNOMETRICS, 1993, 35 (02) :109-135
[9]   A GENETIC ALGORITHM FOR THE AUTOMATED GENERATION OF MOLECULES WITHIN CONSTRAINTS [J].
GLEN, RC ;
PAYNE, AWR .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 1995, 9 (02) :181-202
[10]   CORRELATION OF BIOLOGICAL ACTIVITY OF PHENOXYACETIC ACIDS WITH HAMMETT SUBSTITUENT CONSTANTS AND PARTITION COEFFICIENTS [J].
HANSCH, C ;
MALONEY, PP ;
FUJITA, T .
NATURE, 1962, 194 (4824) :178-&