Binary formal inference-based recursive modeling using multiple atom and physicochemical property class pair and torsion descriptors as decision criteria

被引:22
作者
Cho, SJ
Shen, CF
Hermsmeier, MA
机构
[1] Bristol Myers Squibb Co, Combinatorial Drug Discovery, Wallingford, CT 06492 USA
[2] Bristol Myers Squibb Co, Nonclin Biostat, Princeton, NJ 08543 USA
[3] Bristol Myers Squibb Co, Combinatorial Drug Discovery, Princeton, NJ 08543 USA
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2000年 / 40卷 / 03期
关键词
D O I
10.1021/ci9908190
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Analysis of a large amount of information, typically generated by high-throughput screening, is a very difficult task. To address this problem, we have developed binary formal inference-based recursive modeling using atom and physicochemical property class pair and torsion descriptors. Recursive partitioning is an exploratory technique for identifying structure in data. The implemented algorithm utilizes a statistical hypothesis resting, similar to Hawkins' formal inference-based recursive modeling program, to separate a data set into two homogeneous subsets at each splitting node. This process is repented recursively until no further separation can occur. Our implementation of recursive partitioning differs from previously reported approaches by employing a method to extract multiple features at each splitting node. The method was examined for its ability to distinguish random and real data sets. The effect of including a single descriptor and multiple descriptors in the splitting descriptor set was also studied. The method was tested using 27 401 National Cancer Institute (NCI) compounds and their pGI50 (-log(GI(50))) against the NCl-H23 cell line. The analyses show that partitioning using multiple descriptors is advantageous in analyzing the structure-activity relationship information.
引用
收藏
页码:668 / 680
页数:13
相关论文
共 63 条
[1]  
AGRAFIOTIS DK, 1998, ENCYC COMPUTATIONAL
[2]  
AJAY A, 1993, J MED CHEM, V36, P3565
[3]  
BOHACHEVSKY IO, 1986, TECHNOMETRICS, V28, P209
[4]   Use of structure Activity data to compare structure-based clustering methods and descriptors for use in compound selection [J].
Brown, RD ;
Martin, YC .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1996, 36 (03) :572-584
[5]   MOLECULAR-IDENTIFICATION NUMBER FOR SUBSTRUCTURE SEARCHES [J].
BURDEN, FR .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1989, 29 (03) :225-227
[6]   ATOM PAIRS AS MOLECULAR-FEATURES IN STRUCTURE ACTIVITY STUDIES - DEFINITION AND APPLICATIONS [J].
CARHART, RE ;
SMITH, DH ;
VENKATARAGHAVAN, R .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1985, 25 (02) :64-73
[7]  
*CHEM DES LTD, 1997, CHEM X US GUID
[8]   Recursive partitioning analysis of a large structure-activity data set using three-dimensional descriptors [J].
Chen, X ;
Rusinko, A ;
Young, SS .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1998, 38 (06) :1054-1062
[9]   AMINO-ACID SIDE-CHAIN DESCRIPTORS FOR QUANTITATIVE STRUCTURE-ACTIVITY RELATIONSHIP STUDIES OF PEPTIDE ANALOGS [J].
COLLANTES, ER ;
DUNN, WJ .
JOURNAL OF MEDICINAL CHEMISTRY, 1995, 38 (14) :2705-2713
[10]   COMPARATIVE MOLECULAR-FIELD ANALYSIS (COMFA) .1. EFFECT OF SHAPE ON BINDING OF STEROIDS TO CARRIER PROTEINS [J].
CRAMER, RD ;
PATTERSON, DE ;
BUNCE, JD .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 1988, 110 (18) :5959-5967