Status of HTS data mining approaches

被引:30
作者
Böcker, A
Schneider, G
Teekentrup, A [1 ]
机构
[1] Boehringer Ingelheim Pharma GMBH & Co KG, Dept Lead Discovery, D-88397 Biberach, Germany
[2] Univ Frankfurt, Inst Organ Chem & Chem Biol, D-60439 Frankfurt, Germany
来源
QSAR & COMBINATORIAL SCIENCE | 2004年 / 23卷 / 04期
关键词
classification; feature extraction; library design; machine leaning; molecular descriptor; similarity; virtual screening;
D O I
10.1002/qsar.200330860
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
High-throughput screening of large compound collections results in large sets of data. This review gives an overview of the most frequently employed computational techniques for the analysis of such data and the establishment of first QSAR models. Various methods for descriptor selection, classification and data mining are discussed. Recent trends include the application of kernel-based machine learning methods for the design of focused libraries and compilation of target-family biased compound collections.
引用
收藏
页码:207 / 213
页数:7
相关论文
共 114 条
[41]   Chemical descriptors with distinct levels of information content and varying sensitivity to differences between selected compound databases identified by SE-DSE analysis [J].
Godden, JW ;
Bajorath, J .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2002, 42 (01) :87-93
[42]   Median partitioning: A novel method for the selection of representative subsets from large compound pools [J].
Godden, JW ;
Xue, L ;
Kitchen, DB ;
Stahura, FL ;
Schermerhorn, EJ ;
Bajorath, J .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2002, 42 (04) :885-893
[43]   Prediction of biological activity for high-throughput screening using binary kernel discrimination [J].
Harper, G ;
Bradshaw, J ;
Gittins, JC ;
Green, DVS ;
Leach, AR .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2001, 41 (05) :1295-1300
[44]   GA strategy for variable selection in QSAR studies: GA-based PLS analysis of calcium channel antagonists [J].
Hasegawa, K ;
Miyashita, Y ;
Funatsu, K .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1997, 37 (02) :306-310
[45]   Analysis of a large structure-activity data set using recursive partitioning [J].
Hawkins, DM ;
Young, SS ;
Rusinko, A .
QUANTITATIVE STRUCTURE-ACTIVITY RELATIONSHIPS, 1997, 16 (04) :296-302
[46]   Experimental designs for selecting molecules from large chemical databases [J].
Higgs, RE ;
Bemis, KG ;
Watson, IA ;
Wikel, JH .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1997, 37 (05) :861-870
[47]  
HOELTJE HD, 2001, RATIONAL APPROACHES
[48]   Variable selection for QSAR by artificial ant colony systems [J].
Izrailev, S ;
Agrafiotis, DK .
SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2002, 13 (3-4) :417-423
[49]  
Izrailev S, 2001, J CHEM INF COMP SCI, V41, P176, DOI 10.1021/ci000036s
[50]  
JAMES CA, DAYLIGHT THEORY MANU