Status of HTS data mining approaches

被引:30
作者
Böcker, A
Schneider, G
Teekentrup, A [1 ]
机构
[1] Boehringer Ingelheim Pharma GMBH & Co KG, Dept Lead Discovery, D-88397 Biberach, Germany
[2] Univ Frankfurt, Inst Organ Chem & Chem Biol, D-60439 Frankfurt, Germany
来源
QSAR & COMBINATORIAL SCIENCE | 2004年 / 23卷 / 04期
关键词
classification; feature extraction; library design; machine leaning; molecular descriptor; similarity; virtual screening;
D O I
10.1002/qsar.200330860
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
High-throughput screening of large compound collections results in large sets of data. This review gives an overview of the most frequently employed computational techniques for the analysis of such data and the establishment of first QSAR models. Various methods for descriptor selection, classification and data mining are discussed. Recent trends include the application of kernel-based machine learning methods for the design of focused libraries and compilation of target-family biased compound collections.
引用
收藏
页码:207 / 213
页数:7
相关论文
共 114 条
[1]   A self-organizing principle for learning nonlinear manifolds [J].
Agrafiotis, DK ;
Xu, HF .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (25) :15869-15872
[2]   A fractal approach for selecting an appropriate bin size for cell-based diversity estimation [J].
Agrafiotis, DK ;
Rassokhin, DN .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2002, 42 (01) :117-122
[3]   Feature selection for structure-activity correlation using binary particle swarms [J].
Agrafiotis, DK ;
Cedeño, W .
JOURNAL OF MEDICINAL CHEMISTRY, 2002, 45 (05) :1098-1107
[4]  
[Anonymous], MACCS KEYS
[5]   Integration of virtual and high-throughput screening [J].
Bajorath, F .
NATURE REVIEWS DRUG DISCOVERY, 2002, 1 (11) :882-894
[6]  
BALAKHIN KV, 2003, J COMP COMP GROUP, V43
[7]   Property-based design of GPCR-targeted library [J].
Balakin, KV ;
Tkachenko, SE ;
Lang, SA ;
Okun, I ;
Ivashchenko, AA ;
Savchuk, NP .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2002, 42 (06) :1332-1342
[8]   CLUSTERING OF CHEMICAL STRUCTURES ON THE BASIS OF 2-DIMENSIONAL SIMILARITY MEASURES [J].
BARNARD, JM ;
DOWNS, GM .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1992, 32 (06) :644-649
[9]   Binning schemes for partition-based compound selection [J].
Bayley, MJ ;
Willett, P .
JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 1999, 17 (01) :10-18
[10]   Comparison of chemical databases:: Analysis of molecular diversity with Self Organising Maps (SOM) [J].
Bernard, P ;
Golbraikh, A ;
Kireev, D ;
Chrétien, JR ;
Rozhkova, N .
ANALUSIS, 1998, 26 (08) :333-341