Computational methods in developing quantitative structure-activity relationships (QSAR):: A review

被引:298
作者
Dudek, AZ
Arodz, T
Gálvez, J
机构
[1] Univ Minnesota, Sch Med, Div Hematol Oncol & Transplantat, Minneapolis, MN 55455 USA
[2] AGH Univ Sci & Technol, Inst Comp Sci, PL-30059 Krakow, Poland
[3] Univ Valencia, Unit Drug Design & Mol Connect Res, E-46100 Burjassot, Valencia, Spain
关键词
QSAR; molecular descriptors; feature selection; machine learning;
D O I
10.2174/138620706776055539
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Virtual filtering and screening of combinatorial libraries have recently gained attention as methods complementing the high-throughput screening and combinatorial chemistry. These chemoinformatic techniques rely heavily on quantitative structure-activity relationship (QSAR) analysis, a field with established methodology and successful history. In this review, we discuss the computational methods for building QSAR models. We start with outlining their usefulness in high-throughput screening and identifying the general scheme of a QSAR model. Following, we focus on the methodologies in constructing three main components of QSAR model, namely the methods for describing the molecular structure of compounds. for selection of informative descriptors and for activity prediction. We present both the well-established methods as well as techniques recently introduced into the QSAR domain.
引用
收藏
页码:213 / 228
页数:16
相关论文
共 162 条
[11]   SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation [J].
Blewitt, Marnie E. ;
Gendrel, Anne-Valerie ;
Pang, Zhenyi ;
Sparrow, Duncan B. ;
Whitelaw, Nadia ;
Craig, Jeffrey M. ;
Apedaile, Anwyn ;
Hilton, Douglas J. ;
Dunwoodie, Sally L. ;
Brockdorff, Neil ;
Kay, Graham F. ;
Whitelaw, Emma .
NATURE GENETICS, 2008, 40 (05) :663-669
[12]  
Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
[13]   MS-WHIM, new 3D theoretical descriptors derived from molecular surface properties: A comparative 3D QSAR study in a series of steroids [J].
Bravi, G ;
Gancia, E ;
Mascagni, P ;
Pegna, M ;
Todeschini, R ;
Zaliani, A .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 1997, 11 (01) :79-92
[14]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[15]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[16]   Drug design by machine learning: support vector machines for pharmaceutical data analysis [J].
Burbidge, R ;
Trotter, M ;
Buxton, B ;
Holden, S .
COMPUTERS & CHEMISTRY, 2001, 26 (01) :5-14
[17]   MOLECULAR-IDENTIFICATION NUMBER FOR SUBSTRUCTURE SEARCHES [J].
BURDEN, FR .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1989, 29 (03) :225-227
[18]   A tutorial on Support Vector Machines for pattern recognition [J].
Burges, CJC .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167
[19]   SVM-based feature selection for characterization of focused compound collections [J].
Byvatov, E ;
Schneider, G .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (03) :993-999