Low Data Drug Discovery with One-Shot Learning

被引:488
作者
Altae-Tran, Han [1 ]
Ramsundar, Bharath [2 ]
Pappu, Aneesh S. [2 ]
Pande, Vijay [3 ]
机构
[1] MIT, Dept Biol Engn, Cambridge, MA 02139 USA
[2] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
[3] Stanford Univ, Dept Chem, Stanford, CA 94305 USA
关键词
D O I
10.1021/acscentsci.6b00367
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Recent advances in machine learning have made significant contributions to drug discovery. Deep neural networks in particular have been demonstrated to provide significant boosts in predictive power when inferring the properties and activities of small-molecule compounds (Ma, J. et al. J. Chem. Inf. Model. 2015, 55, 263-274). However, the applicability of these techniques has been limited by the requirement for large amounts of training data. In this work, we demonstrate how one-shot learning can be used to significantly lower the amounts of data required to make meaningful predictions in drug discovery applications. We introduce a new architecture, the iterative refinement long short-term memory, that, when combined with graph convolutional neural networks, significantly improves learning of meaningful distance metrics over small-molecules. We open source all models introduced in this work as part of DeepChem, an open-source framework for deep-learning in drug discovery.
引用
收藏
页码:283 / 293
页数:11
相关论文
共 29 条
[11]  
[Anonymous], 2015, ARXIV150202072
[12]  
Deng L, 2013, INT CONF ACOUST SPEE, P8599, DOI 10.1109/ICASSP.2013.6639344
[13]  
Duvenaudt D, 2015, ADV NEUR IN, V28
[14]  
Graves A, 2013, 2013 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), P273, DOI 10.1109/ASRU.2013.6707742
[15]   Identity Mappings in Deep Residual Networks [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 :630-645
[16]   Molecular graph convolutions: moving beyond fingerprints [J].
Kearnes, Steven ;
McCloskey, Kevin ;
Berndl, Marc ;
Pande, Vijay ;
Riley, Patrick .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2016, 30 (08) :595-608
[17]  
Kingma DP., 2014, ARXIV14126980, p1412.6980, DOI DOI 10.1145/1830483.1830503
[18]   The SIDER database of drugs and side effects [J].
Kuhn, Michael ;
Letunic, Ivica ;
Jensen, Lars Juhl ;
Bork, Peer .
NUCLEIC ACIDS RESEARCH, 2016, 44 (D1) :D1075-D1079
[19]   Human-level concept learning through probabilistic program induction [J].
Lake, Brenden M. ;
Salakhutdinov, Ruslan ;
Tenenbaum, Joshua B. .
SCIENCE, 2015, 350 (6266) :1332-1338
[20]   Deep Architectures and Deep Learning in Chemoinformatics: The Prediction of Aqueous Solubility for Drug-Like Molecules [J].
Lusci, Alessandro ;
Pollastri, Gianluca ;
Baldi, Pierre .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2013, 53 (07) :1563-1575