The inner and outer approaches to the design of recursive neural architectures

Cited by: 6
Authors
Baldi, Pierre [1]
Affiliations
[1] Univ Calif Irvine, Dept Comp Sci, Irvine, CA 92617 USA
Keywords
Deep learning; Recurrent neural networks; Recursive neural networks; Convolutional neural networks; Structured input; Protein secondary structure; Contact map prediction; Deep architectures; Aqueous solubility; Networks
DOI
10.1007/s10618-017-0531-0
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Feedforward neural network architectures work well for numerical data of fixed size, such as images. For variable-size, structured data, such as sequences, d-dimensional grids, trees, and other graphs, recursive architectures must be used. We distinguish two general approaches for the design of recursive architectures in deep learning: the inner approach and the outer approach. The inner approach uses neural networks recursively inside the data graphs, essentially to "crawl" the edges of the graphs in order to compute the final output. It requires acyclic orientations of the underlying graphs. The outer approach uses neural networks recursively outside the data graphs, regardless of their orientation. These neural networks operate orthogonally to the data graph and progressively "fold" or aggregate the input structure to produce the final output. The distinction is illustrated using several examples from the fields of natural language processing, chemoinformatics, and bioinformatics, and is applied to the problem of learning from variable-size sets.
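The contrast drawn in the abstract can be illustrated with a minimal sketch. It is not the paper's architecture: the scalar node states, the `tanh` units, the sum aggregation, the constants `W_IN`, `W_STATE`, and `W_NEIGH`, and the function names `inner_chain` and `outer_fold` are all assumptions chosen for illustration. The inner function reuses one unit along an oriented chain; the outer function stacks shared layers on top of an unoriented graph and then folds the node states into a single output.

```python
import math

# Hypothetical fixed weights standing in for trained parameters.
W_IN, W_STATE, W_NEIGH = 0.5, 0.8, 0.3


def inner_chain(xs):
    """Inner approach on a sequence: one shared unit crawls the oriented chain."""
    h = 0.0  # initial state at the source of the chain
    for x in xs:
        # The same unit is reused at every node, following the edge direction.
        h = math.tanh(W_IN * x + W_STATE * h)
    return h  # the state at the last node serves as the output


def outer_fold(xs, edges, layers=2):
    """Outer approach: shared layers stacked orthogonally to an unoriented graph."""
    neighbors = {i: [] for i in range(len(xs))}
    for a, b in edges:  # edge direction is ignored
        neighbors[a].append(b)
        neighbors[b].append(a)
    state = list(xs)
    for _ in range(layers):
        # Each layer recomputes every node from its own and its neighbors' states.
        state = [
            math.tanh(W_IN * state[i] + W_NEIGH * sum(state[j] for j in neighbors[i]))
            for i in range(len(state))
        ]
    return math.tanh(sum(state))  # fold all node states into one output


if __name__ == "__main__":
    seq = [1.0, 2.0, 3.0]
    print(inner_chain(seq))
    print(outer_fold(seq, [(0, 1), (1, 2)]))
```

Both functions accept inputs of any length. In this sketch, reversing every edge passed to `outer_fold` leaves its result unchanged, whereas `inner_chain` depends on the order in which the chain is traversed.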
Pages: 218-230
Page count: 13