A novel information geometric approach to variable selection in MLP networks

被引:11
作者
Eleuteri, A
Tagliaferri, R
Milano, L
机构
[1] Univ Naples Federico II, Dipartimento Sci Fisiche, I-80126 Naples, Italy
[2] INFN Sez, Naples, Italy
[3] Univ Salerno, DMI, Fisciano, SA, Italy
关键词
information geometry; variable selection; neural networks; Bayesian inference;
D O I
10.1016/j.neunet.2005.01.008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a novel information geometric-based variable selection criterion for multi-layer perceptron networks is described. It is based on projections of the Riemannian manifold defined by a multi-layer perceptron network on submanifolds defined by multi-layer perceptron networks with reduced input dimension. We show how the divergence between models can be used as a criterion for an efficient search in the space of networks with different inputs. Then, we show how the posterior probabilities of the models can be evaluated to rank the projected models. Finally, we test our algorithm on synthetic and real data, and compare its performances with other methods reported in literature. (c) 2005 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1309 / 1318
页数:10
相关论文
共 31 条
[1]   Natural gradient works efficiently in learning [J].
Amari, S .
NEURAL COMPUTATION, 1998, 10 (02) :251-276
[2]   Information geometry of the EM and em algorithms for neural networks [J].
Amari, SI .
NEURAL NETWORKS, 1995, 8 (09) :1379-1408
[3]  
[Anonymous], 2000, METHODS INFORM GEOME
[4]  
Belsley DA, 1980, Regression Diagnostics: Identifying Influential Data and Sources of Collinearity
[5]  
Breiman L., 1998, CLASSIFICATION REGRE
[6]   An iterative pruning algorithm for feedforward neural networks [J].
Castellano, G ;
Fanelli, AM ;
Pelillo, M .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1997, 8 (03) :519-531
[7]   Variable selection in qualitative models via an entropic explanatory power [J].
Dupuis, JA ;
Robert, CP .
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2003, 111 (1-2) :77-94
[8]   Assessing the importance of features for multi-layer perceptrons [J].
Egmont-Petersen, M ;
Talmon, JL ;
Hasman, A ;
Ambergen, AW .
NEURAL NETWORKS, 1998, 11 (04) :623-635
[9]  
ELEUTERI A, 2004, THESIS U STUDI NAPOL
[10]   MULTIVARIATE ADAPTIVE REGRESSION SPLINES [J].
FRIEDMAN, JH .
ANNALS OF STATISTICS, 1991, 19 (01) :1-67