A model selection algorithm for a Posteriori probability estimation with neural networks

被引:29
作者
Arribas, JI [1 ]
Cid-Sueiro, J
机构
[1] Univ Valladolid, Dept Teor Denal & Comun & Ingn Telemat, Valladolid 47011, Spain
[2] Univ Carlos 3 Madris, Dept Signal Theory & Commun, Leganes 28911, Madrid, Spain
来源
IEEE TRANSACTIONS ON NEURAL NETWORKS | 2005年 / 16卷 / 04期
关键词
expectation-maximization; model selection; neural network (NN); objective function; posterior probability; regularization;
D O I
10.1109/TNN.2005.849826
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a novel algorithm to jointly determine the structure and the parameters of a posteriori probability model based on neural networks (NNs). It makes use of well-known ideas of pruning, splitting, and merging neural components and takes advantage of the probabilistic interpretation of these components. The algorithm, so called a posteriori probability model selection (PPMS), is applied to an NN architecture called the generalized softmax perceptron (GSP) whose outputs can be understood as probabilities although results shown can be extended to more general network architectures. Learning rules are derived from the application of the expectation-maximization algorithm to the GSP-PPMS structure. Simulation results show the advantages of the proposed algorithm with respect to other schemes.
引用
收藏
页码:799 / 809
页数:11
相关论文
共 49 条
[1]   Partial likelihood for signal processing [J].
Adali, T ;
Ni, HM .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2003, 51 (01) :204-212
[2]  
AKAIKE H, 1974, ANN I STAT MATH, V22, P202
[3]   Growing Gaussian mixtures network for classification applications [J].
Alba, JL ;
Docío, L ;
Docampo, D ;
Márquez, OW .
SIGNAL PROCESSING, 1999, 76 (01) :43-60
[4]   Asymptotic statistical theory of overtraining and cross-validation [J].
Amari, S ;
Murata, N ;
Muller, KR ;
Finke, M ;
Yang, HH .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1997, 8 (05) :985-996
[5]   BACKPROPAGATION AND STOCHASTIC GRADIENT DESCENT METHOD [J].
AMARI, S .
NEUROCOMPUTING, 1993, 5 (4-5) :185-196
[6]   Model selection in neural networks [J].
Anders, U ;
Korn, O .
NEURAL NETWORKS, 1999, 12 (02) :309-323
[7]  
ARRIBAS JI, 2005, NEURAL SIGNAL PROCES
[8]  
ARRIBAS JI, 1999, NNSP, V9, P263
[9]  
ARRIBAS JI, 2001, THESIS U VALLADOLID
[10]  
BAUM EB, 1988, ADV NEURAL INFORMATI, P52