Cost functions to estimate a posteriori probabilities in multiclass problems

被引:53
作者
Cid-Sueiro, J
Arribas, JI
Urbán-Muñoz, S
Figueiras-Vidal, AR
机构
[1] Univ Valladolid, ETSIT, Dept Teor Senal & Comunicac & Ing Telemat, E-47011 Valladolid, Spain
[2] Univ Carlos III Madrid, Dept Tecnol Comunicac, EPS, Leganes Madrid 28911, Spain
来源
IEEE TRANSACTIONS ON NEURAL NETWORKS | 1999年 / 10卷 / 03期
关键词
neural networks; pattern classification; probability estimation;
D O I
10.1109/72.761724
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of designing cost functions to estimate a posteriori probabilities in multiclass problems is addressed in this paper. We establish necessary and sufficient conditions that these costs must satisfy in one-class one-output networks whose outputs are consistent with probability laws. We focus our attention on a particular subset of the corresponding cost functions; those which verify two usually interesting properties: symmetry and separability (well-known cost functions, such as the quadratic cost or the cross entropy are particular cases in this subset). Finally, we present a universal stochastic gradient learning rule for single-layer networks, in the sense of minimizing a general version of these cost functions for a,vide family of nonlinear activation functions.
引用
收藏
页码:645 / 656
页数:12
相关论文
共 30 条
[1]   Conditional distribution learning with neural networks and its application to channel equalization [J].
Adali, T ;
Liu, X ;
Sonmez, MK .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1997, 45 (04) :1051-1064
[2]   BACKPROPAGATION AND STOCHASTIC GRADIENT DESCENT METHOD [J].
AMARI, S .
NEUROCOMPUTING, 1993, 5 (4-5) :185-196
[3]  
[Anonymous], 1968, SER DETECTION ESTIMA
[4]  
BILLA J, 1996, ENG APPL ARTIFICIAL, V9, P203
[5]   PEDAGOGICAL PATTERN SELECTION-STRATEGIES [J].
CACHIN, C .
NEURAL NETWORKS, 1994, 7 (01) :175-181
[6]  
Cichocki A., 1993, Neural Networks for Optimization and Signal Processing
[7]  
CIDSUEIRO J, 1995, P 7 INT THYRRH WORKS, P337
[8]  
Elfadel IM, 1994, ADV NEURAL INFORMATI, V6, P882
[9]  
ELJAROUDI A, 1990, P INT JOINT C NEUR N, V3, P185
[10]  
ELMASRY MI, 1994, VLSI ARTIFICIAL NEUR