Adaptive online learning algorithms for blind separation: Maximum entropy and minimum mutual information

Cited by: 231
Authors
Yang, HH
Amari, S
Affiliation
[1] Lab. for Information Representation, FRP, RIKEN, Wako-shi, Saitama 351-01
DOI
10.1162/neco.1997.9.7.1457
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
There are two major approaches to blind separation: maximum entropy (ME) and minimum mutual information (MMI). Both can be implemented by the stochastic gradient descent method for obtaining the demixing matrix. The mutual information (MI) is the contrast function for blind separation; the entropy is not. To justify the ME, the relation between ME and MMI is first elucidated by calculating the first derivative of the entropy and proving that mean subtraction is necessary in applying the ME and that, at the solution points determined by the MI, the ME will not update the demixing matrix in directions that increase cross-talk. Second, the natural gradient instead of the ordinary gradient is introduced to obtain efficient algorithms, because the parameter space is a Riemannian space consisting of matrices. The mutual information is calculated by applying the Gram-Charlier expansion to approximate the probability density functions of the outputs. Finally, we propose an efficient learning algorithm that incorporates an adaptive method of estimating the unknown cumulants. Computer simulation shows that the convergence of the stochastic descent algorithms is improved by using the natural gradient and the adaptively estimated cumulants.
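As a rough illustration of the natural-gradient idea mentioned in the abstract, the sketch below implements one online update in the commonly cited form W <- W + eta * (I - phi(y) y^T) W with y = W x. It is not the paper's algorithm: the abstract describes an activation function derived from a Gram-Charlier expansion with adaptively estimated cumulants, whereas this sketch substitutes a fixed nonlinearity (tanh by default, an assumption that suits super-Gaussian sources). The function names natural_gradient_step and separate, the step size eta, and the assumption that the input samples are already zero-mean are all illustrative choices.

```python
import math


def natural_gradient_step(W, x, eta, phi=math.tanh):
    """Return W + eta * (I - phi(y) y^T) W for one sample x, where y = W x.

    W is a square matrix given as a list of rows; x is a zero-mean sample.
    phi is a fixed scalar nonlinearity (an assumption of this sketch).
    """
    n = len(W)
    # Demixed output for this sample: y = W x
    y = [sum(W[i][k] * x[k] for k in range(n)) for i in range(n)]
    f = [phi(v) for v in y]
    # G = I - phi(y) y^T
    G = [[(1.0 if i == j else 0.0) - f[i] * y[j] for j in range(n)]
         for i in range(n)]
    # Natural-gradient direction: G W. This equals the ordinary-gradient
    # direction (W^{-T} - phi(y) x^T) multiplied on the right by W^T W,
    # so no matrix inverse is needed.
    GW = [[sum(G[i][k] * W[k][j] for k in range(n)) for j in range(n)]
          for i in range(n)]
    return [[W[i][j] + eta * GW[i][j] for j in range(n)] for i in range(n)]


def separate(samples, eta=0.01, phi=math.tanh):
    """Run the online update over a sequence of samples, starting from W = I."""
    n = len(samples[0])
    W = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for x in samples:
        W = natural_gradient_step(W, x, eta, phi)
    return W
```

A caller would pass mixed, mean-subtracted observations as a list of vectors to separate and apply the returned matrix to each observation to obtain the estimated sources, which are recovered only up to scaling and permutation.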
Pages: 1457-1482
Number of pages: 26
Related papers
13 entries in total
[1]  
Amari S, 1996, ADV NEUR IN, V8, P757
[2]   A THEORY OF ADAPTIVE PATTERN CLASSIFIERS [J].
AMARI, S .
IEEE TRANSACTIONS ON ELECTRONIC COMPUTERS, 1967, EC16 (03) :299-+
[3]   BACKPROPAGATION AND STOCHASTIC GRADIENT DESCENT METHOD [J].
AMARI, S .
NEUROCOMPUTING, 1993, 5 (4-5) :185-196
[4]  
AMARI S, 1997, ADV NEURAL INFORMATI, V9
[5]  
BARLOW H, 1989, COMP NEUR S, P54
[6]   AN INFORMATION MAXIMIZATION APPROACH TO BLIND SEPARATION AND BLIND DECONVOLUTION [J].
BELL, AJ ;
SEJNOWSKI, TJ .
NEURAL COMPUTATION, 1995, 7 (06) :1129-1159
[7]  
BELL AJ, 1995, P INT S NONL THEOR A, V1, P43
[8]   Equivariant adaptive source separation [J].
Cardoso, JF ;
Laheld, BH .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1996, 44 (12) :3017-3030
[9]  
CICHOCKI A, 1994, ISANN94, P406
[10]   INDEPENDENT COMPONENT ANALYSIS, A NEW CONCEPT [J].
COMON, P .
SIGNAL PROCESSING, 1994, 36 (03) :287-314