Nonnegative matrix factorization with constrained second-order optimization

被引:112
作者
Zdunek, Rafal [1 ]
Cichocki, Andrzej
机构
[1] RIKEN, Brain Sci Inst, Lab Adv Brain Signal Proc, Wako, Saitama 3510198, Japan
[2] Wroclaw Univ Technol, Inst Telecommun Teleinformat & Acoust, Wroclaw, Poland
[3] Warsaw Univ Technol, Warsaw, Poland
关键词
nonnegative matrix factorization; blind source separation; quasi-Newton method; GPCG; second-order optimization; fixed-point algorithm;
D O I
10.1016/j.sigpro.2007.01.024
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 [电气工程]; 0809 [电子科学与技术];
摘要
Nonnegative matrix factorization (NMF) solves the following problem: find notinegative matrices A epsilon R-+(MxR) and X epsilon R-+(RxT) such that Y congruent to AX, given only Y epsilon R-MxT and the assigned index R. This method has found a wide spectrum of applications in signal and image processing, such as blind source separation (BSS), spectra recovering, pattern recognition, segmentation or clustering. Such a factorization is usually performed with an alternating gradient descent technique that is applied to the squared Euclidean distance or Kullback-Leibler divergence. This approach has been used in the widely known Lee-Seung NMF algorithms that belong to a class of multiplicative iterative algorithms. It is well known that these algorithms, in spite of their low complexity, are slowly convergent, give only a strictly positive solution, and can easily fall into local minima of a nonconvex cost function. In this paper, we propose to take advantage of the second-order terms of a cost function to overcome the disadvantages of gradient (multiplicative) algorithms. First, a projected quasi-Newton method is presented, where a regularized Hessian with the Levenberg-Marquardt approach is inverted with the Q-less QR decomposition. Since the matrices A and/or X are usually sparse, a more sophisticated hybrid approach based on the gradient projection conjugate gradient (GPCG) algorithm, which was invented by More and Toraldo, is adapted for NMF. The gradient projection (GP) method is exploited to find zero-value components (active), and then the Newton steps are taken only to compute positive components (inactive) with the conjugate gradient (CG) method. As a cost function, we used the a-divergence that unifies many well-known cost functions. We applied our new NMF method to a BSS problem with mixed signals and images. The results demonstrate the high robustness of our method. (C) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:1904 / 1916
页数:13
相关论文
共 45 条
[1]
Ahn J.-H., 2004, ACCV
[2]
Amari S., 1985, DIFFERENTIAL GEOMETR
[3]
[Anonymous], BMC BIOINFORMATICS
[4]
[Anonymous], 1988, GOODNESS FIT STAT DI
[5]
Bardsley JM, 2005, ELECTRON T NUMER ANA, V20, P139
[6]
A nonnegatively constrained convex programming method for image reconstruction [J].
Bardsley, JM ;
Vogel, CR .
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2003, 25 (04) :1326-1343
[7]
Algorithms and applications for approximate nonnegative matrix factorization [J].
Berry, Michael W. ;
Browne, Murray ;
Langville, Amy N. ;
Pauca, V. Paul ;
Plemmons, Robert J. .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 52 (01) :155-173
[8]
Metagenes and molecular pattern discovery using matrix factorization [J].
Brunet, JP ;
Tamayo, P ;
Golub, TR ;
Mesirov, JP .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (12) :4164-4169
[9]
Accelerating the EMML algorithm and related iterative algorithms by rescaled block-iterative methods [J].
Byrne, CL .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 1998, 7 (01) :100-109
[10]
Nonnegative features of spectro-temporal sounds for classification [J].
Cho, YC ;
Choi, SJ .
PATTERN RECOGNITION LETTERS, 2005, 26 (09) :1327-1336