Modeling multi-way data with linearly dependent loadings

被引:82
作者
Bro, Rasmus [1 ]
Harshman, Richard A. [2 ]
Sidiropoulos, Nicholas D. [3 ]
Lundy, Margaret E. [2 ,4 ]
机构
[1] Univ Copenhagen, Dept Food Sci, DK-1958 Frederiksberg C, Denmark
[2] Univ Western Ontario, Dept Psychol, London, ON N6A 5C2, Canada
[3] Dept Elect & Comp Engn, Khania 73100, Greece
[4] Univ Western Ontario, Dept Psychol, London, ON N6A 5C2, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
PARAFAC; PARATUCK2; constrained ALS estimation; uniqueness; linear dependence; equality constraints; multi-mode factor analysis; ANNIHILATION FACTOR-ANALYSIS; PARALLEL FACTOR-ANALYSIS; 3-MODE FACTOR-ANALYSIS; INDIVIDUAL-DIFFERENCES; RANK AUGMENTATION; DELAY SPREAD; CDMA SYSTEMS; CALIBRATION; UNIQUENESS; RESOLUTION;
D O I
10.1002/cem.1206
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A generalization/specialization of the PARAFAC model is developed that improves its properties when applied to multi-way problems involving linearly dependent factors. This model is called PARALIND (PARAllel profiles with LINear Dependences). Linear dependences can arise when the empirical sources of variation being modeled by factors are causally or logically linked during data generation, or circumstantially linked during data collection. For example, this can occur in a chemical context when end products are related to the precursor or in a psychological context when a single stimulus generates two incompatible feelings at once. For such cases, the most theoretically appropriate PARAFAC model has loading vectors that are linearly dependent in at least one mode, and when collinear, are nonunique in the others. However, standard PARAFAC analysis of fallible data will have neither of these features. Instead, latent linear dependences become high surface correlations and any latent nonuniqueness is replaced by a meaningless surface-level 'unique orientation' that optimally fits the particular random noise in that sample. To avoid these problems, any set of components that in theory should be rank deficient are re-expressed in PARALIND as a product of two matrices, one that explicitly represents their dependency relationships and another, with fewer columns, that captures their patterns of variation. To demonstrate the approach, we apply it first to fluorescence spectroscopy (excitation-emission matrices, EEM) data in which concentration values for two analytes covary exactly, and then to flow injection analysis (FIA) data in which subsets of columns are logically constrained to sum to a constant, but differently in each of two modes. In the PARAFAC solutions of the EEM data, all factors are 'unique' but this is only meaningful for two of the factors that are also unique at the latent level. In contrast, the PARALIND solutions directly display the extent and nature of partial nonuniqueness present at the latent level by exhibiting a corresponding partial uniqueness in their recovered loadings. For the FIA data, PARALIND constraints restore latent uniqueness to the concentration estimates. Comparison of the solutions shows that PARALIND more accurately recovers latent structure, presumably because it uses fewer parameters and hence fits less error. Copyright (C) 2009 John Wiley & Sons, Ltd.
引用
收藏
页码:324 / 340
页数:17
相关论文
共 48 条
[1]   Calibration of spectral reaction data [J].
Amrhein, M ;
Srinivasan, B ;
Bonvin, D ;
Schumacher, MM .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1999, 46 (02) :249-264
[2]   On the rank deficiency and rank augmentation of the spectral measurement matrix [J].
Amrhein, M ;
Srinivasan, B ;
Bonvin, D ;
Schumacher, MM .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1996, 33 (01) :17-33
[3]  
AMRHEIN M, 1998, THESIS ECOLE POLYTEC
[4]   The N-way Toolbox for MATLAB [J].
Andersson, CA ;
Bro, R .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2000, 52 (01) :1-4
[5]  
[Anonymous], 1971, Generalized Inverses of Matrices and its Applications
[6]  
BRERETON RG, 1995, CHEMOM INTELL LAB SY, V27, P73
[7]   PARAFAC. Tutorial and applications [J].
Bro, R .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1997, 38 (02) :149-171
[8]  
Bro R, 1998, J CHEMOMETR, V12, P223, DOI 10.1002/(SICI)1099-128X(199807/08)12:4<223::AID-CEM511>3.3.CO
[9]  
2-U
[10]  
Bro R, 1997, J CHEMOMETR, V11, P393, DOI 10.1002/(SICI)1099-128X(199709/10)11:5<393::AID-CEM483>3.0.CO