Multiple factor analysis: principal component analysis for multitable and multiblock data sets

被引:336
作者
Abdi, Herve [1 ]
Williams, Lynne J. [2 ]
Valentin, Domininique [3 ]
机构
[1] Univ Texas Dallas, Dept Brain & Behav Sci, Richardson, TX 75080 USA
[2] Baycrest, Rotman Res Inst, Toronto, ON, Canada
[3] Univ Bourgogne, Dijon, France
关键词
multiple factor analysis (MFA); multiple factorial analysis; multiblock correspondence analysis; STATIS; INDSCAL; multiblock barycentric discriminant analysis (MUDICA); multiple factor analysis barycentric discriminant analysis (MUFABADA); barycentric discriminant analysis (BADA); generalized Procrustes analysis (GPA); generalized singular value decomposition; principal component analysis; consensus PCA; multitable PCA; multiblock PCA;
D O I
10.1002/wics.1246
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Multiple factor analysis (MFA, also called multiple factorial analysis) is an extension of principal component analysis (PCA) tailored to handle multiple data tables that measure sets of variables collected on the same observations, or, alternatively, (in dual-MFA) multiple data tables where the same variables are measured on different sets of observations. MFA proceeds in two steps: First it computes a PCA of each data table and 'normalizes' each data table by dividing all its elements by the first singular value obtained from its PCA. Second, all the normalized data tables are aggregated into a grand data table that is analyzed via a (non-normalized) PCA that gives a set of factor scores for the observations and loadings for the variables. In addition, MFA provides for each data table a set of partial factor scores for the observations that reflects the specific 'view-point' of this data table. Interestingly, the common factor scores could be obtained by replacing the original normalized data tables by the normalized factor scores obtained from the PCA of each of these tables. In this article, we present MFA, review recent extensions, and illustrate it with a detailed example. (C) 2013 Wiley Periodicals, Inc.
引用
收藏
页码:149 / 179
页数:31
相关论文
共 157 条
[21]   Principal component analysis [J].
Abdi, Herve ;
Williams, Lynne J. .
WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2010, 2 (04) :433-459
[22]   Partial least squares regression and projection on latent structure regression (PLS Regression) [J].
Abdi, Herve .
WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2010, 2 (01) :97-106
[23]   How to compute reliability estimates and display confidence and tolerance intervals for pattern classifiers using the Bootstrap and 3-way multidimensional scaling (DISTATIS) [J].
Abdi, Herve ;
Dunlop, Joseph P. ;
Williams, Lynne J. .
NEUROIMAGE, 2009, 45 (01) :89-95
[24]  
Abdi Herve, 2007, ENCYCLO PEDIA MEASUR, V3, P103, DOI DOI 10.4135/9781412952644
[25]   Unsupervised Multiway Data Analysis: A Literature Survey [J].
Acar, Evrim ;
Yener, Buelent .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (01) :6-20
[26]  
[Anonymous], 2016, ANAL FACTORIELLES SI
[27]  
[Anonymous], 2007, SELECTED CONTRIBUTIO
[28]  
[Anonymous], [No title captured]
[29]   Testing fidelity to a new psychological intervention for family members of substance misusers during implementation in Italy [J].
Arcidiacono, C. ;
Sarnacchiaro, P. ;
Velleman, R. .
JOURNAL OF SUBSTANCE USE, 2008, 13 (06) :361-381
[30]   Comparison of two sensory profiling techniques based on consumer perception [J].
Ares, Gaston ;
Deliza, Rosires ;
Barreiro, Cecilia ;
Gimenez, Ana ;
Gambaro, Adriana .
FOOD QUALITY AND PREFERENCE, 2010, 21 (04) :417-426