ASPECTS OF PSEUDORANK ESTIMATION METHODS BASED ON THE EIGENVALUES OF PRINCIPAL COMPONENT ANALYSIS OF RANDOM MATRICES

被引:41
作者
FABER, NM
BUYDENS, LMC
KATEMAN, G
机构
[1] Department of Analytical Chemistry, University of Nijmegen, 6525 ED Nijmegen
关键词
D O I
10.1016/0169-7439(94)85043-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays, analytical instruments that produce a data matrix for one chemical sample enjoy a widespread popularity. However, for a successful analysis of these data an accurate estimate of the pseudorank of the matrix is often a crucial prerequisite. A large number of methods for estimating the pseudorank are based on the eigenvalues obtained from principal component analysis (PCA). In this paper methods are discussed that exploit the essential similarity between the residuals of PCA of the test data matrix and the elements of a random matrix. In the literature of PCA these methods are commonly denoted as parallel analysis. Attention is paid to several aspects that have to be considered when applying such methods. For some of these aspects asymptotic results can be found in the statistical literature. In this study Monte Carlo simulations are used to investigate the practical implications of these theoretical results. It is shown that for sufficiently large matrices the distribution of the measurement error does not significantly influence the results. Down to a very small signal-to-noise ratio the ratio of the number of rows and the number of columns constitutes the major influence on the expected value of the eigenvalues associated with the residuals. The consequences are illustrated for two functions of the eigenvalues, i.e. the logarithm of the eigenvalues and Malinowski's reduced eigenvalues. Both methods are graphical and have been applied in the past with considerable success for a variety of data. Malinowski's reduced eigenvalues are of special interest since they have been used to construct an F-test. Finally, a modification is proposed for pseudorank estimation methods that are based on the principle of parallel analysis.
引用
收藏
页码:203 / 226
页数:24
相关论文
共 41 条
[1]   MULTIVARIATE SELECTION OF VARIABLES IN INDUSTRIAL QUALITY-CONTROL - OPTIMIZING AVIATION FUEL FINAL CONTROL [J].
ANDRADE, JM ;
PRADA, D ;
MUNIATEGUI, S ;
GOMEZ, B ;
PAN, M .
JOURNAL OF CHEMOMETRICS, 1993, 7 (05) :427-438
[2]  
[Anonymous], 2005, USERS GUIDE PRINCIPA
[3]  
[Anonymous], 1977, CHEMOMETRICS THEORY, DOI DOI 10.1021/BK-1977-0052.CH012
[4]  
[Anonymous], STATISTICIAN
[5]   IMPROVING RELIABILITY OF FACTOR-ANALYSIS OF CHEMICAL DATA BY UTILIZING MEASURED ANALYTICAL UNCERTAINTY [J].
DUEWER, DL ;
KOWALSKI, BR ;
FASCHING, JL .
ANALYTICAL CHEMISTRY, 1976, 48 (13) :2002-2010
[6]   CROSS-VALIDATORY CHOICE OF THE NUMBER OF COMPONENTS FROM A PRINCIPAL COMPONENT ANALYSIS [J].
EASTMENT, HT ;
KRZANOWSKI, WJ .
TECHNOMETRICS, 1982, 24 (01) :73-77
[8]   GENERALIZED RANK ANNIHILATION METHOD .2. BIAS AND VARIANCE IN THE ESTIMATED EIGENVALUES [J].
FABER, NM ;
BUYDENS, LMC ;
KATEMAN, G .
JOURNAL OF CHEMOMETRICS, 1994, 8 (03) :181-203
[9]   STANDARD ERRORS IN THE EIGENVALUES OF A CROSS-PRODUCT MATRIX - THEORY AND APPLICATIONS [J].
FABER, NM ;
BUYDENS, LMC ;
KATEMAN, G .
JOURNAL OF CHEMOMETRICS, 1993, 7 (06) :495-526
[10]  
FABER NM, IN PRESS ANAL CHIMIC