Potential applications of functional data analysis in chemometrics

被引:45
作者
Saeys, Wouter [1 ]
De Ketelaere, Bart [1 ,2 ]
Darius, Paul [1 ,2 ]
机构
[1] Katholieke Univ Leuven, BIOSYST MeBio S, Dept Biosyst, Div Mech Biostat & Sensors, B-3001 Heverlee, Belgium
[2] Leuven Stat Res Ctr LStat, B-3001 Heverlee, Belgium
关键词
functional data analysis; B-splines; PLS; functional regression; FANOVA;
D O I
10.1002/cem.1129
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In spectroscopy the measured spectra are typically plotted as a function of the wavelength (or wavenumber), but analysed with multivariate data analysis techniques (multiple linear regression (MLR), principal components regression (PCR), partial least squares (PLS)) which consider the spectrum as a set of m different variables. From a physical point of view it could be more informative to describe the spectrum as a function rather than as a set of points, hereby taking into account the physical background of the spectrum, being a sum of absorption peaks for the different chemical components, where the absorbance at two wavelengths close to each other is highly correlated. In a first part of this contribution, a motivating example for this functional approach is given. In a second part, the potential of functional data analysis is discussed in the field of chemometrics and compared to the ubiquitous PLS regression technique using two practical data sets. it is shown that for spectral data, the use of B-splines proves to be an appealing basis to accurately describe the data. By applying both functional data analysis and PLS on the data sets the predictive ability of functional data analysis is found to be comparable to that of PLS. Moreover, many chemometric datasets have some specific structure (e.g. replicate measurements, on the same object or objects that are grouped), but the structure is often removed before analysis (e.g. by averaging the replicates). In the second part of this contribution, we suggest a method to adapt traditional analysis of variance (ANOVA) methods to datasets with spectroscopic data. In particular, the possibilities to explore and interpret sources of variation, such as variations in sample and ambient temperature, are examined. Copyright (C) 2008 John Wiley & Sons, Ltd.
引用
收藏
页码:335 / 344
页数:10
相关论文
共 27 条
[1]   REPRESENTATION OF SPECTRA BY CONTINUOUS-FUNCTIONS [J].
ALSBERG, BK .
JOURNAL OF CHEMOMETRICS, 1993, 7 (03) :177-193
[2]   An introduction to wavelet transforms for chemometricians: A time-frequency approach [J].
Alsberg, BK ;
Woodward, AM ;
Kell, DB .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1997, 37 (02) :215-239
[3]  
[Anonymous], 1989, MULTIVARIATE CALIBRA
[4]  
Beer A., 1852, Ann. der Phys. und Chemie, V86, P78, DOI [DOI 10.1002/ANDP.18521620505, 10.1002/andp.18521620505]
[5]  
BURNS DA, 1992, HDB NEAR INFRARED AN, P681
[6]   An anova test for functional data [J].
Cuevas, A ;
Febrero, M ;
Fraiman, R .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2004, 47 (01) :111-122
[7]   Flexible smoothing with B-splines and penalties [J].
Eilers, PHC ;
Marx, BD .
STATISTICAL SCIENCE, 1996, 11 (02) :89-102
[8]   A STATISTICAL VIEW OF SOME CHEMOMETRICS REGRESSION TOOLS [J].
FRANK, IE ;
FRIEDMAN, JH .
TECHNOMETRICS, 1993, 35 (02) :109-135
[9]  
GABRIEL KR, 1971, BIOMETRIKA, V58, P453, DOI 10.2307/2334381
[10]   Multivariate curve resolution of wavelet and Fourier compressed spectra [J].
Harrington, PD ;
Rauch, PJ ;
Cai, CS .
ANALYTICAL CHEMISTRY, 2001, 73 (14) :3247-3256