Variable selection in discriminant partial least-squares analysis

Cited by: 99
Authors
Alsberg, BK [1]
Kell, DB [1]
Goodacre, R [1]
Affiliation
[1] Univ Coll Aberystwyth, Inst Biol Sci, Aberystwyth SY23 3DD, Dyfed, Wales
DOI
10.1021/ac980506o
Chinese Library Classification number
O65 [Analytical Chemistry]
Discipline classification code
070302; 081704
Abstract
Variable selection enhances the understanding and interpretability of multivariate classification models. A new chemometric method based on the selection of the most important variables in discriminant partial least-squares (VS-DPLS) analysis is described. The suggested method is a simple extension of DPLS in which only a small number of elements of the weight vector w are retained for each factor. The optimal number of DPLS factors is determined by cross-validation. The new algorithm is applied to four different high-dimensional spectral data sets with excellent results. Spectral profiles from Fourier transform infrared spectroscopy and pyrolysis mass spectrometry are used. To investigate the uniqueness of the selected variables, an iterative VS-DPLS procedure is performed. At each iteration, the previously selected variables are removed to see whether a new VS-DPLS classification model can be constructed using a different set of variables. In this manner, it is possible to determine regions, rather than individual variables, that are important for a successful classification.
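The two ideas in the abstract, truncating the weight vector and re-running the selection after removing earlier picks, can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a two-class problem coded as -1/+1, column-centered data, a PLS1-style weight vector w = X^T y, and retention of the k elements of largest absolute value (the abstract does not state the retention rule). It covers a single factor only and omits deflation and cross-validation. The function names truncate_weights, sparse_factor, and iterative_selection are hypothetical.

```python
import math


def truncate_weights(w, k):
    """Keep the k largest-magnitude entries of w, zero the rest, rescale to unit length.

    Assumption: "retained" means largest absolute value; the source does not say.
    """
    keep = sorted(range(len(w)), key=lambda j: abs(w[j]), reverse=True)[:k]
    kept = set(keep)
    sparse = [w[j] if j in kept else 0.0 for j in range(len(w))]
    norm = math.sqrt(sum(v * v for v in sparse))
    if norm > 0.0:
        sparse = [v / norm for v in sparse]
    return sparse, sorted(keep)


def sparse_factor(X, y, k):
    """Compute one PLS1-style factor with a truncated weight vector.

    X is a list of centered sample rows, y a list of class codes (-1 or +1).
    Returns the sparse weights, the sample scores, and the selected column indices.
    """
    n_vars = len(X[0])
    # Covariance-like weights: w_j = sum_i X[i][j] * y[i]
    w = [sum(X[i][j] * y[i] for i in range(len(X))) for j in range(n_vars)]
    w, selected = truncate_weights(w, k)
    # Scores: projection of each sample onto the sparse weight vector
    t = [sum(row[j] * w[j] for j in range(n_vars)) for row in X]
    return w, t, selected


def iterative_selection(X, y, k, n_rounds):
    """Select k variables, remove them, and select again from what remains."""
    remaining = list(range(len(X[0])))
    rounds = []
    for _ in range(n_rounds):
        if len(remaining) < k:
            break
        X_sub = [[row[j] for j in remaining] for row in X]
        _, _, selected = sparse_factor(X_sub, y, k)
        chosen = [remaining[j] for j in selected]  # map back to original indices
        rounds.append(chosen)
        remaining = [j for j in remaining if j not in chosen]
    return rounds
```

When successive rounds return neighbouring column indices, this points to a discriminating spectral region rather than a single variable, which is the interpretation the abstract describes. A fuller version would deflate X and y after each factor and choose the number of factors by cross-validation.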
Pages: 4126-4133
Page count: 8