Improving the interpretation of multivariate and rule induction models by using a peak parameter representation

Cited by: 16
Authors
Alsberg, BK
Winson, MK
Kell, DB
Affiliations
[1] Institute of Biological Sciences, University of Wales, Aberystwyth
Keywords
multivariate calibration; partial least squares; rule induction; infrared spectra; Gaussian and Lorentzian parameters; non-linear curve fitting; functional representation
DOI
10.1016/S0169-7439(97)00024-5
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper demonstrates that the interpretation of multivariate calibration and rule induction classification models can be significantly improved by adopting a new representation of data profiles (e.g., spectra and chromatograms) containing identifiable peaks. The new representation is based on estimating Gaussian or Lorentzian curve parameters of data profiles by non-linear curve fitting. All modelling is performed on these peak parameters rather than using the traditional approach where each variable is assigned a sampling point in the data profile. Loading weight plots from the multivariate methods and decision trees obtained from rule induction algorithms become more parsimonious and easier to interpret in terms of the new representation.
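The peak-parameter idea in the abstract can be illustrated with a minimal sketch: fit one Gaussian peak to a sampled profile and keep only its three parameters (amplitude, center, width) as the variables for later modelling. The abstract does not describe the fitting routine used in the paper, so everything here is an assumption made for illustration: the names `gaussian`, `solve3`, and `fit_gaussian` are hypothetical, the optimizer is a hand-written damped Gauss-Newton (Levenberg-Marquardt style) loop, and the data are synthetic and noise-free.

```python
import math


def gaussian(x, amp, center, width):
    """Single Gaussian peak; this parameterization is an assumption."""
    return amp * math.exp(-((x - center) ** 2) / (2.0 * width ** 2))


def solve3(a, b):
    """Solve a 3x3 linear system by Gaussian elimination with partial pivoting."""
    n = 3
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = s / m[i][i]
    return x


def fit_gaussian(xs, ys, guess, iterations=50, damping=1e-3):
    """Estimate (amp, center, width) by damped Gauss-Newton least squares."""
    params = list(guess)

    def sse(p):
        return sum((y - gaussian(x, *p)) ** 2 for x, y in zip(xs, ys))

    current = sse(params)
    for _ in range(iterations):
        amp, center, width = params
        jtj = [[0.0] * 3 for _ in range(3)]
        jtr = [0.0] * 3
        for x, y in zip(xs, ys):
            e = math.exp(-((x - center) ** 2) / (2.0 * width ** 2))
            # Partial derivatives of the model w.r.t. amp, center, width.
            grad = [
                e,
                amp * e * (x - center) / width ** 2,
                amp * e * (x - center) ** 2 / width ** 3,
            ]
            residual = y - amp * e
            for i in range(3):
                jtr[i] += grad[i] * residual
                for j in range(3):
                    jtj[i][j] += grad[i] * grad[j]
        for i in range(3):
            jtj[i][i] *= 1.0 + damping  # Marquardt-style diagonal damping
        step = solve3(jtj, jtr)
        trial = [p + s for p, s in zip(params, step)]
        # Accept the step only if the width stays positive and the fit improves.
        if trial[2] > 0 and sse(trial) < current:
            params, current = trial, sse(trial)
            damping = max(damping / 10.0, 1e-12)
        else:
            damping *= 10.0
    return params


# Synthetic profile: 100 sampling points reduced to 3 peak parameters.
xs = [0.1 * i for i in range(100)]
ys = [gaussian(x, 2.0, 5.0, 0.8) for x in xs]
fitted = fit_gaussian(xs, ys, guess=(1.5, 4.5, 1.0))
print(fitted)
```

In this sketch the 100 sampled intensities collapse to three numbers, which is the sense in which a peak-parameter representation is more parsimonious than one variable per sampling point. A real spectrum would need a sum of several peaks, possibly Lorentzian ones, plus a strategy for choosing initial guesses; none of that is covered here.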
Pages: 95-109
Number of pages: 15