Structural modelling with sparse kernels

被引：54

作者：

Gunn, SR ^{[1
]}

Kandola, JS ^{[1
]}

机构：

[1] Univ Southampton, Dept Elect & Comp Sci, ISIS Res Grp, Southampton SO9 5NH, Hants, England

来源：

MACHINE LEARNING | 2002年 / 48卷 / 1-3期

关键词：

Kernel methods; transparency; model interpretability; sparse structure; ANOVA;

D O I：

10.1023/A:1013903804720

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A widely acknowledged drawback of many statistical modelling techniques, commonly used in machine learning, is that the resulting model is extremely difficult to interpret. A number of new concepts and algorithms have been introduced by researchers to address this problem. They focus primarily on determining which inputs are relevant in predicting the output. This work describes a transparent, advanced non-linear modelling approach that enables the constructed predictive models to be visualised, allowing model validation and assisting in interpretation. The technique combines the representational advantage of a sparse ANOVA decomposition, with the good generalisation ability of a kernel machine. It achieves this by employing two forms of regularisation: a 1-norm based structural regulariser to enforce transparency, and a 2-norm based regulariser to control smoothness. The resulting model structure can be visualised showing the overall effects of different inputs, their interactions, and the strength of the interactions. The robustness of the technique is illustrated using a range of both artifical and "real world" datasets. The performance is compared to other modelling techniques, and it is shown to exhibit competitive generalisation performance together with improved interpretability.

引用

页码：137 / 163

页数：27

共 48 条

[1]

[Anonymous], 1923, LECT CAUCHYS PROBLEM

[2]

[Anonymous], 1961, Adaptive Control Processes: a Guided Tour, DOI DOI 10.1515/9781400874668

[3]

[Anonymous], 1998, ISIS198

[4]

[Anonymous], 1995, THESIS STANFORD U

[5]

[Anonymous], 1989, Maximum Entropy and Bayesian Methods

[6]

[Anonymous], GRAPHICAL MODELS

[7] THEORY OF REPRODUCING KERNELS [J].

ARONSZAJN, N .

TRANSACTIONS OF THE AMERICAN MATHEMATICAL SOCIETY, 1950, 68 (MAY) :337-404

[8]

Bishop C. M., 1995, NEURAL NETWORKS PATT

[9]

Blake C.L., 1998, UCI repository of machine learning databases

[10]

Breiman L., 1984, BIOMETRICS, DOI DOI 10.2307/2530946

← 1 2 3 4 5 →