Structural modelling with sparse kernels

被引:54
作者
Gunn, SR [1 ]
Kandola, JS [1 ]
机构
[1] Univ Southampton, Dept Elect & Comp Sci, ISIS Res Grp, Southampton SO9 5NH, Hants, England
关键词
Kernel methods; transparency; model interpretability; sparse structure; ANOVA;
D O I
10.1023/A:1013903804720
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A widely acknowledged drawback of many statistical modelling techniques, commonly used in machine learning, is that the resulting model is extremely difficult to interpret. A number of new concepts and algorithms have been introduced by researchers to address this problem. They focus primarily on determining which inputs are relevant in predicting the output. This work describes a transparent, advanced non-linear modelling approach that enables the constructed predictive models to be visualised, allowing model validation and assisting in interpretation. The technique combines the representational advantage of a sparse ANOVA decomposition, with the good generalisation ability of a kernel machine. It achieves this by employing two forms of regularisation: a 1-norm based structural regulariser to enforce transparency, and a 2-norm based regulariser to control smoothness. The resulting model structure can be visualised showing the overall effects of different inputs, their interactions, and the strength of the interactions. The robustness of the technique is illustrated using a range of both artifical and "real world" datasets. The performance is compared to other modelling techniques, and it is shown to exhibit competitive generalisation performance together with improved interpretability.
引用
收藏
页码:137 / 163
页数:27
相关论文
共 48 条
[1]  
[Anonymous], 1923, LECT CAUCHYS PROBLEM
[2]  
[Anonymous], 1961, Adaptive Control Processes: a Guided Tour, DOI DOI 10.1515/9781400874668
[3]  
[Anonymous], 1998, ISIS198
[4]  
[Anonymous], 1995, THESIS STANFORD U
[5]  
[Anonymous], 1989, Maximum Entropy and Bayesian Methods
[6]  
[Anonymous], GRAPHICAL MODELS
[7]   THEORY OF REPRODUCING KERNELS [J].
ARONSZAJN, N .
TRANSACTIONS OF THE AMERICAN MATHEMATICAL SOCIETY, 1950, 68 (MAY) :337-404
[8]  
Bishop C. M., 1995, NEURAL NETWORKS PATT
[9]  
Blake C.L., 1998, UCI repository of machine learning databases
[10]  
Breiman L., 1984, BIOMETRICS, DOI DOI 10.2307/2530946