TROP-ELM: A double-regularized ELM using LARS and Tikhonov regularization

被引:260
作者
Miche, Yoan [1 ,2 ]
van Heeswijk, Mark [1 ]
Bas, Patrick [2 ]
Simula, Olli [1 ]
Lendasse, Amaury [1 ]
机构
[1] Aalto Univ, Dept Informat & Comp Sci, Sch Sci & Technol, FI-00076 Aalto, Finland
[2] Gipsa Lab, F-38402 Grenoble, France
关键词
ELM; Regularization; Ridge regression; Tikhonov regularization; LARS; OP-ELM;
D O I
10.1016/j.neucom.2010.12.042
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
In this paper an improvement of the optimally pruned extreme learning machine (OP-ELM) in the form of a L-2 regularization penalty applied within the OP-ELM is proposed. The OP-ELM originally proposes a wrapper methodology around the extreme learning machine (ELM) meant to reduce the sensitivity of the ELM to irrelevant variables and obtain more parsimonious models thanks to neuron pruning. The proposed modification of the OP-ELM uses a cascade of two regularization penalties: first a L-1 penalty to rank the neurons of the hidden layer, followed by a L-2 penalty on the regression weights (regression between hidden layer and output layer) for numerical stability and efficient pruning of the neurons. The new methodology is tested against state of the art methods such as support vector machines or Gaussian processes and the original ELM and OP-ELM, on 11 different data sets; it systematically outperforms the OP-ELM (average of 27% better mean square error) and provides more reliable results in terms of standard deviation of the results - while remaining always less than one order of magnitude slower than the OP-ELM. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:2413 / 2421
页数:9
相关论文
共 35 条
[1]
RELATIONSHIP BETWEEN VARIABLE SELECTION AND DATA AUGMENTATION AND A METHOD FOR PREDICTION [J].
ALLEN, DM .
TECHNOMETRICS, 1974, 16 (01) :125-127
[2]
[Anonymous], 1963, Soviet Math
[3]
[Anonymous], 1976, 28 STANF U DIV BIOST
[4]
[Anonymous], 1994, Neural networks: a comprehensive foundation
[5]
MINIMAX ESTIMATION OF A MULTIVARIATE NORMAL MEAN UNDER ARBITRARY QUADRATIC LOSS [J].
BERGER, J .
JOURNAL OF MULTIVARIATE ANALYSIS, 1976, 6 (02) :256-264
[6]
Bishop CM., 1995, NEURAL NETWORKS PATT
[7]
LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[8]
Regularized Extreme Learning Machine [J].
Deng, Wanyu ;
Zheng, Qinghua ;
Chen, Lin .
2009 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING, 2009, :389-395
[9]
Least angle regression - Rejoinder [J].
Efron, B ;
Hastie, T ;
Johnstone, I ;
Tibshirani, R .
ANNALS OF STATISTICS, 2004, 32 (02) :494-499
[10]
*EIML GROUP, 2009, OP ELM TOOLB