Building Predictive Models via Feature Synthesis

被引:67
作者
Arnaldo, Ignacio [1 ]
O'Reilly, Una-May [1 ]
Veeramachaneni, Kalyan [1 ]
机构
[1] MIT, CSAIL, Cambridge, MA 02139 USA
来源
GECCO'15: PROCEEDINGS OF THE 2015 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE | 2015年
关键词
Regression; Feature Synthesis; Feature Subset Selection;
D O I
10.1145/2739480.2754693
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce Evolutionary Feature Synthesis' (EFS), a regression method that generates readable, nonlinear models of small to medium size datasets in seconds. EFS is, to the best of our knowledge, the fastest regression tool based on evolutionary computation reported to date. The feature search involved in the proposed method is composed of two main steps: feature composition and feature subset selection. EFS adopts a bottom-up feature composition strategy that eliminates the need for a symbolic representation of the features and exploits the variable selection process involved in pathwise regularized linear regression to perform the feature subset selection step. The result is a regression method that is competitive against neural networks, and outperforms both linear methods and Multiple Regression Genetic Programming, up to now the best regression tool based on evolutionary computation.
引用
收藏
页码:983 / 990
页数:8
相关论文
共 20 条
[1]  
[Anonymous], 2014, Multiple regression in SPSS | An introduction. Multiple regression in SPSS
[2]  
[Anonymous], NEUR NETW TOOLB
[3]   Multiple Regression Genetic Programming [J].
Arnaldo, Ignacio ;
Krawiec, Krzysztof ;
O'Reilly, Una-May .
GECCO'14: PROCEEDINGS OF THE 2014 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2014, :879-886
[4]   Kaizen Programming [J].
de Melo, Vinicius Veloso .
GECCO'14: PROCEEDINGS OF THE 2014 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2014, :895-902
[5]   Regularization Paths for Generalized Linear Models via Coordinate Descent [J].
Friedman, Jerome ;
Hastie, Trevor ;
Tibshirani, Rob .
JOURNAL OF STATISTICAL SOFTWARE, 2010, 33 (01) :1-22
[6]  
Ganjisaffar Y., 2014, LASSO4J
[7]   Simultaneous Generation of Prototypes and Features Through Genetic Programming [J].
Garcia-Limon, Mauricio ;
Jair Escalante, Hugo ;
Morales, Eduardo ;
Morales-Reyes, Alicia .
GECCO'14: PROCEEDINGS OF THE 2014 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2014, :517-524
[8]  
Gathercole C, 1994, LECT NOTES COMPUT SC, V866, P312
[9]  
Guyon I., 2003, Journal of Machine Learning Research, V3, P1157, DOI 10.1162/153244303322753616
[10]  
Hastie T, 1996, J ROY STAT SOC B, V58, P155