Genetic programming with a genetic algorithm for feature construction and selection

被引:93
作者
Smith M.G. [1 ]
Bull L. [1 ]
机构
[1] Faculty of Computing, Engineering and Mathematical Sciences, University of the West of England, Bristol
关键词
Classification; Feature construction; Feature selection; Genetic algorithm; Genetic programming; Machine learning;
D O I
10.1007/s10710-005-2988-7
中图分类号
学科分类号
摘要
The use of machine learning techniques to automatically analyse data for information is becoming increasingly widespread. In this paper we primarily examine the use of Genetic Programming and a Genetic Algorithm to pre-process data before it is classified using the C4.5 decision tree learning algorithm. Genetic Programming is used to construct new features from those available in the data, a potentially significant process for data mining since it gives consideration to hidden relationships between features. A Genetic Algorithm is used to determine which such features are the most predictive. Using ten well-known datasets we show that our approach, in comparison to C4.5 alone, provides marked improvement in a number of cases. We then examine its use with other well-known machine learning techniques. © 2005 Springer Science + Business Media, Inc.
引用
收藏
页码:265 / 281
页数:16
相关论文
共 24 条
  • [1] Aha D., Kibler D., Instance-based learning algorithms, Machine Learning, 6, pp. 37-66, (1991)
  • [2] Ahluwalia M., Bull L., Co-evolving functions in genetic programming: Classification using k-nearest neighbour, GECCO-99: Proceedings of the Genetic and Evolutionary Computation Conference, pp. 947-952, (1999)
  • [3] Amit Y., Geman D., Shape quantization and recognition with randomized trees, Neural Computation, 9, 7, pp. 1545-1588, (1996)
  • [4] Banzhaf W., Nordin P., Keller R.E., Francone F.D., Genetic Programming - An Introduction on the Automatic Evolution of Computer Programs and its Applications, (1998)
  • [5] Breiman L., Bagging predictors, Machine Learning, 24, 2, pp. 123-140, (1996)
  • [6] Dagher I., Georgiopoulos M., Heileman G.L., Bebis G., An ordering algorithm for pattern presentation in fuzzy ARTMAP that tends to improve generalization performance, IEEE Transactions on Neural Networks, 10, 4, pp. 768-778, (1999)
  • [7] Dixon P., Corne D., Oates M., A preliminary investigation of modified XCS as a generic data mining Tool, Advances in Learning Classifier Systems, pp. 133-151, (2001)
  • [8] Ekart A., Markus A., Using genetic programming and decision trees for generating structural descriptions of four bar mechanisms, Artificial Intelligence for Engineering Design, Analysis and Manufacturing, 17, 3, (2003)
  • [9] Guyon I., Elisseeff A., An introduction to variable and feature selection, Journal of Machine Learning Research, 3, pp. 1157-1182, (2003)
  • [10] Holland J., Adaptation in Natural and Artificial Systems, (1975)