Toward integrating feature selection algorithms for classification and clustering

被引:1789
作者
Liu, H [1 ]
Yu, L [1 ]
机构
[1] Arizona State Univ, Dept Comp Sci & Engn, Tempe, AZ 85287 USA
基金
美国国家科学基金会;
关键词
feature selection; classification; clustering; categorizing framework; unifying platform; real-world applications;
D O I
10.1109/TKDE.2005.66
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces concepts and algorithms of feature selection, surveys existing feature selection algorithms for classification and clustering, groups and compares different algorithms with a categorizing framework based on search strategies, evaluation criteria, and data mining tasks, reveals unattempted combinations, and provides guidelines in selecting feature selection algorithms. With the categorizing framework, we continue our efforts toward building an integrated system for intelligent feature selection. A unifying platform is proposed as an intermediate step. An illustrative example is presented to show how existing feature selection algorithms can be integrated into a meta algorithm that can take advantage of individual algorithms. An added advantage of doing so is to help a user employ a suitable algorithm without knowing details of each algorithm. Some real-world applications are included to demonstrate the use of feature selection in data mining. We conclude this work by identifying trends and challenges of feature selection research and development.
引用
收藏
页码:491 / 502
页数:12
相关论文
共 96 条
  • [1] DATABASE MINING - A PERFORMANCE PERSPECTIVE
    AGRAWAL, R
    IMIELINSKI, T
    SWAMI, A
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1993, 5 (06) : 914 - 925
  • [2] LEARNING BOOLEAN CONCEPTS IN THE PRESENCE OF MANY IRRELEVANT FEATURES
    ALMUALLIM, H
    DIETTERICH, TG
    [J]. ARTIFICIAL INTELLIGENCE, 1994, 69 (1-2) : 279 - 305
  • [3] ALMUALLIM H, 1991, P 9 NAT C ART INT AA, V2, P547
  • [4] [Anonymous], 1994, FEATURE SELECTION ME
  • [5] [Anonymous], 2001, FEATURE EXTRACTION C
  • [6] [Anonymous], FEATURE EXTRACTION C
  • [7] [Anonymous], 2002, P 19 INT C MACH LEAR
  • [8] [Anonymous], [No title captured], DOI DOI 10.1145/347090.347169
  • [9] [Anonymous], [No title captured]
  • [10] [Anonymous], 1990, P 10 INT C PATT REC, DOI DOI 10.1109/ICPR.1990.118160