NANO: A New Supervised Algorithm for Feature Selection with Discretization

被引:3
作者
Senthilkumar, J. [1 ]
Manjula, D. [1 ]
Krishnamoorthy, R. [2 ]
机构
[1] Anna Univ, Dept Comp Sci & Engn, Chennai 600025, Tamil Nadu, India
[2] Anna Univ, Dept Informat Technol, Chennai 600025, Tamil Nadu, India
来源
2009 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE, VOLS 1-3 | 2009年
关键词
Discretization; feature selection; pattern classification;
D O I
10.1109/IADCC.2009.4809243
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Discretization turns numeric attributes into discrete ones. Feature selection eliminates some irrelevant and/or redundant attributes. Data discretization and feature selection are two important tasks that performed prior to the learning phase of data mining algorithms and significantly reduces the processing effort of the learning algorithm. In this paper, we present a new algorithm, called Nano, that can perform simultaneously data discretization and feature selection. In feature selection process irrelevant and redundant attributes as a measure of inconsistence are eliminated to determine the final number of intervals and to select features. The proposed Nano algorithm aims at keeping the minimal number of intervals with minimal inconsistency and establishes a tradeoff between these measures. The empirical results demonstrate that the proposed Nano algorithm is effective in feature selection and discretization of numeric and ordinal attributes.
引用
收藏
页码:1515 / +
页数:3
相关论文
共 20 条
  • [1] [Anonymous], P EUR WORK SESS LEAR
  • [2] [Anonymous], P 10 NAT C ART INT S
  • [3] [Anonymous], P 10 NAT C ART INT
  • [4] Antonie Maria-Luiza, 2001, Proceedings of the 2nd International Workshop on Multimedia Data Mining, P94
  • [5] ASUNCION A, 2007, UCI REPOSITORY
  • [6] Selection of relevant features and examples in machine learning
    Blum, AL
    Langley, P
    [J]. ARTIFICIAL INTELLIGENCE, 1997, 97 (1-2) : 245 - 271
  • [7] CARDIE C, 1993, P 10 INT C MACH LEAR, P25
  • [8] Feature selection for clustering - A filter solution
    Dash, M
    Choi, K
    Scheuermann, P
    Liu, H
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2002, : 115 - 122
  • [9] Dash Manoranjan, 1997, INTELL DATA ANAL, V38, P131, DOI DOI 10.1016/S1088-467X(97)00008-5
  • [10] FEATURE-SELECTION FOR AUTOMATIC CLASSIFICATION OF NON-GAUSSIAN DATA
    FOROUTAN, I
    SKLANSKY, J
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1987, 17 (02): : 187 - 198