A feature selection technique for classificatory analysis

被引:79
作者
Ahmad, A
Dey, L
机构
[1] Indian Inst Technol, Dept Math, New Delhi 110016, India
[2] Solid State Phys Lab, Delhi 110054, India
关键词
feature selection; significance of attributes; classificatory knowledge extraction;
D O I
10.1016/j.patrec.2004.08.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Patterns summarizing mutual associations between class decisions and attribute values in a pre-classified database, provide insight into the significance of attributes and also useful classificatory knowledge. In this paper we have proposed a conditional probability based, efficient method to extract the significant attributes from a database. Reducing the feature set during pre-processing enhances the quality of knowledge extracted and also increases the speed of computation. Our method supports easy visualization of classificatory knowledge. A likelihood-based classification algorithm that uses this classificatory knowledge is also proposed. We have also shown how the classification methodology can be used for cost-sensitive learning where both accuracy and precision of prediction are important. (C) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:43 / 56
页数:14
相关论文
共 25 条
[1]   Selection of relevant features and examples in machine learning [J].
Blum, AL ;
Langley, P .
ARTIFICIAL INTELLIGENCE, 1997, 97 (1-2) :245-271
[2]  
Breiman L., 1998, CLASSIFICATION REGRE
[3]   Genetic feature selection in a fuzzy rule-based classification system learning process for high-dimensional problems [J].
Casillas, J ;
Cordón, O ;
Del Jesus, MJ ;
Herrera, F .
INFORMATION SCIENCES, 2001, 136 (1-4) :135-157
[4]   CLASS-DEPENDENT DISCRETIZATION FOR INDUCTIVE LEARNING FROM CONTINUOUS AND MIXED-MODE DATA [J].
CHING, JY ;
WONG, AKC ;
CHAN, KCC .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1995, 17 (07) :641-651
[5]   A WEIGHTED NEAREST NEIGHBOR ALGORITHM FOR LEARNING WITH SYMBOLIC FEATURES [J].
COST, S ;
SALZBERG, S .
MACHINE LEARNING, 1993, 10 (01) :57-78
[6]  
Dash M., 1997, Intelligent Data Analysis, V1
[7]  
Domingos P., 2009, P 5 INT C KNOWL DISC, P155
[8]  
Dougherty J., 1995, P 12 INT C MACHINE L
[9]  
Duda R. O., 2000, PATTERN CLASSIFICATI
[10]   Design of an optimal nearest neighbor classifier using an intelligent genetic algorithm [J].
Ho, SY ;
Liu, CC ;
Liu, S .
PATTERN RECOGNITION LETTERS, 2002, 23 (13) :1495-1503