Improved heterogeneous distance functions

被引:795
作者
Wilson, DR
Martinez, TR
机构
[1] Computer Science Department, Brigham Young University, Provo
关键词
D O I
10.1613/jair.346
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Instance-based learning techniques typically handle continuous and linear input values well, but often do not handle nominal input attributes appropriately. The Value Difference Metric (VDM) was designed to find reasonable distance values between nominal attribute values, but it largely ignores continuous attributes, requiring discretization to map continuous values into nominal values. This paper proposes three new heterogeneous distance functions, called the Heterogeneous Value Difference Metric (HVDM), the Interpolated Value Difference Metric (IVDM), and the Windowed Value Difference Metric (WVDM). These new distance functions are designed to handle applications with nominal attributes, continuous attributes, or both. In experiments on 48 applications the new distance metrics achieve higher classification accuracy on average than three previous distance functions on those datasets that have both nominal and continuous attributes.
引用
收藏
页码:1 / 34
页数:34
相关论文
共 60 条
[1]  
AHA DW, 1991, MACH LEARN, V6, P37, DOI 10.1007/BF00153759
[2]   TOLERATING NOISY, IRRELEVANT AND NOVEL ATTRIBUTES IN INSTANCE-BASED LEARNING ALGORITHMS [J].
AHA, DW .
INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1992, 36 (02) :267-287
[3]  
[Anonymous], PATTERN RECOGNITION
[4]  
[Anonymous], AAAI 94 WORKSH PROGR
[5]  
ATKESON C, 1989, ADV NEURAL INFORMATI, V2
[6]  
ATKESON C, 1996, IN PRESS ARTIFICIAL
[7]  
BIBERMAN Y, 1994, P 9 EUR C MACH LEARN, P49
[8]  
Broomhead D. S., 1988, Complex Systems, V2, P321
[9]  
CAMERONJONES RM, 1995, P 8 AUSTR JOINT C AR, P99
[10]   A MASSIVELY PARALLEL ARCHITECTURE FOR A SELF-ORGANIZING NEURAL PATTERN-RECOGNITION MACHINE [J].
CARPENTER, GA ;
GROSSBERG, S .
COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1987, 37 (01) :54-115