Convexity and logical analysis of data

被引:8
作者
Ekin, O
Hammer, PL
Kogan, A
机构
[1] Rutgers State Univ, RUTCOR, New Brunswick, NJ 08903 USA
[2] Bilkent Univ, Dept Ind Engn, Ankara, Turkey
[3] Rutgers State Univ, Fac Management, Newark, NJ 07102 USA
基金
美国国家科学基金会;
关键词
partially-defined Boolean functions; orthogonal disjunctive normal forms; computational learning theory; classification; polynomial algorithms;
D O I
10.1016/S0304-3975(98)00337-5
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
A Boolean function is called k-convex if for any pair x,y of its true points at Hamming distance at most k, every point "between" x and y is also true. Given a set of true points and a set of false points, the central question of Logical Analysis of Data is the study of those Boolean functions whose values agree with those of the given points. In this paper we examine data sets which admit k-convex Boolean extensions. We provide polynomial algorithms for finding a k-convex extension, if any, and for finding the maximum k for which a k-convex extension exists. We study the problem of uniqueness, and provide a polynomial algorithm for checking whether all k-convex extensions agree in a point outside the given data set. We estimate the number of k-convex Boolean functions, and show that for small k this number is doubly exponential. On the other hand, we also show that for large k the class of k-convex Boolean functions is PAC-learnable. (C) 2000 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:95 / 116
页数:22
相关论文
共 15 条
[1]  
ANTHONY M., 1997, Computational learning theory, V30
[2]  
Ball M. O., 1979, Mathematics of Operations Research, V4, P132, DOI 10.1287/moor.4.2.132
[3]   On the applications of multiplicity automata in learning [J].
Beimel, A ;
Bergadano, F ;
Bshouty, NH ;
Kushilevitz, E .
37TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, PROCEEDINGS, 1996, :349-358
[4]  
Blake A., 1937, Ph.D. dissertation
[5]   LEARNABILITY AND THE VAPNIK-CHERVONENKIS DIMENSION [J].
BLUMER, A ;
EHRENFEUCHT, A ;
HAUSSLER, D ;
WARMUTH, MK .
JOURNAL OF THE ACM, 1989, 36 (04) :929-965
[6]  
BOROS E, IN PRESS IEEE T KNOW
[7]   A MEASURE OF ASYMPTOTIC EFFICIENCY FOR TESTS OF A HYPOTHESIS BASED ON THE SUM OF OBSERVATIONS [J].
CHERNOFF, H .
ANNALS OF MATHEMATICAL STATISTICS, 1952, 23 (04) :493-507
[8]  
Colbourn C.J., 1987, The combinatorics of network reliability
[9]  
Crama Y., 1988, Annals of Operations Research, V16, P299, DOI 10.1007/BF02283750
[10]   A GENERAL LOWER BOUND ON THE NUMBER OF EXAMPLES NEEDED FOR LEARNING [J].
EHRENFEUCHT, A ;
HAUSSLER, D ;
KEARNS, M ;
VALIANT, L .
INFORMATION AND COMPUTATION, 1989, 82 (03) :247-261