On the difficulty of approximately maximizing agreements

Cited by: 64
Authors
Ben-David, S
Eiron, N
Long, PM
Institutions
[1] Genome Inst Singapore, Singapore 117604, Singapore
[2] Technion Israel Inst Technol, Dept Comp Sci, IL-32000 Haifa, Israel
[3] IBM Corp, Almaden Res Ctr, San Jose, CA 95120 USA
Keywords
machine learning; computational learning theory; neural networks; inapproximability; hardness; half-spaces; axis-aligned hyper-rectangles; balls; monomials
DOI
10.1016/S0022-0000(03)00038-2
Chinese Library Classification
TP3 [Computing Technology, Computer Technology]
Subject Classification Code
0812
Abstract
We address the computational complexity of learning in the agnostic framework. For a variety of common concept classes we prove that, unless P = NP, there is no polynomial-time approximation scheme for finding a member of the class that approximately maximizes the agreement with a given training sample. In particular, our results apply to the classes of monomials, axis-aligned hyper-rectangles, closed balls, and monotone monomials. For each of these classes, we prove the NP-hardness of approximating maximal agreement to within some fixed constant (independent of the sample size and of the dimensionality of the sample space). For the class of half-spaces, we prove that, for any ε > 0, it is NP-hard to approximately maximize agreements to within a factor of (418/415 - ε), improving on the best previously known constant for this problem, and using a simpler proof. An interesting feature of our proofs is that, for each of the classes we discuss, we find patterns of training examples that, while being hard for approximating agreement within that concept class, allow efficient agreement maximization within other concept classes. These results bring up a new aspect of the model selection problem: they imply that the choice of hypothesis class for agnostic learning, from among those considered in this paper, can drastically affect the computational complexity of the learning process. (C) 2003 Elsevier Science (USA). All rights reserved.
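As a concrete illustration of the objective these hardness results concern, here is a minimal sketch (in Python, with a hypothetical toy sample chosen only for illustration) of scoring a single half-space hypothesis against labeled data. Evaluating a fixed hypothesis takes linear time; the hardness lies entirely in searching for a hypothesis that (approximately) maximizes this score.

import numpy as np

def agreement_rate(w, b, X, y):
    """Fraction of the labeled sample on which the half-space
    sign(w . x + b) agrees with the labels y in {-1, +1}."""
    preds = np.sign(X @ w + b)
    preds[preds == 0] = 1  # tie-break points lying exactly on the hyperplane
    return float(np.mean(preds == y))

# Hypothetical toy sample in R^2; in the agnostic setting the labels
# need not be consistent with any half-space.
X = np.array([[1.0, 2.0], [2.0, 1.0], [-1.0, -1.0], [0.5, -2.0]])
y = np.array([1, 1, -1, 1])

# Scoring one candidate is easy; the paper shows that finding a
# (418/415 - eps)-approximate maximizer over all half-spaces is NP-hard.
print(agreement_rate(np.array([1.0, 1.0]), 0.0, X, y))  # -> 0.75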
Pages: 496-514
Number of pages: 19