On the approximability of minimizing nonzero variables or unsatisfied relations in linear systems

被引:500
作者
Amaldi, E [1 ]
Kann, V
机构
[1] Cornell Univ, Sch Operat Res, Ithaca, NY 14853 USA
[2] Cornell Univ, Ctr Math Appl, Ithaca, NY 14853 USA
[3] Royal Inst Technol, Dept Numer Anal & Comp Sci, S-10044 Stockholm, Sweden
关键词
linear systems; unsatisfied relations; nonzero variables; approximability bounds; designing linear classifiers;
D O I
10.1016/S0304-3975(97)00115-1
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We investigate the computational complexity of two closely related classes of combinatorial optimization problems for linear systems which arise in various fields such as machine learning, operations research and pattern recognition. In the first class (MIN ULR) one wishes, given a possibly infeasible system of linear relations, to find a solution that violates as few relations as possible while satisfying all the others. In the second class (MIN RVLS) the linear system is supposed to be feasible and one looks for a solution with as few nonzero variables as possible. For both MIN ULR and MIN RVLS the four basic types of relational operators =, greater than or equal to, > and not equal are considered. While MIN RVLS with equations was mentioned to be NP-hard in (Garey and Johnson, 1979), we established in (Amaldi; 1992; Amaldi and Kann, 1995) that MIN ULR with equalities and inequalities are NP-hard even when restricted to homogeneous systems with bipolar coefficients. The latter problems have been shown hard to approximate in (Arora et al., 1993). In this paper we determine strong bounds on the approximability of various variants of MIN RVLS and MIN ULR, including constrained ones where the variables are restricted to take binary values or where some relations are mandatory while others are optional. The various NP-hard versions turn out to have different approximability properties depending on the type of relations and the additional constraints, but none of them can be approximated within any constant factor, unless P = NP. Particular attention is devoted to two interesting special cases that occur in discriminant analysis and machine learning. In particular, we disprove a conjecture of van Horn and Martinet (1992) regarding the existence of a polynomial-time algorithm to design linear classifiers (or perceptrons) that involve a close-to-minimum number of features. (C) 1998 Published by Elsevier Science B.V. All rights reserved.
引用
收藏
页码:237 / 260
页数:24
相关论文
共 66 条
  • [1] AMALDI E, 1991, ARTIFICIAL NEURAL NETWORKS, VOLS 1 AND 2, P55
  • [2] THE COMPLEXITY AND APPROXIMABILITY OF FINDING MAXIMUM FEASIBLE SUBSYSTEMS OF LINEAR RELATIONS
    AMALDI, E
    KANN, V
    [J]. THEORETICAL COMPUTER SCIENCE, 1995, 147 (1-2) : 181 - 210
  • [3] AMALDI E, 1994, THESIS SWISS FED I T
  • [4] AMALDI E, 1992, ACT C ASP THEOR RES
  • [5] AMALDI E, 1994, ORWP694 DEP MATH SWI
  • [6] ANGLUIN D, 1983, ACM COMPUT SURV, V15, P237
  • [7] [Anonymous], LNCS
  • [8] [Anonymous], 1979, Computers and Intractablity: A Guide to the Theoryof NP-Completeness
  • [9] The hardness of approximate optima in lattices, codes, and systems of linear equations
    Arora, S
    Babai, L
    Stern, J
    Sweedyk, Z
    [J]. JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1997, 54 (02) : 317 - 331
  • [10] Arora S., 1992, Proceedings 33rd Annual Symposium on Foundations of Computer Science (Cat. No.92CH3188-0), P14, DOI 10.1109/SFCS.1992.267823