Complexity theoretic hardness results for query learning

被引:18
作者
Aizenstein, H
Hegedus, T
Hellerstein, L
Pitt, L
机构
[1] Univ Pittsburgh, Western Psychiat Inst & Clin, Sch Med, Pittsburgh, PA 15213 USA
[2] Northwestern Univ, Dept Elect Engn & Comp Sci, Evanston, IL 60208 USA
[3] Comenius Univ, Dept Comp Sci, Bratislava 84215, Slovakia
[4] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
基金
美国国家科学基金会;
关键词
query learning; computational learning theory; complexity theory; read-thrice DNF; threshold functions; membership queries; equivalence queries;
D O I
10.1007/PL00001593
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We investigate the complexity of learning for the well-studied model in which the learning algorithm may ask membership and equivalence queries. While complexity theoretic techniques have previously been used to prove hardness results in various learning models, these techniques typically are not strong enough to use when a learning algorithm may make membership queries. We develop a general technique for proving hardness results for learning with membership and equivalence queries land for more general query models). We apply the technique to show that, assuming NP not equal co-NP, no polynomial-time membership and (proper) equivalence query algorithms exist for exactly learning read-thrice DNF formulas, unions of k greater than or equal to 3 halfspaces over the Boolean domain, or some other related classes. Our hardness results are representation dependent, and do not preclude the existence of representation independent algorithms. The general technique introduces the representation, problem for a class F of representations (e.g., formulas), which is naturally associated with the learning problem for F. This problem is related to the structural question of how to characterize functions representable by formulas in F, and is a generalization of standard complexity problems such as SATISFIABILITY. While in general the representation problem is in Sigma(2)(P), we present a theorem demonstrating that for "reasonable" classes F, the existence of a polynomial-time membership and equivalence query algorithm for exactly learning F implies that the representation problem for F is in fact in co-NP. The theorem is applied to prove hardness results such as the ones mentioned above, by showing that the representation problem for specific classes of formulas is NP-hard.
引用
收藏
页码:19 / 53
页数:35
相关论文
共 61 条
[1]  
Aizenstein H., 1992, Proceedings 33rd Annual Symposium on Foundations of Computer Science (Cat. No.92CH3188-0), P523, DOI 10.1109/SFCS.1992.267799
[2]  
AIZENSTEIN H, 1993, UIUCDCSR931813 U ILL
[3]  
AIZENSTEIN H, 1994, IN PRES SIAM J COMPU, P110
[4]  
ANGLUIN D, 1990, MACH LEARN, V5, P121, DOI 10.1023/A:1022692615781
[5]  
Angluin D., 1992, Proceedings of the Twenty-Fourth Annual ACM Symposium on the Theory of Computing, P351, DOI 10.1145/129712.129746
[6]  
ANGLUIN D, 1992, MACH LEARN, V9, P147, DOI 10.1007/BF00992675
[7]   LEARNING READ-ONCE FORMULAS WITH QUERIES [J].
ANGLUIN, D ;
HELLERSTEIN, L ;
KARPINSKI, M .
JOURNAL OF THE ACM, 1993, 40 (01) :185-210
[8]  
Angluin D., 1988, Machine Learning, V2, P319, DOI 10.1007/BF00116828
[9]   LEARNING REGULAR SETS FROM QUERIES AND COUNTEREXAMPLES [J].
ANGLUIN, D .
INFORMATION AND COMPUTATION, 1987, 75 (02) :87-106
[10]  
ANGLUIN D, 1991, J COMPUTER SYSTEM SC, V50, P444