BOOSTING A WEAK LEARNING ALGORITHM BY MAJORITY

被引:1096
作者
FREUND, Y
机构
[1] AT and T Bell Laboratories, Murray Hill, NJ
关键词
D O I
10.1006/inco.1995.1136
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We present an algorithm for improving the accuracy of algorithms for learning binary concepts. The improvement is achieved by combining a large number of hypotheses, each of which is generated by training the given learning algorithm on a different set of examples. Our algorithm is based on ideas presented by Schapire and represents an improvement over his results, The analysis of our algorithm provides general upper bounds on the resources required for learning in Valiant's polynomial PAC learning framework, which are the best general upper bounds known today. We show that the number of hypotheses that are combined by our algorithm is the smallest number possible. Other outcomes of our analysis are results regarding the representational power of threshold circuits, the relation between learnability and compression, and a method for parallelizing PAC learning algorithms. We provide extensions of our algorithms to cases in which the concepts are not binary and to the case where the accuracy of the learning algorithm depends on the distribution of the instances. (C) 1995 Academic Press, inc.
引用
收藏
页码:256 / 285
页数:30
相关论文
共 22 条
  • [1] ASLAM JA, 1993, 3KTH P ANN IEEE S F
  • [2] OCCAM RAZOR
    BLUMER, A
    EHRENFEUCHT, A
    HAUSSLER, D
    WARMUTH, MK
    [J]. INFORMATION PROCESSING LETTERS, 1987, 24 (06) : 377 - 380
  • [3] LEARNABILITY AND THE VAPNIK-CHERVONENKIS DIMENSION
    BLUMER, A
    EHRENFEUCHT, A
    HAUSSLER, D
    WARMUTH, MK
    [J]. JOURNAL OF THE ACM, 1989, 36 (04) : 929 - 965
  • [4] DRUCKER H, 1992, COMMUNICATION
  • [5] Drucker Harris., 1993, ADV NEURAL INFORMATI, V5, P42
  • [6] FLOYD S, 1993, UCSCCRL9313 U CAL DE
  • [7] FREUND Y, 1995, EUROCOLT95
  • [8] GOLDMANN M, 1992, COMPUT COMPLEXITY, V2
  • [9] GRAHAM RL, 1991, CONCRETE MATH F COMP
  • [10] Haussler D., 1988, 29th Annual Symposium on Foundations of Computer Science (IEEE Cat. No.88CH2652-6), P100, DOI 10.1109/SFCS.1988.21928