Aggregating regression procedures to improve performance

被引:67
作者
Yang, YH [1 ]
机构
[1] Iowa State Univ, Dept Stat, Ames, IA 50011 USA
关键词
aggregating procedures; adaptive estimation; linear combining; nonparametric regression;
D O I
10.3150/bj/1077544602
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
A fundamental question regarding combining procedures concerns the potential gain and how much one needs to pay for it in terms of statistical risk. Juditsky and Nemirovski considered the case where a large number of procedures are to be combined. We give upper and lower bounds for complementary cases. Under an l(1) constraint on the linear coefficients, it is shown that for pursuing the best linear combination of n(tau) procedures, in terms of rate of convergence under the squared L-2 loss, one can pay a price of order O(log n/n n(1-tau)) when 0 < tau < (-1)(2) and a price of order O((log n/n)(1/)(2)) 2 when (2)-(1)tau<infinity. These rates cannot be improved or essentially improved in a uniform sense. This result suggests that one should be cautious in pursuing the best linear combination, because one may end up paying a high price for nothing when linear combination in fact does not help. We show that with care in aggregation, the final procedure can automatically avoid paying the high price for such a case and then behaves as well as the best candidate procedure.
引用
收藏
页码:25 / 47
页数:23
相关论文
共 38 条
[1]   Risk bounds for model selection via penalization [J].
Barron, A ;
Birgé, L ;
Massart, P .
PROBABILITY THEORY AND RELATED FIELDS, 1999, 113 (03) :301-413
[2]   The minimum description length principle in coding and modeling [J].
Barron, A ;
Rissanen, J ;
Yu, B .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1998, 44 (06) :2743-2760
[3]  
Barron Andrew R., 1987, Open problems in communication and computation, P85
[4]   UNIVERSAL APPROXIMATION BOUNDS FOR SUPERPOSITIONS OF A SIGMOIDAL FUNCTION [J].
BARRON, AR .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1993, 39 (03) :930-945
[5]  
BARRON AR, 1994, MACH LEARN, V14, P115, DOI 10.1007/BF00993164
[6]   MINIMUM COMPLEXITY DENSITY-ESTIMATION [J].
BARRON, AR ;
COVER, TM .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1991, 37 (04) :1034-1054
[7]   COMBINATION OF FORECASTS [J].
BATES, JM ;
GRANGER, CWJ .
OPERATIONAL RESEARCH QUARTERLY, 1969, 20 (04) :451-&
[8]   ON ESTIMATING A DENSITY USING HELLINGER DISTANCE AND SOME OTHER STRANGE FACTS [J].
BIRGE, L .
PROBABILITY THEORY AND RELATED FIELDS, 1986, 71 (02) :271-291
[9]  
Breiman L, 1996, MACH LEARN, V24, P49
[10]   Model selection: An integral part of inference [J].
Buckland, ST ;
Burnham, KP ;
Augustin, NH .
BIOMETRICS, 1997, 53 (02) :603-618