Bandwidth selection: Classical or plug-in?

被引:230
作者
Loader, CR [1 ]
机构
[1] Lucent Technol, Murray Hill, NJ 07974 USA
关键词
Akaike's information criterion; bandwidth; cross validation; density estimation; local fitting; local likelihood; plug-in;
D O I
10.1214/aos/1018031201
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Bandwidth selection for procedures such as kernel density estimation and local regression have been widely studied over the past decade. Substantial "evidence" has been collected to establish superior performance of modern plug-in methods in comparison to methods such as cross validation: this has ranged from detailed analysis of rates of convergence, to simulations, to superior performance on real datasets. In this work we take a detailed look at some of this evidence, looking into the sources of differences. Our findings challenge the claimed superiority of plug-in methods on several fronts. First, plug-in methods are heavily dependent on arbitrary specification of pilot bandwidths and fail when this specification is wrong. Second, the often-quoted variability and undersmoothing of cross validation simply reflects the uncertainty of bandwidth selection; plug-in methods reflect this uncertainty by oversmoothing and missing important features when given difficult problems. Third, we look at asymptotic theory. Plug-in methods use available curvature information in an inefficient manner, resulting in inefficient estimates. Previous comparisons with classical approaches penalized the classical approaches for this inefficiency Asymptotically, the plug-in based estimates are beaten by their own pilot estimates.
引用
收藏
页码:415 / 438
页数:24
相关论文
共 38 条
[21]   EXACT MEAN INTEGRATED SQUARED ERROR [J].
MARRON, JS ;
WAND, MP .
ANNALS OF STATISTICS, 1992, 20 (02) :712-736
[22]  
MARRON JS, 1996, STAT THEORY COMPUTAT, P1
[23]   GENERALIZED LINEAR MODELS [J].
NELDER, JA ;
WEDDERBURN, RW .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-GENERAL, 1972, 135 (03) :370-+
[24]  
Park B. U., 1992, Computational Statistics, V7, P251
[25]   COMPARISON OF DATA-DRIVEN BANDWIDTH SELECTORS [J].
PARK, BU ;
MARRON, JS .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1990, 85 (409) :66-72
[26]   BANDWIDTH CHOICE FOR NONPARAMETRIC REGRESSION [J].
RICE, J .
ANNALS OF STATISTICS, 1984, 12 (04) :1215-1230
[27]   REMARKS ON SOME NONPARAMETRIC ESTIMATES OF A DENSITY-FUNCTION [J].
ROSENBLATT, M .
ANNALS OF MATHEMATICAL STATISTICS, 1956, 27 (03) :832-837
[28]  
RUDEMO M, 1982, SCAND J STAT, V9, P65
[29]   An effective bandwidth selector for local least squares regression [J].
Ruppert, D ;
Sheather, SJ ;
Wand, MP .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1995, 90 (432) :1257-1270
[30]  
SCHUSTER EF, 1981, COMPUTER SCI STAT P