Bootstrap by sequential resampling

被引:23
作者
Rao, CR
Pathak, PK
Koltchinskii, VI
机构
[1] Penn State Univ, Dept Stat, University Pk, PA 16802 USA
[2] Univ New Mexico, Albuquerque, NM 87131 USA
关键词
bootstrap; sequential resampling; information content; sampling viewpoint; asymptotic correctness; empirical measures; empirical processes; weak convergence; symmetrization; Poissonization; p-variation; quantiles;
D O I
10.1016/S0378-3758(97)00041-4
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 [统计学]; 070103 [概率论与数理统计]; 0714 [统计学];
摘要
This paper examines resampling for bootstrap from a survey sampling point of view. Given an observed sample of size n, resampling for bootstrap involves n repeated trials of simple random sampling with replacement. From the point of view of information content, it is well known that simple random sampling with replacement does not result in samples that are equally informative (see Pathak (1964) Ann. Math. Statist. 35, 795-808; Biometrika 51, 185-193). This is due to the randomness in the number of distinct observations that occur in different bootstrap samples. We propose an alternative scheme of sampling sequentially (with replacement each time) until k distinct original observations appear. In such a scheme, the bootstrap sample size becomes random as it varies from sample to sample, but each sample has exactly the same number of distinct observations. We show that the choice of k = (1 - e(-1))n similar to 0.632n has some advantage, stemming from the observation made by Efron (1983, J. Am. Statist. Assoc. 78, 316-331) that the usual bootstrap samples are supported on approximately 0.632n of the original data points, Using recent results on empirical processes, we show that main empirical characteristics of the sequential resampling bootstrap are asymptotically within the distance of order similar to n(-3/4) Of the corresponding characteristics of the usual bootstrap. (C) 1997 Elsevier Science B.V.
引用
收藏
页码:257 / 281
页数:25
相关论文
共 29 条
[1]
EDGEWORTH EXPANSIONS FOR SAMPLING WITHOUT REPLACEMENT FROM FINITE POPULATIONS [J].
BABU, GJ ;
SINGH, K .
JOURNAL OF MULTIVARIATE ANALYSIS, 1985, 17 (03) :261-278
[2]
BABU GJ, 1993, MIXTURES GLOBAL LOCA
[3]
EDGEWORTH EXPANSION OF A FUNCTION OF SAMPLE MEANS [J].
BAI, ZD ;
RAO, CR .
ANNALS OF STATISTICS, 1991, 19 (03) :1295-1315
[4]
Bhattacharya RN., 1976, NORMAL APPROXIMATION
[5]
SOME ASYMPTOTIC THEORY FOR THE BOOTSTRAP [J].
BICKEL, PJ ;
FREEDMAN, DA .
ANNALS OF STATISTICS, 1981, 9 (06) :1196-1217
[6]
Dudley R.M., 1984, Ecole d'ete de Probabilites de Saint-Flour XII-1982, V1097, P1, DOI DOI 10.1007/BFB0099432
[7]
FRECHET DIFFERENTIABILITY, P-VARIATION AND UNIFORM DONSKER CLASSES [J].
DUDLEY, RM .
ANNALS OF PROBABILITY, 1992, 20 (04) :1968-1982
[10]
Efron B., 1982, SOC IND APPL MATH CB, V38