OPTIMAL AND DATA-BASED HISTOGRAMS

被引:1094
作者
SCOTT, DW
机构
[1] Department of Mathematical Sciences, Rice University, Houston, Texas
关键词
Frequency distribution; Histogram; Nonparametrio density estimation; Optimal bin width;
D O I
10.1093/biomet/66.3.605
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In this paper the formula for the optimal histogram bin width is derived which asymptotically minimizes the integrated mean squared error. Monte Carlo methods are used to verify the usefulness of this formula for small samples. A data-based procedure for choosing the bin width parameter is proposed, which assumes a Gaussian reference standard and requires only the sample size and an estimate of the standard deviation. The sensitivity of the procedure is investigated using several probability models which violate the Gaussian assumption. © 1979 Biometrika Trust.
引用
收藏
页码:605 / 610
页数:6
相关论文
共 15 条
[1]  
[Anonymous], 1962, COMPLEX INTELL SYST
[2]  
BONEVA LI, 1971, J ROY STAT SOC B, V33, P1
[3]  
GUTTMAN I, 1965, INTRO ENGINEERING ST
[4]  
HABER A, 1969, GENERAL STATISTICS
[5]  
KENDALL MG, 1969, ADV THEORY STATISTIC, V1
[6]  
LARSON HJ, 1975, STATISTICS INTRO
[7]   ESTIMATION OF A PROBABILITY DENSITY-FUNCTION AND MODE [J].
PARZEN, E .
ANNALS OF MATHEMATICAL STATISTICS, 1962, 33 (03) :1065-&
[8]   REMARKS ON SOME NONPARAMETRIC ESTIMATES OF A DENSITY-FUNCTION [J].
ROSENBLATT, M .
ANNALS OF MATHEMATICAL STATISTICS, 1956, 27 (03) :832-837
[9]  
SILVERMAN BW, 1978, BIOMETRIKA, V65, P1