COMPARISON OF 3 ESTIMATORS OF THE NUMBER OF SPECIES

被引:20
作者
BUNGE, J
FITZPATRICK, M
HANDLEY, J
机构
[1] CORNELL UNIV,NEW YORK STATE SCH IND & LABOR RELAT,DEPT ECON & SOCIAL STAT,ITHACA,NY 14853
[2] ROCHESTER INST TECHNOL,ROCHESTER,NY 14623
关键词
D O I
10.1080/757584397
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider estimation of the number of cells in a multinomial distribution. This is one version of the species problem: there are many applications, such as the estimation of the number of unobserved species of animals; estimation of vocabulary size, etc. We describe the results of a simulation comparison of three principal 'frequentist' procedures for estimating the number of cells (or species). The first procedure postulates a functional form for the cell probabilities; the second procedure approximates the distribution of the probabilities by a parametric probability density function; and the third procedure is based on an estimate of the sample coverage, i.e. the sum of the probabilities of the observed cells. Among the procedures studied we find that the third (non-parametric) method is globally preferable; the second (functional parametric) method cannot be recommended; and that, when based on the inverse Gaussian density, the first method is competitive in some cases with the third method. We also discuss Sichel's recent generalized inverse Gaussian-based procedure which, with some refinement, promises to perform at least as well as the non-parametric method in all cases.
引用
收藏
页码:45 / 59
页数:15
相关论文
共 18 条