Equivalence of truncated count mixture distributions and mixtures of truncated count distributions

被引:36
作者
Bohning, Dankmar [1 ]
Kuhnert, Ronny
机构
[1] Univ Reading, Sch Biol Sci, Sch Appl Stat, Reading RG6 6FN, Berks, England
[2] Charite Med Sch Berlin, Inst Social Med Epidemiol & Hlth Econ, D-14195 Berlin, Germany
关键词
capture-recapture; mixture of truncated count densities; population size problem; truncated mixture of count densities;
D O I
10.1111/j.1541-0420.2006.00565.x
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
This article is about modeling count data with zero truncation. A parametric count density family is considered. The truncated mixture of densities from this family is different from the mixture of truncated densities from the same family. Whereas the former model is more natural to formulate and to interpret, the latter model is theoretically easier to treat. It is shown that for any mixing distribution leading to a truncated mixture, a (usually different) mixing distribution can be found so. that the associated mixture of truncated densities equals the truncated mixture, and vice versa. This implies that the likelihood surfaces for both situations agree, and in this sense both models are equivalent. Zero-truncated count data models are used frequently in the capture-recapture setting to estimate population size, and it can be shown that the two Horvitz-Thompson estimators, associated with the two models, agree. In particular, it is possible to achieve strong results for mixtures of truncated Poisson densities, including reliable, global construction of the unique NPMLE (nonparametric maximum likelihood estimator) of the mixing distribution, implying a unique estimator for the population size. The benefit of these results lies in the fact that it is valid to work with the mixture of truncated count densities, which is less appealing for the practitioner but theoretically easier. Mixtures of truncated count densities form a convex linear model, for which a developed theory exists, including global maximum likelihood theory as well as algorithmic approaches. Once the problem has been solved in this class, it might readily be transformed back to the original problem by means of an explicitly given mapping. Applications of these ideas are given, particularly in the case of the truncated Poisson family.
引用
收藏
页码:1207 / 1215
页数:9
相关论文
共 41 条
[1]  
[Anonymous], 2000, WILEY SERIES PROBABI
[2]   ESTIMATING COMPLETE SAMPLE-SIZE FROM AN INCOMPLETE POISSON SAMPLE [J].
BLUMENTHAL, S ;
DAHIYA, RC ;
GROSS, AJ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1978, 73 (361) :182-187
[4]   A NOTE ON A TEST FOR POISSON OVERDISPERSION [J].
BOHNING, D .
BIOMETRIKA, 1994, 81 (02) :418-419
[5]   The EM algorithm with gradient function update for discrete mixtures with known (fixed) number of components [J].
Böhning, D .
STATISTICS AND COMPUTING, 2003, 13 (03) :257-265
[6]   Nonparametric maximum likelihood estimation of population size based on the counting distribution [J].
Böhning, D ;
Schön, D .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2005, 54 :721-737
[7]   Estimating the number of drug users in Bangkok 2001:: A capture-recapture approach using repeated entries in one list [J].
Böhning, D ;
Suppawattanabodee, B ;
Kusolvisitkul, W ;
Viwatwongkasem, C .
EUROPEAN JOURNAL OF EPIDEMIOLOGY, 2004, 19 (12) :1075-1083
[8]  
BOHNING D, 2000, COMPUTER ASSISTED AN
[9]   ESTIMATING THE NUMBER OF SPECIES - A REVIEW [J].
BUNGE, J ;
FITZPATRICK, M .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1993, 88 (421) :364-373
[10]  
Cameron AC, 1998, Regression Analysis of Count Data