Inference on the Order of a Normal Mixture

被引:40
作者
Chen, Jiahua [1 ]
Li, Pengfei [2 ]
Fu, Yuejiao [3 ]
机构
[1] Univ British Columbia, Dept Stat, Vancouver, BC V6T 1Z2, Canada
[2] Univ Waterloo, Dept Stat & Actuarial Sci, Waterloo, ON N2L 3G1, Canada
[3] York Univ, Dept Math & Stat, N York, ON M3J 1P3, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Chi-squared limiting distribution; Computer experiment; EM test; Likelihood ratio test; Order selection; Tuning parameter; Unequal variance; LIKELIHOOD RATIO TEST; GENE-EXPRESSION; NUMBER; COMPONENTS; HOMOGENEITY; SELECTION; DISTANCE; MODELS; TESTS;
D O I
10.1080/01621459.2012.695668
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Finite normal mixture models are used in a wide range of applications. Hypothesis testing on the order of the normal mixture is an important yet unsolved problem. Existing procedures often lack a rigorous theoretical foundation. Many are also hard to implement numerically. In this article, we develop a new method to fill the void in this important area. An effective expectation-maximization (EM) test is invented for testing the null hypothesis of arbitrary order m(0) under a finite normal mixture model. For any positive integer m(0) >= 2, the limiting distribution of the proposed test statistic is chi(2)(2m0). We also use a novel computer experiment to provide empirical formulas for the tuning parameter selection. The finite sample performance of the test is examined through simulation studies. Real-data examples are provided. The procedure has been implemented in R code. The p-values for testing the null order of m(0) = 2 or m(0) = 3 can be calculated with a single command. This article has supplementary materials available online.
引用
收藏
页码:1096 / 1105
页数:10
相关论文
共 37 条
[1]  
[Anonymous], 2000, Sankhya Ser. A, DOI DOI 10.2307/25051289
[2]  
[Anonymous], 1985, Proc. Berkeley Conference
[3]  
BARTLETT MS, 1953, BIOMETRIKA, V40, P12, DOI 10.1093/biomet/40.1-2.12
[4]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[5]   Testing homogeneity in a mixture distribution via the L2 distance between competing models [J].
Charnigo, R ;
Sun, JY .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2004, 99 (466) :488-498
[6]  
Chen HF, 2003, STAT SINICA, V13, P351
[7]   OPTIMAL RATE OF CONVERGENCE FOR FINITE MIXTURE-MODELS [J].
CHEN, JH .
ANNALS OF STATISTICS, 1995, 23 (01) :221-233
[8]  
Chen JH, 2008, STAT SINICA, V18, P443
[9]   HYPOTHESIS TEST FOR NORMAL MIXTURE MODELS: THE EM APPROACH [J].
Chen, Jiahua ;
Li, Pengfei .
ANNALS OF STATISTICS, 2009, 37 (5A) :2523-2542
[10]   Order Selection in Finite Mixture Models With a Nonsmooth Penalty [J].
Chen, Jiahua ;
Khalili, Abbas .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2008, 103 (484) :1674-1683