HYPOTHESIS TEST FOR NORMAL MIXTURE MODELS: THE EM APPROACH

被引:108
作者
Chen, Jiahua [1 ]
Li, Pengfei [2 ]
机构
[1] Univ British Columbia, Dept Stat, Vancouver, BC V6T 1Z2, Canada
[2] Univ Alberta, Dept Math & Stat Sci, Edmonton, AB T6G 2G1, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Chi-square limiting distribution; compactness; normal mixture models; homogeneity test; likelihood ratio test; statistical genetics; LIKELIHOOD RATIO TEST; STRUCTURAL PARAMETER; VARIABLE SELECTION; HOMOGENEITY; GENE; ASYMPTOTICS; POPULATION; COMPONENTS; NUMBER;
D O I
10.1214/08-AOS651
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Normal mixture distributions are arguably the most important mixture models. and also the most technically challenging. The likelihood function of the normal mixture model is unbounded based oil a set of random samples, unless an artificial bound is placed oil its component variance parameter. Moreover, the model is not strongly identifiable so it is hard to differentiate between over dispersion caused by the presence of a mixture and that caused by a large variance, and it has infinite Fisher information with respect to mixing proportions. There has been extensive research oil finite normal mixture models, but much of it addresses merely consistency of the point estimation or useful practical procedures, and many, results require undesirable restrictions oil the parameter space. We show that an EM-test for homogeneity is effective at overcoming many challenges in the context of finite normal mixtures. We find that the limiting, distribution of the EM-test is a simple function of the 0.5 chi(2)(0) + 0.5 chi(1)(2) and chi(2)(1) distributions when the mixing variances are equal but unknown and the chi(2)(2) when variances are unequal and unknown. Simulations unknown and the show that the limiting distributions approximate the finite sample distribution satisfactorily. Two genetic examples are used to illustrate the application of the EM-test.
引用
收藏
页码:2523 / 2542
页数:20
相关论文
共 37 条
[1]  
[Anonymous], 1985, Proc. Berkeley Conference
[2]  
[Anonymous], APPL STAT, DOI DOI 10.2307/2347790
[3]   Estimation and confidence sets for sparse normal mixtures [J].
Cai, T. Tony ;
Jin, Jiashun ;
Low, Mark G. .
ANNALS OF STATISTICS, 2007, 35 (06) :2421-2449
[4]   Testing homogeneity in a mixture distribution via the L2 distance between competing models [J].
Charnigo, R ;
Sun, JY .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2004, 99 (466) :488-498
[5]  
Chen HF, 2003, STAT SINICA, V13, P351
[6]   A modified likelihood ratio test for homogeneity in finite mixture models [J].
Chen, HF ;
Chen, JH ;
Kalbfleisch, JD .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2001, 63 :19-29
[7]  
CHEN J, 2008, HOMOGENEITY TEST NOR
[8]   Modified likelihood ratio test in finite mixture models with a structural parameter [J].
Chen, JH ;
Kalbfleisch, JD .
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2005, 129 (1-2) :93-107
[9]   OPTIMAL RATE OF CONVERGENCE FOR FINITE MIXTURE-MODELS [J].
CHEN, JH .
ANNALS OF STATISTICS, 1995, 23 (01) :221-233
[10]  
Chen JH, 2008, STAT SINICA, V18, P443