Statistics, handle with care: Detecting multiple model components with the likelihood ratio test

Cited by: 521
Authors
Protassov, R
van Dyk, DA
Connors, A
Kashyap, VL
Siemiginowska, A
Affiliations
[1] Harvard Univ, Dept Stat, Cambridge, MA 02138 USA
[2] Eureka Sci, Oakland, CA 94602 USA
[3] Harvard Smithsonian Ctr Astrophys, Cambridge, MA 02138 USA
Keywords
methods: statistical
DOI
10.1086/339856
Chinese Library Classification (CLC) number
P1 [Astronomy]
Subject classification code
0704
Abstract
The likelihood ratio test (LRT) and the related F-test, popularized in astrophysics by Eadie and coworkers in 1971, Bevington in 1969, Lampton, Margon, & Bowyer in 1976, Cash in 1979, and Avni in 1978, do not (even asymptotically) adhere to their nominal χ² and F-distributions in many statistical tests common in astrophysics, thereby casting many marginal line or source detections and nondetections into doubt. Although the above authors illustrate the many legitimate uses of these statistics, in some important cases it can be impossible to compute the correct false positive rate. For example, it has become common practice to use the LRT or the F-test to detect a line in a spectral model or a source above background despite the lack of certain required regularity conditions. (These applications were not originally suggested by Cash or by Bevington.) In these and other settings that involve testing a hypothesis that is on the boundary of the parameter space, contrary to common practice, the nominal χ² distribution for the LRT or the F-distribution for the F-test should not be used. In this paper, we characterize an important class of problems in which the LRT and the F-test fail and illustrate this nonstandard behavior. We briefly sketch several possible acceptable alternatives, focusing on Bayesian posterior predictive probability values. We present this method in some detail since it is a simple, robust, and intuitive approach. This alternative method is illustrated using the gamma-ray burst of 1997 May 8 (GRB 970508) to investigate the presence of an Fe K emission line during the initial phase of the observation. There are many legitimate uses of the LRT and the F-test in astrophysics, and even when these tests are inappropriate, there remain several statistical alternatives (e.g., judicious use of error bars and Bayes factors). Nevertheless, there are numerous cases of the inappropriate use of the LRT and similar tests in the literature, bringing substantive scientific results into question.
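The boundary problem described in the abstract can be seen in a toy simulation. The sketch below is not the paper's spectral model or its posterior predictive method. It assumes a much simpler setting chosen only for illustration: unit-variance Gaussian data with a mean constrained to be nonnegative, testing mean = 0 (the boundary) against mean ≥ 0. All function names and constants are hypothetical. In this setting the LRT statistic is a 50:50 mixture of a point mass at zero and a χ² variable with 1 degree of freedom (the classical boundary result of Chernoff 1954), so the nominal χ² cutoff gives the wrong false positive rate.

```python
import math
import random

# Approximate upper 5% point of chi-square with 1 degree of freedom.
CHI2_1_CRIT_05 = 3.841


def chi2_1_sf(x):
    """P(X > x) for X ~ chi-square(1), computed from the Gaussian tail."""
    return math.erfc(math.sqrt(x / 2.0))


def lrt_statistic(sample):
    """LRT for H0: mu = 0 vs H1: mu >= 0 with known unit variance.

    The constrained MLE is max(xbar, 0), so the statistic
    2 * (loglik(mu_hat) - loglik(0)) reduces to n * max(xbar, 0)**2.
    """
    n = len(sample)
    xbar = sum(sample) / n
    mu_hat = max(xbar, 0.0)  # the estimate cannot leave the parameter space
    return n * mu_hat ** 2


def simulate_null(n_sims, n_obs, seed):
    """Draw LRT statistics from data simulated under H0 (mu = 0)."""
    rng = random.Random(seed)
    return [
        lrt_statistic([rng.gauss(0.0, 1.0) for _ in range(n_obs)])
        for _ in range(n_sims)
    ]


def false_positive_rate(stats, critical_value):
    """Fraction of null simulations that exceed the critical value."""
    return sum(1 for s in stats if s > critical_value) / len(stats)


if __name__ == "__main__":
    stats = simulate_null(n_sims=20000, n_obs=25, seed=1)
    print("nominal rate:", chi2_1_sf(CHI2_1_CRIT_05))
    print("simulated rate:", false_positive_rate(stats, CHI2_1_CRIT_05))
    print("fraction exactly zero:", sum(s == 0.0 for s in stats) / len(stats))
```

In this toy case the simulated false positive rate should come out near half the nominal 5%, and about half of the statistics should be exactly zero. The mismatch here happens to make the test conservative; for the line-detection problems the abstract discusses, the size and direction of the discrepancy need not be the same, which is why calibrating against simulated data is the safer habit.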
Pages: 545-559
Number of pages: 15
Related papers
41 in total
[1] AVNI Y, 1978, ASTRON ASTROPHYS, V66, P307
[2] Band, DL; Ford, LA; Matteson, JL; Briggs, MS; Paciesas, WS; Pendleton, GN; Preece, RD; Palmer, DM; Teegarden, BJ; Schaefer, BE. BATSE gamma-ray burst line search. 3. Line detectability [J]. Astrophysical Journal, 1995, 447 (01): 289-301
[3] Band, DL; Ryder, S; Ford, LA; Matteson, JL; Palmer, DM; Teegarden, BJ; Briggs, MS; Paciesas, WS; Pendleton, GN; Preece, RD. BATSE gamma-ray burst line search. 4. Line candidates from the visual search [J]. Astrophysical Journal, 1996, 458 (02): 746-754
[4] Band, DL; Ford, LA; Matteson, JL; Briggs, MS; Paciesas, WS; Pendleton, GN; Preece, RD. BATSE gamma-ray burst line search. 5. Probability of detecting a line in a burst [J]. Astrophysical Journal, 1997, 485 (02): 747-755
[5] Bayarri MJ, 1999, BAYESIAN STATISTICS 6, P53
[6] Bevington R., 1969, DATA REDUCTION ERROR
[7] Carlin B. P., 2001, BAYES EMPIRICAL BAYE
[8] CASH W, 1979, ASTROPHYS J, V228, P939, DOI 10.1086/156922
[9] Chernoff, H. On the distribution of the likelihood ratio [J]. Annals of Mathematical Statistics, 1954, 25 (03): 573-578
[10] Connors A., 1997, DATA ANAL ASTRONOMY, P251