Estimating Box-Cox power transformation parameter via goodness-of-fit tests

被引:43
作者
Asar, Ozgur [1 ]
Ilk, Ozlem [2 ]
Dag, Osman [3 ]
机构
[1] Univ Lancaster, Lancaster Med Sch, CHICAS, Lancaster, England
[2] Middle East Tech Univ, Dept Stat, Ankara, Turkey
[3] Hacettepe Univ, Dept Biostat, Fac Med, Ankara, Turkey
关键词
Artificial covariate; Data transformation; Normality tests; Searching algorithms; Statistical software; FALSE DISCOVERY RATE; VARIANCE TEST; APPROXIMATE ANALYSIS; NORMALITY;
D O I
10.1080/03610918.2014.957839
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Box-Cox power transformation is a commonly used methodology to transform the distribution of the data into a normal distribution. The methodology relies on a single transformation parameter. In this study, we focus on the estimation of this parameter. For this purpose, we employ seven popular goodness-of-fit tests for normality, namely Shapiro-Wilk, Anderson-Darling, Cramer-von Mises, Pearson Chi-square, Shapiro-Francia, Lilliefors and Jarque-Bera tests, together with a searching algorithm. The searching algorithm is based on finding the argument of the minimum or maximum depending on the test, i.e., maximum for the Shapiro-Wilk and Shapiro-Francia, minimum for the rest. The artificial covariate method of Dag etal. (2014) is also included for comparison purposes. Simulation studies are implemented to compare the performances of the methods. Results show that Shapiro-Wilk and the artificial covariate method are more effective than the others and Pearson Chi-square is the worst performing method. The methods are also applied to two real-life datasets. The R package AID is proposed for implementation of the aforementioned methods.
引用
收藏
页码:91 / 105
页数:15
相关论文
共 27 条
[21]  
SHAPIRO SS, 1965, BIOMETRIKA, V52, P591, DOI 10.2307/2333709
[22]   The use of multiple imputation for the analysis of missing data [J].
Sinharay, S ;
Stern, HS ;
Russell, D .
PSYCHOLOGICAL METHODS, 2001, 6 (04) :317-329
[23]   A class of Box-Cox transformation models for recurrent event data [J].
Sun, Liuquan ;
Tong, Xingwei ;
Zhou, Xian .
LIFETIME DATA ANALYSIS, 2011, 17 (02) :280-301
[24]   Jarque-Bera test and its competitors for testing normality -: A power comparison [J].
Thadewald, Thorsten ;
Buning, Herbert .
JOURNAL OF APPLIED STATISTICS, 2007, 34 (01) :87-105
[25]  
THODE HC, 2002, TESTING NORMALITY
[26]   APPROXIMATE ANALYSIS OF VARIANCE TEST FOR NON-NORMALITY SUITABLE FOR MACHINE CALCULATION [J].
WEISBERG, S ;
BINGHAM, C .
TECHNOMETRICS, 1975, 17 (01) :133-134
[27]   Comparisons of various types of normality tests [J].
Yap, B. W. ;
Sim, C. H. .
JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2011, 81 (12) :2141-2155