Normalizing variables with too-frequent values using a Kolmogorov-Smirnov test: A practical approach

被引:24
作者
Drezner, Zvi [1 ]
Turel, Ofir [1 ]
机构
[1] Calif State Univ Fullerton, Steven G Mihaylo Coll Business & Econ, Fullerton, CA 92834 USA
关键词
Normal distribution; Normalizing data; Kolmogorov-Smirnov; Too-frequent data; MULTIVARIATE NORMALITY;
D O I
10.1016/j.cie.2011.07.015
中图分类号
TP39 [计算机的应用];
学科分类号
080201 [机械制造及其自动化];
摘要
Many quantitative applications in business operations, environmental engineering, and production assume sufficient normality of data, which is often, demonstrated using tests of normality, such as the Kolmogorov deemed Smirnov test. A practical problem arises when a high proportion of a too-frequent value exists in data, in which case transformation to normality that passes tests for normality may be impossible. Analysts and researchers are therefore often concerned with the question: should we bother transforming the variable to normality? Or should we revert to other approaches not requiring a normal distribution? In this study, we find the critical number of the frequency of a single value for which there is no feasible transformation to normality within a given a of the Kolmogorov-Smirnov test. The resultant decision table can guide the effort of analysts and researchers. (C) 2011 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1240 / 1244
页数:5
相关论文
共 17 条
[1]
Location and allocation of service units on a congested network [J].
Aboolian, Robert ;
Berman, Oded ;
Drezner, Zvi .
IIE TRANSACTIONS, 2008, 40 (04) :422-433
[2]
ACRONES MA, 2006, STAT PROBABILITY L B, V26, P211
[3]
AN ANALYSIS OF TRANSFORMATIONS [J].
BOX, GEP ;
COX, DR .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1964, 26 (02) :211-252
[4]
AN ANALYTIC APPROXIMATION TO THE DISTRIBUTION OF LILLIEFORS TEST STATISTIC FOR NORMALITY [J].
DALLAL, GE ;
WILKINSON, L .
AMERICAN STATISTICIAN, 1986, 40 (04) :294-296
[6]
Global-scale temperature patterns and climate forcing over the past six centuries [J].
Mann, ME ;
Bradley, RS ;
Hughes, MK .
NATURE, 1998, 392 (6678) :779-787
[7]
[8]
Mecklin CJ, 2004, INT STAT REV, V72, P123
[9]
A simulation approach to multivariate quality control [J].
Sepulveda, A ;
Nachlas, JA .
COMPUTERS & INDUSTRIAL ENGINEERING, 1997, 33 (1-2) :113-116
[10]
APPROXIMATIONS FOR NULL DISTRIBUTION OF W STATISTIC [J].
SHAPIRO, SS ;
WILK, MB .
TECHNOMETRICS, 1968, 10 (04) :861-&