Preventing human error: The impact of data entry methods on data accuracy and statistical results

Cited by: 71
Authors
Barchard, Kimberly A. [1]
Pace, Larry A. [2]
Affiliations
[1] Univ Nevada, Dept Psychol, Las Vegas, NV 89154 USA
[2] Anderson Univ, Dept Psychol, Anderson, IN USA
Keywords
Data entry; Double entry; Visual checking; Outliers; Data cleaning
DOI
10.1016/j.chb.2011.04.004
Chinese Library Classification (CLC) number
B84 [Psychology]
Discipline classification code
04; 0402
Abstract
Human data entry can result in errors that ruin statistical results and conclusions. A single data entry error can make a moderate correlation turn to zero and a significant t-test non-significant. Therefore, researchers should design and use human-computer interactions that minimize data entry errors. In this paper, 195 undergraduates were randomly assigned to three data entry methods: double entry, visual checking, and single entry. After training in their assigned method, participants entered 30 data sheets, each containing six types of data. Visual checking resulted in 2958% more errors than double entry, and was not significantly better than single entry. These data entry errors sometimes had terrible effects on coefficient alphas, correlations, and t-tests. For example, 66% of the visual checking participants produced incorrect values for coefficient alpha, which was sometimes wrong by more than .40. Moreover, these data entry errors would be hard to detect: Only 0.06% of the errors were blank or outside of the allowable range for the variables. Thus, researchers cannot rely upon histograms and frequency tables to detect data entry errors. Single entry and visual checking should be replaced with more effective data entry methods, such as double entry. © 2011 Elsevier Ltd. All rights reserved.
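As an illustration of why double entry can catch errors that a range check misses, here is a minimal Python sketch. It is not the procedure or software used in the study. The function names (`find_mismatches`, `find_out_of_range`), the 1-to-5 rating scale, and the sample values are all hypothetical and chosen only for the example.

```python
from typing import List, Optional, Tuple

Row = List[Optional[int]]  # None stands for a blank cell


def find_mismatches(first: List[Row], second: List[Row]) -> List[Tuple[int, int]]:
    """Return (row, column) positions where two independent entry passes disagree."""
    if len(first) != len(second):
        raise ValueError("both passes must contain the same number of rows")
    mismatches = []
    for r, (row_a, row_b) in enumerate(zip(first, second)):
        if len(row_a) != len(row_b):
            raise ValueError(f"row {r} has different lengths in the two passes")
        for c, (a, b) in enumerate(zip(row_a, row_b)):
            if a != b:
                mismatches.append((r, c))
    return mismatches


def find_out_of_range(rows: List[Row], low: int, high: int) -> List[Tuple[int, int]]:
    """Return positions that are blank (None) or outside the range [low, high]."""
    return [
        (r, c)
        for r, row in enumerate(rows)
        for c, value in enumerate(row)
        if value is None or not (low <= value <= high)
    ]


# Hypothetical data on a 1-5 rating scale, typed twice from the same sheets.
pass_one = [[1, 4, 5], [2, 2, 3], [5, 1, 4]]
pass_two = [[1, 4, 5], [2, 5, 3], [5, 1, 4]]  # row 1, column 1: 5 typed instead of 2

# The wrong value 5 is still a legal rating, so the range check finds nothing.
range_flags = find_out_of_range(pass_two, low=1, high=5)

# Comparing the two passes flags the cell for a human to resolve against the sheet.
mismatch_flags = find_mismatches(pass_one, pass_two)

print("range check flags:", range_flags)
print("double-entry flags:", mismatch_flags)
```

In this sketch the comparison only reports where the two passes differ. Deciding which value is correct still requires going back to the original data sheet.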
Pages: 1834-1839
Number of pages: 6