Assessing the reliability of statistical software: Part I

Cited by: 80
Author
McCullough, BD [1]
Affiliation
[1] FCC, Washington, DC 20554 USA
Keywords
accuracy; benchmarks; random number generator; software testing; StRD
DOI
10.2307/2685442
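Chinese Library Classification number
O21 [Probability theory and mathematical statistics]; C8 [Statistics]
Discipline classification code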
020208; 070103; 0714
Abstract
Entry-level tests of the accuracy of statistical software, such as Wilkinson's Statistics Quiz, have long been available, but more advanced collections of tests have not. This article proposes a set of intermediate-level tests focusing on three areas: estimation, both linear and nonlinear; random number generation; and statistical distributions (e.g., for calculating p-values). The complete methodology is described in detail. Convenient methods for summarizing the results are presented, so that an assessment of numerical accuracy can easily be incorporated into a software review.
Pages: 358-366
Number of pages: 9