To Explain or to Predict?

被引:1865
作者
Shmueli, Galit [1 ]
机构
[1] Univ Maryland, Robert H Smith Sch Business, Dept Decis Operat & Informat Technol, College Pk, MD 20742 USA
关键词
Explanatory modeling; causality; predictive modeling; predictive power; statistical strategy; data mining; scientific research; MODEL; SELECTION; DYNAMICS; TESTS; BAYES; PRICE; RISK;
D O I
10.1214/10-STS330
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Statistical modeling is a powerful tool for developing and testing theories by way of causal explanation, prediction, and description. In many disciplines there is near-exclusive use of statistical modeling for causal explanation and the assumption that models with high explanatory power are inherently of high predictive power. Conflation between explanation and prediction is common, yet the distinction must be understood for progressing scientific knowledge. While this distinction has been recognized in the philosophy of science, the statistical literature lacks a thorough discussion of the many differences that arise in the process of modeling for an explanatory versus a predictive goal. The purpose of this article is to clarify the distinction between explanatory and predictive modeling, to discuss its sources, and to reveal the practical implications of the distinction to each step in the modeling process.
引用
收藏
页码:289 / 310
页数:22
相关论文
共 99 条
  • [1] Prediction in multilevel models
    Afshartous, D
    de Leeuw, J
    [J]. JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICS, 2005, 30 (02) : 109 - 139
  • [2] Aitchison J., 1975, Statistical Prediction Analysis
  • [3] [Anonymous], 2006, Introduction to Linear Regression Analysis
  • [4] [Anonymous], 2003, Bayesian Data Analysis
  • [5] [Anonymous], 1977, The advanced theory of statistics
  • [6] [Anonymous], 2001, Statistics, Econometrics and Forecasting
  • [7] [Anonymous], 2002, DEALING DATA FLOOD
  • [8] [Anonymous], 2002, Information visualization in data mining and knowledge discovery
  • [9] [Anonymous], 2006, Not Even Wrong: The Failure of String Theory and the Search for Unity in Physical Law
  • [10] [Anonymous], 2008, Econometric Analysis of Count Data