A Survey of Model Evaluation Approaches With a Tutorial on Hierarchical Bayesian Methods

被引:230
作者
Shiffrin, Richard M. [2 ,3 ]
Lee, Michael D. [1 ]
Kim, Woojae [2 ,3 ]
Wagenmakers, Eric-Jan [4 ]
机构
[1] Univ Calif Irvine, Dept Cognit Sci, Irvine, CA 92697 USA
[2] Indiana Univ, Dept Psychol, Bloomington, IN 47405 USA
[3] Indiana Univ, Dept Cognit Sci, Bloomington, IN 47405 USA
[4] Univ Amsterdam, Dept Psychol, NL-1012 WX Amsterdam, Netherlands
关键词
Model selection; Model evaluation; Bayesian model selection; Minimum description length; Prequential analysis; Model mimicry; Hierarchical Bayesian modeling;
D O I
10.1080/03640210802414826
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
This article reviews current methods for evaluating models in the cognitive sciences, including theoretically based approaches, such as Bayes factors and minimum description length measures; simulation approaches, including model mimicry evaluations; and practical approaches, such as validation and generalization measures. This article argues that, although often useful in specific settings, most of these approaches are limited in their ability to give a general assessment of models. This article argues that hierarchical methods, generally, and hierarchical Bayesian methods, specifically, can provide a more thorough evaluation of models in the cognitive sciences. This article presents two worked examples of hierarchical Bayesian analyses to demonstrate how the approach addresses key questions of descriptive adequacy, parameter interference, prediction, and generalization in principled and coherent ways.
引用
收藏
页码:1248 / 1284
页数:37
相关论文
共 65 条
[1]   Comparison of Decision Learning Models Using the Generalization Criterion Method [J].
Ahn, Woo-Young ;
Busemeyer, Jerome R. ;
Wagenmakers, Eric-Jan ;
Stout, Julie C. .
COGNITIVE SCIENCE, 2008, 32 (08) :1376-1402
[2]  
[Anonymous], 2021, Bayesian Data Analysis
[3]  
[Anonymous], 2012, Probability Theory: The Logic Of Science
[4]   INSENSITIVITY TO FUTURE CONSEQUENCES FOLLOWING DAMAGE TO HUMAN PREFRONTAL CORTEX [J].
BECHARA, A ;
DAMASIO, AR ;
DAMASIO, H ;
ANDERSON, SW .
COGNITION, 1994, 50 (1-3) :7-15
[5]   A temporal ratio model of memory [J].
Brown, Gordon D. A. ;
Neath, Ian ;
Chater, Nick .
PSYCHOLOGICAL REVIEW, 2007, 114 (03) :539-576
[6]   Cross-validation methods [J].
Browne, MW .
JOURNAL OF MATHEMATICAL PSYCHOLOGY, 2000, 44 (01) :108-132
[7]   Model comparisons and model selections based on generalization criterion methodology [J].
Busemeyer, JR ;
Wang, YM .
JOURNAL OF MATHEMATICAL PSYCHOLOGY, 2000, 44 (01) :171-189
[8]  
CARLIN BP, 1995, J ROY STAT SOC B MET, V57, P473
[9]  
Chen M. H., 2000, MONTE CARLO METHODS
[10]   Marginal likelihood from the Gibbs output [J].
Chib, S .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1995, 90 (432) :1313-1321