BAYESIAN PREDICTION OF DETERMINISTIC FUNCTIONS, WITH APPLICATIONS TO THE DESIGN AND ANALYSIS OF COMPUTER EXPERIMENTS

被引:496
作者
CURRIN, C
MITCHELL, T
MORRIS, M
YLVISAKER, D
机构
[1] OAK RIDGE NATL LAB, DIV ENGN PHYS & MATH, MATH SCI SECT, OAK RIDGE, TN 37831 USA
[2] UNIV CALIF LOS ANGELES, DEPT MATH, LOS ANGELES, CA 90024 USA
[3] BRYN MAWR COLL, BRYN MAWR, PA 19010 USA
关键词
COMPUTER MODELS; CORRELATION FUNCTION; CROSS-VALIDATION; ENTROPY; EXPERIMENTAL DESIGN; INTERPOLATION; KRIGING; OPTIMAL DESIGN; SPLINE FITTING; STOCHASTIC PROCESSES;
D O I
10.2307/2290511
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This article is concerned with prediction of a function y(t) over a (multidimensional) domain T, given the function values at a set of "sites" {t(1), t(2),...,t(n)} in T, and with the design, that is, with the selection of those sites. The motivating application is the design and analysis of computer experiments, where t determines the input to a computer model of a physical or behavioral system, and y(t) is a response that is part of the output or is calculated from it. Following a Bayesian formulation, prior uncertainty about the function y is expressed by means of a random function Y, which is taken here to be a Gaussian stochastic process. The mean of the posterior process can be used as the prediction function y triple-over-dot (t), and the variance can be used as a measure of uncertainty. This kind of approach has been used previously in Bayesian interpolation and is strongly related to the kriging methods used in geostatistics. Here emphasis is placed on product linear and product cubic correlation functions, which yield prediction functions that are, respectively, linear or cubic splines in every dimension. A posterior entropy criterion is adopted for design; this minimizes the expected uncertainty about the posterior process, as measured by the entropy. A computational algorithm for finding entropy-optimal designs on multidimensional grids is described. Several examples are presented, including a two-dimensional experiment on a computer model of a thermal energy storage device and a six-dimensional experiment on an integrated circuit simulator. Predictions are made using several different families of correlation functions, with parameters chosen to maximize the likelihood. For comparison, predictions are also made via least squares fitting of various polynomial and spline models. The Bayesian design/prediction methods, which do not require any modeling of y, produce comparatively good predictions. For some correlation functions, however, the 95% posterior probability intervals do not give adequate coverage of the true values of y at selected test sites. These methods are fairly simple and offer considerable potential for virtually automatic implementation, although further development is needed before they can be applied routinely in practice.
引用
收藏
页码:953 / 963
页数:11
相关论文
共 48 条
[1]   BAYESIAN APPROACH TO MODEL INADEQUACY FOR POLYNOMIAL REGRESSION [J].
BLIGHT, BJN ;
OTT, L .
BIOMETRIKA, 1975, 62 (01) :79-88
[2]  
BORTH DM, 1975, J ROY STAT SOC B MET, V37, P77
[3]   DISCRIMINATION AMONG MECHANISTIC MODELS [J].
BOX, GEP ;
HILL, WJ .
TECHNOMETRICS, 1967, 9 (01) :57-+
[4]   OPTIMAL BAYESIAN EXPERIMENTAL-DESIGN FOR LINEAR-MODELS [J].
CHALONER, K .
ANNALS OF STATISTICS, 1984, 12 (01) :283-300
[5]  
CHEN ZH, 1989, ANN STAT, V17, P515, DOI 10.1214/aos/1176347117
[6]  
Currin C., 1988, TECHNICAL REPORT
[7]  
DAVIS JC, 1986, STATISTICS DATA ANAL
[8]   PREDICTIVE APPROACH TO MODEL SELECTION [J].
GEISSER, S ;
EDDY, WF .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1979, 74 (365) :153-160
[9]  
GEISSER S, 1979, J AM STAT ASSOC, V75, P765
[10]   STOCHASTIC RELAXATION, GIBBS DISTRIBUTIONS, AND THE BAYESIAN RESTORATION OF IMAGES [J].
GEMAN, S ;
GEMAN, D .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1984, 6 (06) :721-741