Neural network uncertainty assessment using Bayesian statistics: A remote sensing application

被引:31
作者
Aires, F [1 ]
Prigent, C
Rossow, WB
机构
[1] Columbia Univ, Dept Appl Phys & Appl Math, NASA, Goddard Inst Space Studies, New York, NY 10025 USA
[2] Ecole Polytech, CNRS IPSL, Lab Meteorol Dynam, F-91128 Palaiseau, France
[3] Observ Paris, CNRS, LERMA, F-75014 Paris, France
[4] NASA, Goddard Inst Space Studies, New York, NY 10025 USA
关键词
D O I
10.1162/0899766041941925
中图分类号
TP18 [人工智能理论];
学科分类号
081104 [模式识别与智能系统]; 0812 [计算机科学与技术]; 0835 [软件工程]; 1405 [智能科学与技术];
摘要
Neural network (NN) techniques have proved successful for many regression problems, in particular for remote sensing; however, uncertainty estimates are rarely provided. In this article, a Bayesian technique to evaluate uncertainties of the NN parameters (i.e., synaptic weights) is first presented. In contrast to more traditional approaches based on point estimation of the NN weights, we assess uncertainties on such estimates to monitor the robustness of the NN model. These theoretical developments are illustrated by applying them to the problem of retrieving surface skin temperature, microwave surface emissivities, and integrated water vapor content from a combined analysis of satellite microwave and infrared observations over land. The weight uncertainty estimates are then used to compute analytically the uncertainties in the network outputs (i.e., error bars and correlation structure of these errors). Such quantities are very important for evaluating any application of an NN model. The uncertainties on the NN Jacobians are then considered in the third part of this article. Used for regression fitting, NN models can be used effectively to represent highly nonlinear, multivariate functions. In this situation, most emphasis is put on estimating the output errors, but almost no attention has been given to errors associated with the internal structure of the regression model. The complex structure of dependency inside the NN is the essence of the model, and assessing its quality, coherency, and physical character makes all the difference between a blackbox model with small output errors and a reliable, robust, and physically coherent model. Such dependency structures are described to the first order by the NN Jacobians: they indicate the sensitivity of one output with respect to the inputs of the model for given input data. We use a Monte Carlo integration procedure to estimate the robustness of the NN Jacobians. A regularization strategy based on principal component analysis is proposed to suppress the multicollinearities in order to make these Jacobians robust and physically meaningful.
引用
收藏
页码:2415 / 2458
页数:44
相关论文
共 35 条
[1]
Remote sensing from the infrared atmospheric sounding interferometer instrument -: 1.: Compression, denoising, and first-guess retrieval algorithms -: art. no. 4619 [J].
Aires, F ;
Rossow, WB ;
Scott, NA ;
Chédin, A .
JOURNAL OF GEOPHYSICAL RESEARCH-ATMOSPHERES, 2002, 107 (D22)
[2]
Remote sensing from the infrared atmospheric sounding interferometer instrument -: 2.: Simultaneous retrieval of temperature, water vapor, and ozone atmospheric profiles -: art. no. 4620 [J].
Aires, F ;
Rossow, WB ;
Scott, NA ;
Chédin, A .
JOURNAL OF GEOPHYSICAL RESEARCH-ATMOSPHERES, 2002, 107 (D22)
[3]
Inferring instantaneous, multivariate and nonlinear sensitivities for the analysis of feedback processes in a dynamical system: Lorenz model case-study [J].
Aires, F ;
Rossow, WB .
QUARTERLY JOURNAL OF THE ROYAL METEOROLOGICAL SOCIETY, 2003, 129 (587) :239-275
[4]
The "weight smoothing" regularization of MLP for Jacobian stabilization [J].
Aires, F ;
Schmitt, M ;
Chedin, A ;
Scott, N .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1999, 10 (06) :1502-1510
[5]
A new neural network approach including first guess for retrieval of atmospheric water vapor, cloud liquid water path, surface temperature, and emissivities over land from satellite microwave observations [J].
Aires, F ;
Prigent, C ;
Rossow, WB ;
Rothstein, M .
JOURNAL OF GEOPHYSICAL RESEARCH-ATMOSPHERES, 2001, 106 (D14) :14887-14907
[6]
[Anonymous], 2005, NEURAL NETWORKS PATT
[7]
BADEA V, 1991, PROBLEMES INCORRECTE
[8]
Bates D. M., 1988, NONLINEAR REGRESSION
[9]
STATISTICAL APPLICATIONS OF A METRIC ON SUBSPACES TO SATELLITE METEOROLOGY [J].
CRONE, LJ ;
CROSBY, DS .
TECHNOMETRICS, 1995, 37 (03) :324-328
[10]
Gelman A, 2013, BAYESIAN DATA ANAL, DOI DOI 10.1201/9780429258411