On the optimality of conditional expectation as a Bregman predictor

被引:155
作者
Banerjee, A [1 ]
Guo, X
Wang, H
机构
[1] Univ Texas, Dept Elect & Comp Engn, Austin, TX 78712 USA
[2] Cornell Univ, Sch Operat Res & Ind Engn, Ithaca, NY 14583 USA
[3] Brown Univ, Div Appl Math, Providence, RI 02912 USA
关键词
Bregman loss functions (BLFs); conditional expectation; prediction;
D O I
10.1109/TIT.2005.850145
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the problem of predicting a random variable X from observations, denoted by a random variable Z. It is well known that the conditional expectation E[X vertical bar Z] is the optimal L-2 predictor (also known as "the least-mean-square error" predictor) of X, among all (Borel measurable) functions of Z. In this correspondence, we provide necessary and sufficient conditions for the general loss functions under which the conditional expectation is the unique optimal predictor. We show that E[X vertical bar Z] is the optimal predictor for all Bregman loss functions (BLFs), of which the L-2 loss function is a special case. Moreover, under mild conditions, we show that the BLFs are exhaustive, i.e., if for every random variable X, the infimum of E[F(X, y) over all constants y is attained by the expectation E[X], then F is a BL.
引用
收藏
页码:2664 / 2669
页数:6
相关论文
共 20 条
[1]  
[Anonymous], 1995, ADV CALCULUS SEVERAL
[2]  
ATHREYA KB, 1999, 992 IOW STAT U DEP M
[3]  
Banerjee A, 2004, SIAM PROC S, P234
[4]   ENTROPIC MEANS [J].
BENTAL, A ;
CHARNES, A ;
TEBOULLE, M .
JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 1989, 139 (02) :537-551
[5]  
Bregman LM, 1967, USSR Computational Mathematics and Mathematical Physics, V7, P200
[6]  
Censor Y., 1998, Parallel optimization: Theory, Algorithms, and Applications
[7]  
Cover T. M., 2005, ELEM INF THEORY, DOI 10.1002/047174882X
[9]   GENERALIZED PROJECTIONS FOR NONNEGATIVE FUNCTIONS [J].
CSISZAR, I .
ACTA MATHEMATICA HUNGARICA, 1995, 68 (1-2) :161-186
[10]  
Csiszar I., 1967, STUD SCI MATH HUNG, V2, P299