Field Significance of Regression Patterns

Cited by: 29
Authors
DelSole, Timothy [1]
Yang, Xiaosong
Affiliations
[1] Ctr Ocean Land Atmosphere Studies, Calverton, MD 20705 USA
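Funding
National Aeronautics and Space Administration (NASA); National Oceanic and Atmospheric Administration (NOAA); National Science Foundation (NSF)
Keywords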
CANONICAL CORRELATION; SUBSET REGRESSION; REANALYSIS; MODEL;
DOI
10.1175/2011JCLI4105.1
Chinese Library Classification
P4 [Atmospheric Sciences (Meteorology)];
Discipline classification codes
0706 ; 070601 ;
Abstract
Regression patterns are often used to diagnose the relation between a field and a climate index, but a significance test for the pattern "as a whole" that accounts for the multiplicity and interdependence of the tests has not been widely available. This paper argues that field significance can be framed as a test of the hypothesis that all regression coefficients vanish in a suitable multivariate regression model. A test for this hypothesis can be derived from the generalized likelihood ratio test. The resulting statistic depends on relevant covariance matrices and accounts for the multiplicity and interdependence of the tests. It also depends only on the canonical correlations between the predictors and predictands, thereby revealing a fundamental connection to canonical correlation analysis. Remarkably, the test statistic is invariant to a reversal of the predictors and predictands, allowing the field significance test to be reduced to a standard univariate hypothesis test. In practice, the test cannot be applied when the number of coefficients exceeds the sample size, reflecting the fact that testing more hypotheses than data is ill-conceived. To formulate a proper significance test, the data are represented by a small number of principal components, with the number chosen based on cross-validation experiments. However, instead of selecting the model that minimizes the cross-validated mean square error, a confidence interval for the cross-validated error is estimated and the most parsimonious model whose error is within the confidence interval of the minimum error is chosen. This procedure avoids selecting complex models whose error is close to that of much simpler models. The procedure is applied to diagnose long-term trends in annual average sea surface temperature and boreal winter 300-hPa zonal wind. In both cases a statistically significant 50-yr trend pattern is extracted. The resulting spatial filter can be used to monitor the evolution of the regression pattern without temporal filtering.
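The model-selection rule described in the abstract (choose the most parsimonious model whose cross-validated error lies within the confidence interval of the minimum error) can be illustrated with a minimal sketch. The abstract does not say how the confidence interval is constructed, so the sketch assumes it is the minimum error plus a multiple of that model's standard error. The function name `select_parsimonious`, the `width` parameter, and the numbers in the example are hypothetical and are not taken from the paper.

```python
def select_parsimonious(n_components, cv_errors, std_errors, width=1.0):
    """Pick the simplest model whose CV error is within the interval of the minimum.

    n_components : candidate numbers of principal components, simplest first
    cv_errors    : cross-validated mean square error of each candidate
    std_errors   : estimated standard error of each cross-validated error
    width        : interval half-width in standard errors (an assumption;
                   the abstract does not specify the interval)
    """
    if not cv_errors:
        raise ValueError("at least one candidate model is required")
    if not (len(n_components) == len(cv_errors) == len(std_errors)):
        raise ValueError("inputs must have the same length")

    # Locate the model with the minimum cross-validated error.
    best = min(range(len(cv_errors)), key=lambda i: cv_errors[i])

    # Upper end of the assumed confidence interval around the minimum error.
    threshold = cv_errors[best] + width * std_errors[best]

    # Candidates are ordered simplest first, so the first one under the
    # threshold is the most parsimonious acceptable model.
    for k, err in zip(n_components, cv_errors):
        if err <= threshold:
            return k
    return n_components[best]  # unreachable in practice; kept for safety


# Illustrative (made-up) cross-validation results for 1 to 6 components.
pcs = [1, 2, 3, 4, 5, 6]
errors = [1.00, 0.80, 0.62, 0.60, 0.58, 0.61]
ses = [0.05, 0.05, 0.05, 0.05, 0.05, 0.05]
chosen = select_parsimonious(pcs, errors, ses)
print(chosen)
```

In this made-up example the minimum error occurs at 5 components, but the 3-component model is chosen because its error falls inside the assumed interval around that minimum. Setting `width=0.0` recovers plain minimization of the cross-validated error.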
Pages: 5094-5107
Number of pages: 14