ON PRINCIPAL HESSIAN DIRECTIONS FOR DATA VISUALIZATION AND DIMENSION REDUCTION - ANOTHER APPLICATION OF STEINS LEMMA

被引:482
作者
LI, KC
机构
[1] Mathematics at University of California, Los Angeles, Los Angeles, CA
关键词
PROJECTION PURSUIT; SLICED INVERSE REGRESSION; STATISTICAL GRAPHICS; STEINS LEMMA;
D O I
10.1080/01621459.1992.10476258
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Modem graphical tools have enhanced our ability to learn many things from data directly. With much user-friendly graphical software available, we are encouraged to plot a lot more often than before. The benefits from direct interaction with graphics have been enormous. But trailing behind these high-tech advances is the issue of appropriate guidance on what to plot. There are too many directions to project a high-dimensional data set and unguided plotting can be time-consuming and fruitless. In a recent article, Li set up a statistical framework for study on this issue, based on a notion of effective dimension reduction (edr) directions. They are the directions to project a high dimensional input variable for the purpose of effectively viewing and studying its relationship with an output variable. A methodology, sliced inverse regression, was introduced and shown to be useful in finding edr directions. This article introduces another method for finding edr directions. It begins with the observation that the eigenvectors for the Hessian matrices of the regression function are helpful in the study of the shape of the regression surface. A notation of principal Hessian directions (pHd's) is defined that locates the main axes along which the regression surface shows the largest curvatures in an aggregate sense. We show that pHd's can be used to find edr directions. We further use the celebrated Stein lemma for suggesting estimates. The sampling properties of the estimated pHd's are obtained. A significance test is derived for suggesting the genuineness of a view found by our method. Some versions for implementing this method are discussed, and simulation results and an application to real data are reported. The relationship of this method with exploratory projection pursuit is also discussed.
引用
收藏
页码:1025 / 1039
页数:15
相关论文
共 46 条
[1]  
[Anonymous], 1966, PERTURBATION THEORY
[2]  
Box G.E.P., 1987, EMPIRICAL MODEL BUIL
[4]  
BREIMAN L, 1985, J AM STAT ASSOC, V80, P580, DOI 10.2307/2288473
[5]  
Breiman L, 2017, CLASSIFICATION REGRE, P368, DOI 10.1201/9781315139470
[6]  
Brillinger D. R., 1983, WADSWORTH STATIST PR, P97
[7]  
BRILLINGER DR, 1977, BIOMETRIKA, V64, P509, DOI 10.1093/biomet/64.3.509
[8]  
BRILLINGER DR, 1991, J AM STAT ASSOC, V86, P333, DOI 10.1080/01621459.1991.10475038
[9]  
CHEN H, 1991, ANN STAT, V19, P142
[10]  
CLEVELAND WS, 1988, COLECTED WORKS JW TU, V5