MULTIVARIATE EXPLORATORY DATA-ANALYSIS AND GRAPHICS - A TUTORIAL

被引:16
作者
WEIHS, C
机构
[1] Mathematical Applications, Information Services, CIBA-GEIGY Ltd, R-1008.Z2.22, Basel,CH-4002, Switzerland
关键词
STEM AND LEAF DISPLAY; HISTOGRAM; BOXPLOT; QUANTILE PLOT; SCATTERPLOT; REGRESSION; SMOOTHING; 3D ROTATION; SCATTERPLOT MATRIX; OMEGA STRATEGY; DIMENSION REDUCTION; STABILITY OF STRUCTURE; RESAMPLING; INTERPRETATION OF STRUCTURE; PREDICTION MODELS; VARIABLES SELECTION;
D O I
10.1002/cem.1180070502
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Exploratory data analysis (EDA) is a toolbox of data manipulation methods for looking at data to see what they seem to say, i.e. one tries to let the data speak for themselves. In this way there is hope that the data will lead to indications about 'models' of relationships not expected a priori. In this respect EDA is a pre-step to confirmatory data analysis which delivers measures of how adequate a model is. In this tutorial the focus is on multivariate exploratory data analysis for quantitative data using linear methods for dimension reduction and prediction. Purely graphical multivariate tools such as 3D rotation and scatterplot matrices are discussed after having introduced the univariate and bivariate tools on which they are based. The main tasks of multivariate exploratory data analysis are identified as 'search for structure' by dimension reduction and 'model selection' by comparing predictive power. Resampling is used to support validity, and variables selection to improve interpretability.
引用
收藏
页码:305 / 340
页数:36
相关论文
共 49 条
[1]   GRAPHS IN STATISTICAL-ANALYSIS [J].
ANSCOMBE, FJ .
AMERICAN STATISTICIAN, 1973, 27 (01) :17-21
[2]   THE GRAND TOUR - A TOOL FOR VIEWING MULTIDIMENSIONAL DATA [J].
ASIMOV, D .
SIAM JOURNAL ON SCIENTIFIC AND STATISTICAL COMPUTING, 1985, 6 (01) :128-143
[3]  
BECKER RA, 1988, DYNAMIC GRAPHICS STA, P201
[4]  
BUJA A, 1989, ANN STAT, V17, P453, DOI 10.1214/aos/1176347115
[5]  
BUJA A, 1988, DYNAMIC GRAPHICS STA, P277
[6]  
Chambers J. M., 1983, GRAPHICAL METHODS DA
[7]   GRAPHICAL PERCEPTION - THE VISUAL DECODING OF QUANTITATIVE INFORMATION ON GRAPHICAL DISPLAYS OF DATA [J].
CLEVELAND, WS ;
MCGILL, R .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 1987, 150 :192-229
[8]   ROBUST LOCALLY WEIGHTED REGRESSION AND SMOOTHING SCATTERPLOTS [J].
CLEVELAND, WS .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1979, 74 (368) :829-836
[9]  
Cleveland WS, 1994, ELEMENTS GRAPHING DA
[10]   PREDICTIVE ABILITY OF REGRESSION-MODELS .1. STANDARD-DEVIATION OF PREDICTION ERRORS (SDEP) [J].
CRUCIANI, G ;
BARONI, M ;
CLEMENTI, S ;
COSTANTINO, G ;
RIGANELLI, D ;
SKAGERBERG, B .
JOURNAL OF CHEMOMETRICS, 1992, 6 (06) :335-346