Calibration of multivariate scatter plots for exploratory analysis of relations within and between sets of variables in genomic research

Cited by: 39
Authors
Graffelman, J
van Eeuwijk, F
Affiliations
[1] Univ Politecn Cataluna, Dept Stat & Operat Res, Barcelona 08028, Spain
[2] Univ Wageningen & Res Ctr, Dept Plant Sci, Lab Plant Breeding, NL-6700 AJ Wageningen, Netherlands
[3] Ctr BioSyst Genom, NL-6700 AB Wageningen, Netherlands
Keywords
biplot; calibration; goodness-of-fit; goodness-of-scale; principal component analysis; correspondence analysis; canonical correlation analysis; redundancy analysis
DOI
10.1002/bimj.200510177
Chinese Library Classification number
Q [Biological Sciences]
Subject classification codes
07; 0710; 09
Abstract
The scatter plot is a well-known and easily applicable graphical tool to explore relationships between two quantitative variables. For the exploration of relations between multiple variables, generalisations of the scatter plot are useful. We present an overview of multivariate scatter plots focussing on the following situations. Firstly, we look at a scatter plot for portraying relations between quantitative variables within one data matrix. Secondly, we discuss a similar plot for the case of qualitative variables. Thirdly, we describe scatter plots for the relationships between two sets of variables, where we focus on correlations. Finally, we treat plots of the relationships between multiple response and predictor variables, focussing on the matrix of regression coefficients. We present both known and new results; an important original contribution is a procedure for the inclusion of scales for the variables in multivariate scatter plots. We provide software for drawing such scales. We illustrate the construction and interpretation of the plots by means of examples on data collected in a genomic research program on taste in tomato.
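As a rough illustration of what "including scales for the variables" can mean in the simplest case, the sketch below places tick marks along one variable's axis in a principal component biplot. It is not the authors' software: the function name `calibrate_biplot_axis`, the toy data, and the choice of a plain SVD-based biplot of column-centred data are all assumptions made for this example. It relies only on the standard biplot property that a data value is approximated by the inner product of a row marker and a column marker, so a value mu sits at the point (mu / g'g) g along the variable vector g.

```python
import numpy as np

def calibrate_biplot_axis(g, tick_values):
    """Return 2-D positions of tick marks along one biplot variable axis.

    g           : biplot vector of the variable (length-2 array)
    tick_values : values (in centred units) to mark along the axis
    (Hypothetical helper for illustration, not the paper's code.)
    """
    g = np.asarray(g, dtype=float)
    ticks = np.asarray(tick_values, dtype=float)
    # A point lam * g on the axis has inner product lam * (g . g) with g,
    # so the value mu is represented at lam = mu / (g . g).
    return np.outer(ticks / (g @ g), g)

# Toy data, made up for illustration: 6 samples, 3 variables.
rng = np.random.default_rng(0)
X = rng.normal(size=(6, 3))
Xc = X - X.mean(axis=0)                  # column-centre the data

# Rank-2 biplot from the singular value decomposition.
U, s, Vt = np.linalg.svd(Xc, full_matrices=False)
F = U[:, :2] * s[:2]                     # row markers (samples)
G = Vt[:2].T                             # column markers (variables)

ticks = np.array([-1.0, 0.0, 1.0])
M = calibrate_biplot_axis(G[0], ticks)   # tick positions for variable 0

# Projecting each tick position onto g recovers the tick value.
recovered = M @ G[0]
```

Each row of `M` is a point in the plane of the plot where a tick mark and its label could be drawn; a sample whose marker projects perpendicularly onto the axis at that tick has the corresponding approximated value. Ticks in the original (uncentred) units would additionally require subtracting the variable's mean from each tick value before calling the function.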
Pages: 863-879
Number of pages: 17