Selection of key ambient particulate variables for epidemiological studies - Applying cluster and heatmap analyses as tools for data reduction

被引:36
作者
Gu, Jianwei [1 ,2 ]
Pitz, Mike [1 ,2 ]
Breitner, Susanne [1 ]
Birmili, Wolfram [3 ]
von Klot, Stephanie [1 ]
Schneider, Alexandra [1 ]
Soentgen, Jens [2 ]
Reller, Armin [2 ]
Peters, Annette [1 ]
Cyrys, Josef [1 ,2 ]
机构
[1] German Res Ctr Environm Hlth, Helmholtz Zentrum Munchen, Inst Epidemiol 2, D-86754 Neuherberg, Germany
[2] Univ Augsburg, Environm Sci Ctr WZU, D-86159 Augsburg, Germany
[3] Leibniz Inst Tropospher Res, D-04318 Leipzig, Germany
基金
美国国家环境保护局;
关键词
Cluster analysis; Heatmap analysis; Particle size distribution; Positive matrix factorization; Data reduction; Epidemiological study; PARTICLE-SIZE-DISTRIBUTION; AIR-POLLUTION; SURFACE-AREA; ULTRAFINE PARTICLES; HEART-DISEASE; EAST-GERMANY; URBAN AIR; NUMBER; HEALTH; DISTRIBUTIONS;
D O I
10.1016/j.scitotenv.2012.07.040
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The success of epidemiological studies depends on the use of appropriate exposure variables. The purpose of this study is to extract a relatively small selection of variables characterizing ambient particulate matter from a large measurement data set. The original data set comprised a total of 96 particulate matter variables that have been continuously measured since 2004 at an urban background aerosol monitoring site in the city of Augsburg, Germany. Many of the original variables were derived from measured particle size distribution (PSD) across the particle diameter range 3 nm to 10 mu m, including size-segregated particle number concentration, particle length concentration, particle surface concentration and particle mass concentration. The data set was complemented by integral aerosol variables. These variables were measured by independent instruments, including black carbon, sulfate, particle active surface concentration and particle length concentration. It is obvious that such a large number of measured variables cannot be used in health effect analyses simultaneously. The aim of this study is a pre-screening and a selection of the key variables that will be used as input in forthcoming epidemiological studies. In this study, we present two methods of parameter selection and apply them to data from a two-year period from 2007 to 2008. We used the agglomerative hierarchical cluster method to find groups of similar variables. In total, we selected 15 key variables from 9 clusters which are recommended for epidemiological analyses. We also applied a two-dimensional visualization technique called "heatmap" analysis to the Spearman correlation matrix. 12 key variables were selected using this method. Moreover, the positive matrix factorization (PMF) method was applied to the PSD data to characterize the possible particle sources. Correlations between the variables and PMF factors were used to interpret the meaning of the cluster and the heatmap analyses. (C) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:541 / 550
页数:10
相关论文
共 65 条
  • [1] [Anonymous], INTRO R NOTES R PROG
  • [2] [Anonymous], RES REP HLTH EFF I R
  • [3] [Anonymous], J STAT SOFTW
  • [4] [Anonymous], 2006, Air Quality Guidelines: Global Update 2005: Particulate Matter, Ozone, Nitrogen Dioxide, and Sulfur Dioxide
  • [5] [Anonymous], RES REP HLTH EFF I R
  • [6] [Anonymous], [No title captured]
  • [7] [Anonymous], 2008, Document 32008L0050, DOI DOI 10.2766/14352
  • [8] Particle number size distributions in urban air before and after volatilisation
    Birmili, W.
    Heinke, K.
    Pitz, M.
    Matschullat, J.
    Wiedensohler, A.
    Cyrys, J.
    Wichmann, H. -E.
    Peters, A.
    [J]. ATMOSPHERIC CHEMISTRY AND PHYSICS, 2010, 10 (10) : 4643 - 4660
  • [9] Birmili W, 2009, GEFAHRST REINHALT L, V69, P137
  • [10] Particulate Matter Air Pollution and Cardiovascular Disease An Update to the Scientific Statement From the American Heart Association
    Brook, Robert D.
    Rajagopalan, Sanjay
    Pope, C. Arden, III
    Brook, Jeffrey R.
    Bhatnagar, Aruni
    Diez-Roux, Ana V.
    Holguin, Fernando
    Hong, Yuling
    Luepker, Russell V.
    Mittleman, Murray A.
    Peters, Annette
    Siscovick, David
    Smith, Sidney C., Jr.
    Whitsel, Laurie
    Kaufman, Joel D.
    [J]. CIRCULATION, 2010, 121 (21) : 2331 - 2378