Using supervised principal components analysis to assess multiple pollutant effects

被引:42
作者
Roberts, Steven [1 ]
Martin, Michael A. [1 ]
机构
[1] Australian Natl Univ, Coll Business & Econ, Sch Finance & Appl Stat, Canberra, ACT 0200, Australia
关键词
air pollution; mortality; multiple pollutants; principal components analysis; time series;
D O I
10.1289/ehp.9226
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
BACKGROUND: Many investigations of the adverse health effects of multiple air pollutants analyze the time series involved by simultaneously entering the multiple pollutants into a Poisson log-line model. This method can yield unstable parameter estimates when the pollutants involved suffer high intercorrelation; therefore, traditional approaches to dealing with multicollinearity, such as principal component analysis (PCA), have been promoted in this context. OBJECTIVES: A characteristic of PCA is that its construction does not consider the relationship between the covariates and the adverse health outcomes. A refined version of PCA, supervised principal components analysis (SPCA), is proposed that specifically addresses this issue. METHODS: Models controlling for long-term trends and weather effects were used in conjunction with each SPCA and PCA to estimate the association between multiple air pollutants and mortality for U.S. cities. The methods were compared further via a simulation study. RESULTS: Simulation studies demonstrated that SPCA, unlike PCA, was successful in identifying the correct subset of multiple pollutants associated with mortality. Because of this property, SPCA and PCA returned different estimates for the relationship between air pollution and mortality. CONCLUSIONS: Although a number of methods for assessing the effects of multiple pollutants have been proposed, such methods can falter in the presence of high correlation among pollutants. Both PCA and SPCA address this issue. By allowing the exclusion of pollutants that are not associated with the adverse health outcomes from the mixture of pollutants selected, SPCA offers a critical improvement over PCA.
引用
收藏
页码:1877 / 1882
页数:6
相关论文
共 35 条
[1]   PLS regression methods [J].
Höskuldsson, Agnar .
Journal of Chemometrics, 1988, 2 (03) :211-228
[2]  
[Anonymous], 2006, R LANG ENV STAT COMP
[3]   Prediction by supervised principal components [J].
Bair, E ;
Hastie, T ;
Paul, D ;
Tibshirani, R .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2006, 101 (473) :119-137
[4]   Latent root regression analysis: an alternative method to PLS [J].
Bertrand, D ;
Qannari, E ;
Vigneau, E .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2001, 58 (02) :227-234
[5]   Association between particulate- and gas-phase components of urban air pollution and daily mortality in eight Canadian cities [J].
Burnett, RT ;
Brook, J ;
Dann, T ;
Delocla, C ;
Philips, O ;
Cakmak, S ;
Vincent, R ;
Goldberg, MS ;
Krewski, D .
INHALATION TOXICOLOGY, 2000, 12 :15-39
[6]   A study of the association between daily mortality and ambient air pollutant concentrations in Pittsburgh, Pennsylvania [J].
Chock, DP ;
Winkler, SL .
JOURNAL OF THE AIR & WASTE MANAGEMENT ASSOCIATION, 2000, 50 (08) :1481-1500
[7]   Effect of the fine fraction of particulate matter versus the coarse mass and other pollutants on daily mortality in Santiago, Chile [J].
Cifuentes, LA ;
Vega, J ;
Köpfer, K ;
Lava, LB .
JOURNAL OF THE AIR & WASTE MANAGEMENT ASSOCIATION, 2000, 50 (08) :1287-1298
[8]  
Cox LH, 2000, ENVIRONMETRICS, V11, P611, DOI 10.1002/1099-095X(200011/12)11:6<611::AID-ENV443>3.0.CO
[9]  
2-Y
[10]   Estimating particulate matter-mortality dose-response curves and threshold levels: An analysis of daily time-series for the 20 largest US cities [J].
Daniels, MJ ;
Dominici, F ;
Samet, JM ;
Zeger, SL .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2000, 152 (05) :397-406