Exposure assessment models for elemental components of particulate matter in an urban environment: A comparison of regression and random forest approaches

被引:190
作者
Brokamp, Cole [1 ,2 ]
Jandarov, Roman [2 ]
Rao, M. B. [2 ]
LeMasters, Grace [2 ,3 ]
Ryan, Patrick [1 ,2 ]
机构
[1] Cincinnati Childrens Hosp Med Ctr, Div Biostat & Epidemiol, Cincinnati, OH 45229 USA
[2] Univ Cincinnati, Dept Environm Hlth, Cincinnati, OH USA
[3] Cincinnati Childrens Hosp Med Ctr, Div Asthma Res, Cincinnati, OH 45229 USA
关键词
Elemental PM2.5; Land use regression; Random forest; LAND-USE REGRESSION; AIR-POLLUTION; CLASSIFICATION; PREDICTION; ASSOCIATION; VARIABILITY; ALGORITHM; SELECTION; DISTANCE; INFANTS;
D O I
10.1016/j.atmosenv.2016.11.066
中图分类号
X [环境科学、安全科学];
学科分类号
083001 [环境科学];
摘要
Exposure assessment for elemental components of particulate matter (PM) using land use modeling is a complex problem due to the high spatial and temporal variations in pollutant concentrations at the local scale. Land use regression (LUR) models may fail to capture complex interactions and non-linear relationships between pollutant concentrations and land use variables. The increasing availability of big spatial data and machine learning methods present an opportunity for improvement in PM exposure assessment models. In this manuscript, our objective was to develop a novel land use random forest (LURF) model and compare its accuracy and precision to a LUR model for elemental components of PM in the urban city of Cincinnati, Ohio. PM smaller than 2.5 mu m (PM2.5) and eleven elemental components were measured at 24 sampling stations from the Cincinnati Childhood Allergy and Air Pollution Study (CCAAPS). Over 50 different predictors associated with transportation, physical features, community socioeconomic characteristics, greenspace, land cover, and emission point sources were used to construct LUR and LURF models. Cross validation was used to quantify and compare model performance. LURF and LUR models were created for aluminum (Al), copper (Cu), iron (Fe), potassium (K), manganese (Mn), nickel (Ni), lead (Pb), sulfur (S), silicon (Si), vanadium (V), zinc (Zn), and total PM2.5 in the CCAAPS study area. LURF utilized a more diverse and greater number of predictors than LUR and LURF models for Al, K, Mn, Pb, Si, Zn, TRAP, and PM2.5 all showed a decrease in fractional predictive error of at least 5% compared to their LUR models. LURF models for Al, Cu, Fe, K, Mn, Pb, Si, Zn, TRAP, and PM2.5 all had a cross validated fractional predictive error less than 30%. Furthermore, LUR models showed a differential exposure assessment bias and had a higher prediction error variance. Random forest and other machine learning methods may provide more accurate exposure assessment. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1 / 11
页数:11
相关论文
共 51 条
[1]
Estimation of outdoor NOx, NO2, and BTEX exposure in a cohort of pregnant women using land use regression modeling [J].
Aguilera, Inmaculada ;
Sunyer, Jordi ;
Fernandez-Patier, Rosalia ;
Hoek, Gerard ;
Aguirre-Alfaro, Amelia ;
Meliefste, Kees ;
Bomboi-Mingarro, M. Teresa ;
Nieuwenhuijsen, Mark J. ;
Herce-Garraleta, Dolores ;
Brunekreef, Bert .
ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2008, 42 (03) :815-821
[2]
Alvarez S, 2005, CLIN CANCER RES, V11, P1146
[3]
[Anonymous], 2013, INTRO STAT LEARNING
[4]
[Anonymous], 2014, rgeos: Interface to Geometry Engine-Open Source (GEOS). R package version 0.3
[5]
Application of the deletion/substitution/addition algorithm to selecting land use regression models for interpolating air pollution measurements in California [J].
Beckerman, Bernardo S. ;
Jerrett, Michael ;
Martin, Randall V. ;
van Donkelaar, Aaron ;
Ross, Zev ;
Burnett, Richard T. .
ATMOSPHERIC ENVIRONMENT, 2013, 77 :172-177
[6]
Natural-Cause Mortality and Long-Term Exposure to Particle Components: An Analysis of 19 European Cohorts within the Multi-Center ESCAPE Project [J].
Beelen, Rob ;
Hoek, Gerard ;
Raaschou-Nielsen, Ole ;
Stafoggia, Massimo ;
Andersen, Zorana Jovanovic ;
Weinmayr, Gudrun ;
Hoffmann, Barbara ;
Wolf, Kathrin ;
Samoli, Evangelia ;
Fischer, Paul H. ;
Nieuwenhuijsen, Mark J. ;
Xun, Wei W. ;
Katsouyanni, Klea ;
Dimakopoulou, Konstantina ;
Marcon, Alessandro ;
Vartiainen, Erkki ;
Lanki, Timo ;
Yli-Tuomi, Tarja ;
Oftedal, Bente ;
Schwarze, Per E. ;
Nafstad, Per ;
De Faire, Ulf ;
Pedersen, Nancy L. ;
Otstenson, Claes-Goran ;
Fratiglioni, Laura ;
Penell, Johanna ;
Korek, Michal ;
Pershagen, Goran ;
Eriksen, Kirsten Thorup ;
Overvad, Kim ;
Sorensen, Mette ;
Eeftens, Marloes ;
Peeters, Petra H. ;
Meliefste, Kees ;
Wang, Meng ;
Bueno-de-Mesquita, H. Bas ;
Sugiri, Dorothea ;
Kramer, Ursula ;
Heinrich, Joachim ;
de Hoogh, Kees ;
Key, Timothy ;
Peters, Annette ;
Hampel, Regina ;
Concin, Hans ;
Nagel, Gabriele ;
Jaensch, Andrea ;
Ineichen, Alex ;
Tsai, Ming-Yi ;
Schaffner, Emmanuel ;
Probst-Hensch, Nicole M. .
ENVIRONMENTAL HEALTH PERSPECTIVES, 2015, 123 (06) :525-533
[7]
Comparison of the performances of land use regression modelling and dispersion modelling in estimating small-scale variations in long-term air pollution concentrations in a Dutch urban area [J].
Beelen, Rob ;
Voogt, Marita ;
Duyzer, Jan ;
Zandveld, Peter ;
Hoek, Gerard .
ATMOSPHERIC ENVIRONMENT, 2010, 44 (36) :4614-4621
[8]
Spatial and temporal variation in PM2.5 chemical composition in the United States for health effects studies [J].
Bell, Michelle L. ;
Dominici, Francesca ;
Ebisu, Keita ;
Zeger, Scott L. ;
Samet, Jonathan M. .
ENVIRONMENTAL HEALTH PERSPECTIVES, 2007, 115 (07) :989-995
[9]
Bivand K., 2005, CLASSES METHODS SPAT
[10]
Bivand R., 2014, RGDAL BINDINGS GEOSP