Forecasting daily high ozone concentrations by classification trees

被引:9
作者
Bruno, F [1 ]
Cocchi, D [1 ]
Trivisano, C [1 ]
机构
[1] Univ Bologna, Dipartimento Sci Stat Paolo Fortunati, I-40126 Bologna, Italy
关键词
air pollution; classification trees; ozone exceedance; unbalanced classes;
D O I
10.1002/env.631
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
This article proposes the use of classification trees (CART) as a suitable technique for forecasting the daily exceedance of ozone standards established by Italian law. A model is formulated for predicting, 1 and 2 days beforehand, the most probable class of the maximum daily urban ozone concentration in the city of Bologna. The standard employed is the so-called 'warning level' (180 mug/m(3)). Meteorological forecasted variables are considered as predictors. Pollution data show a considerable discrepancy between the dimensions of the two classes of events. The first class includes those days when the observed maximum value exceeds the established standard, while the second class contains those when the observed maximum value does not exceed the said standard. Due to this peculiarity, model selection procedures using cross-validation usually lead to overpruning. We can overcome this drawback by means of techniques which replicate observations, through the modification of their inclusion probabilities in the cross-validation sets. Copyright (C) 2004 John Wiley Sons, Ltd.
引用
收藏
页码:141 / 153
页数:13
相关论文
共 18 条
[1]  
Bellman R, 1961, ADAPTIVE CONTROL PRO, DOI DOI 10.1515/9781400874668
[2]   SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation [J].
Blewitt, Marnie E. ;
Gendrel, Anne-Valerie ;
Pang, Zhenyi ;
Sparrow, Duncan B. ;
Whitelaw, Nadia ;
Craig, Jeffrey M. ;
Apedaile, Anwyn ;
Hilton, Douglas J. ;
Dunwoodie, Sally L. ;
Brockdorff, Neil ;
Kay, Graham F. ;
Whitelaw, Emma .
NATURE GENETICS, 2008, 40 (05) :663-669
[3]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[4]  
Breiman L, 1998, ANN STAT, V26, P801
[5]  
BRUNO F, 1999, ENV STAT EARTH SPACE
[6]  
Cardie C., 1997, Proceedings of the Fourteenth International Conference on Machine Learning, P57
[7]  
CHAN PK, 1999, MACH LEARN, V14, P1
[8]   Modeling the effects of meteorology on ozone in Houston using cluster analysis and generalized additive models [J].
Davis, JM ;
Eder, BK ;
Nychka, D ;
Yang, Q .
ATMOSPHERIC ENVIRONMENT, 1998, 32 (14-15) :2505-2520
[9]  
FAWCETT T, 1996, MACHINE LEARNING LIS, V8
[10]  
Huang LS, 1999, ENVIRONMETRICS, V10, P103, DOI 10.1002/(SICI)1099-095X(199901/02)10:1<103::AID-ENV341>3.3.CO