Probabilistic Forecasts of Mesoscale Convective System Initiation Using the Random Forest Data Mining Technique

Cited by: 68
Authors
Ahijevych, David [1]
Pinto, James O. [1]
Williams, John K. [2]
Steiner, Matthias [1]
Affiliations
[1] Natl Ctr Atmospher Res, POB 3000, Boulder, CO 80307 USA
[2] Weather Co, Andover, MA USA
Funding
U.S. National Science Foundation
Keywords
PRECIPITATION FORECASTS; CLASSIFICATION; PREDICTION; PERFORMANCE; MODELS; CYCLE;
DOI
10.1175/WAF-D-15-0113.1
Chinese Library Classification number
P4 [Atmospheric Sciences (Meteorology)]
Subject classification codes
0706; 070601
Abstract
A data mining and statistical learning method known as a random forest (RF) is employed to generate 2-h forecasts of the likelihood for initiation of mesoscale convective systems (MCS-I). The RF technique uses an ensemble of decision trees to relate a set of predictors [in this case radar reflectivity, satellite imagery, and numerical weather prediction (NWP) model diagnostics] to a predictand (in this case MCS-I). The RF showed a remarkable ability to detect MCS-I events. Over 99% of the 550 observed MCS-I events were detected to within 50 km. However, this high detection rate came with a tendency to issue false alarms either because of premature warning of an MCS-I event or in the continued elevation of RF forecast likelihoods well after an MCS-I event occurred. The skill of the RF forecasts was found to increase with the number of trees and the fraction of positive events used in the training set. The skill of the RF was also highly dependent on the types of predictor fields included in the training set and was notably better when a more recent training period was used. The RF offers advantages over high-resolution NWP because it can be run in a fraction of the time and can account for nonlinearly varying biases in the model data. In addition, as part of the training process, the RF ranks the importance of each predictor, which can be used to assess the utility of new datasets in the prediction of MCS-I.
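The random forest idea described in the abstract (an ensemble of decision trees trained on resampled data, whose averaged output is a forecast likelihood, plus a by-product ranking of predictor importance) can be illustrated with a toy sketch. This is not the authors' implementation: it uses one-split trees (stumps) instead of full decision trees, synthetic data, and hypothetical predictor names (`refl`, `nwp`, `noise`) chosen only to mirror the kinds of inputs the abstract mentions. All function names and parameters are assumptions made for illustration.

```python
import random

def gini(labels):
    """Gini impurity of a list of 0/1 labels."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2.0 * p * (1.0 - p)

def fit_stump(X, y, feature_ids):
    """Best single split among the candidate features, by Gini decrease."""
    n, parent, best = len(y), gini(y), None
    for f in feature_ids:
        values = sorted(set(row[f] for row in X))
        for lo, hi in zip(values, values[1:]):
            t = 0.5 * (lo + hi)  # threshold midway between neighbouring values
            left = [yi for row, yi in zip(X, y) if row[f] <= t]
            right = [yi for row, yi in zip(X, y) if row[f] > t]
            gain = parent - (len(left) * gini(left) + len(right) * gini(right)) / n
            if best is None or gain > best[0]:
                best = (gain, f, t, sum(left) / len(left), sum(right) / len(right))
    return best  # None if no feature offers a split

def fit_forest(X, y, n_trees=25, n_try=2, seed=0):
    """Train stumps on bootstrap samples, each seeing a random feature subset."""
    rng = random.Random(seed)
    n, n_feat = len(y), len(X[0])
    trees, importance = [], [0.0] * n_feat
    for _ in range(n_trees):
        idx = [rng.randrange(n) for _ in range(n)]  # bootstrap: sample with replacement
        Xb, yb = [X[i] for i in idx], [y[i] for i in idx]
        stump = fit_stump(Xb, yb, rng.sample(range(n_feat), n_try))
        if stump is None:
            continue
        gain, f, t, p_left, p_right = stump
        trees.append((f, t, p_left, p_right))
        importance[f] += gain  # accumulate impurity decrease per predictor
    return trees, importance

def predict_proba(trees, row):
    """Average the per-tree event frequencies to get a forecast likelihood."""
    return sum(pl if row[f] <= t else pr for f, t, pl, pr in trees) / len(trees)

def make_data(n, seed):
    """Synthetic cases: the event depends on refl and nwp, never on noise."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        refl, nwp, noise = rng.random(), rng.random(), rng.random()
        X.append([refl, nwp, noise])
        y.append(1 if refl + 0.5 * nwp > 1.0 else 0)
    return X, y

# Hypothetical predictor names, in column order.
names = ["refl", "nwp", "noise"]
X, y = make_data(200, seed=1)
trees, importance = fit_forest(X, y)

p_high = predict_proba(trees, [0.95, 0.9, 0.5])  # strong signal in both predictors
p_low = predict_proba(trees, [0.10, 0.1, 0.5])   # weak signal in both predictors
ranking = sorted(zip(names, importance), key=lambda pair: -pair[1])
print("likelihoods:", p_high, p_low)
print("importance ranking:", ranking)
```

Here the importance score is the summed Gini decrease attributed to each predictor, which is one common way to rank predictors in tree ensembles. Raising `n_trees` or changing the event fraction in `make_data` gives a rough way to explore the sensitivities the abstract reports, though results from this toy setup say nothing about the paper's actual skill scores.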
Pages: 581-599
Number of pages: 19