Predicting tree species presence and basal area in Utah: A comparison of stochastic gradient boosting, generalized additive models, and tree-based methods

被引:210
作者
Moisen, Gretchen G.
Freeman, Elizabeth A.
Blackard, Jock A.
Frescino, Tracey S.
Zimmermann, Niklaus E.
Edwards, Thomas C., Jr.
机构
[1] USDA Forest Serv, Rocky Mt Res Stn, Ogden, UT 84401 USA
[2] Swiss Fed Res Inst WSL, Dept Landscape Res, CH-8903 Birmensdorf, Switzerland
[3] Utah State Univ, USGS Utah Cooperat Fish & Wildlife Res Unit, Coll Nat Resources, Logan, UT 84322 USA
关键词
species presence; predictive mapping; forest inventory; GAMs; classification trees; regression trees; See5; Cubist; stochastic gradient boosting;
D O I
10.1016/j.ecolmodel.2006.05.021
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Many efforts are underway to produce broad-scale forest attribute maps by modelling forest class and structure variables collected in forest inventories as functions of satellite-based and biophysical information. Typically, variants of classification and regression trees implemented in Rulequest's (c) See5 and Cubist (for binary and continuous responses, respectively) are the tools of choice in many of these applications. These tools are widely used in large remote sensing applications, but are not easily interpretable, do not have ties with survey estimation methods, and use proprietary unpublished algorithms. Consequently, three alternative modelling techniques were compared for mapping presence and basal area of 13 species located in the mountain ranges of Utah, USA. The modelling techniques compared included the widely used See5/Cubist, generalized additive models (GAMs), and stochastic gradient boosting (SGB). Model performance was evaluated using independent test data sets. Evaluation criteria for mapping species presence included specificity, sensitivity, Kappa, and area under the curve (AUC). Evaluation criteria for the continuous basal area variables included correlation and relative mean squared error. For predicting species presence (setting thresholds to maximize Kappa), SGB had higher values for the majority of the species for specificity and Kappa, while GAMs had higher values for the majority of the species for sensitivity. In evaluating resultant AUC values, GAM and/or SGB models had significantly better results than the See5 models where significant differences could be detected between models. For nine out of 13 species, basal area prediction results for all modelling techniques were poor (correlations less than 0.5 and relative mean squared errors greater than 0.8), but SGB provided the most stable predictions in these instances. SGB and Cubist performed equally well for modelling basal area for three species with moderate prediction success, while all three modelling tools produced comparably good predictions (correlation of 0.68 and relative mean squared error of 0.56) for one species. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:176 / 187
页数:12
相关论文
共 67 条
[1]   Spatial prediction of species distribution: an interface between ecological theory and statistical modelling [J].
Austin, MP .
ECOLOGICAL MODELLING, 2002, 157 (2-3) :101-118
[2]  
Bailey R. G., 1994, ECOREGIONS SUBREGION
[3]  
BAUER E, 1998, MACH LEARN, V5, P1
[4]  
BECHTOLD WA, 2005, SRS80 US DEP AGR FOR
[5]   SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation [J].
Blewitt, Marnie E. ;
Gendrel, Anne-Valerie ;
Pang, Zhenyi ;
Sparrow, Duncan B. ;
Whitelaw, Nadia ;
Craig, Jeffrey M. ;
Apedaile, Anwyn ;
Hilton, Douglas J. ;
Dunwoodie, Sally L. ;
Brockdorff, Neil ;
Kay, Graham F. ;
Whitelaw, Emma .
NATURE GENETICS, 2008, 40 (05) :663-669
[6]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[7]  
Breiman L, 1998, ANN STAT, V26, P801
[8]   Observed and predicted responses of plant growth to climate across Canada [J].
Bunn, AG ;
Goetz, SJ ;
Fiske, GJ .
GEOPHYSICAL RESEARCH LETTERS, 2005, 32 (16) :1-4
[9]   Development of a robust classifier of freshwater residence in barramundi (Lates calcarifer) life histories using elemental ratios in scales and boosted regression trees [J].
Cappo, M ;
De'ath, G ;
Boyle, S ;
Aumend, J ;
Olbrich, R ;
Hoedt, F ;
Perna, C ;
Brunskill, G .
MARINE AND FRESHWATER RESEARCH, 2005, 56 (05) :713-723
[10]  
Chan JCW, 2001, IEEE T GEOSCI REMOTE, V39, P693, DOI 10.1109/36.911126