Multiple criteria for evaluating machine learning algorithms for land cover classification from satellite data

Cited by: 189
Authors
DeFries, RS
Chan, JCW
Affiliations
[1] Univ Maryland, Dept Geog, College Pk, MD 20742 USA
[2] Univ Maryland, Earth Syst Sci Interdisciplinary Ctr, College Pk, MD 20742 USA
Funding
National Aeronautics and Space Administration (NASA)
DOI
10.1016/S0034-4257(00)00142-5
Chinese Library Classification (CLC) number
X [Environmental Science, Safety Science]
Discipline classification code
08 ; 0830 ;
Abstract
Operational monitoring of land cover from satellite data will require automated procedures for analyzing large volumes of data. We propose multiple criteria for assessing algorithms for this task. In addition to standard classification accuracy measures, we propose criteria to account for computational resources required by the algorithms, stability of the algorithms, and robustness to noise in the training data. We also propose that classification accuracy take account, through estimation of misclassification costs, of unequal consequences to the user depending on which cover types are confused. In this article, we apply these criteria to three variants of decision tree classifiers: a standard decision tree implemented in C5.0 and two techniques recently proposed in the machine learning literature, known as "bagging" and "boosting." Each of these algorithms is applied to two data sets, a global land cover classification from 8 km AVHRR data and a Landsat Thematic Mapper scene in Peru. Results indicate comparable accuracy of the three variants of the decision tree algorithms on the two data sets, with boosting providing marginally higher accuracies. The bagging and boosting algorithms, however, are both substantially more stable and more robust to noise in the training data than the standard C5.0 decision tree. The bagging algorithm is the most costly in terms of computational resources, while the standard decision tree is the least costly. The results illustrate that the choice of the most suitable algorithm requires consideration of a suite of criteria in addition to the traditional accuracy measures and that there are likely to be trade-offs between algorithm performance and required computational resources. © Elsevier Science Inc., 2000.
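The idea of weighting errors by their consequences can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the three cover types, the confusion matrices, and the cost values below are all hypothetical, chosen only to show how two classifiers with identical overall accuracy can differ in mean misclassification cost.

```python
def overall_accuracy(confusion):
    """Fraction of samples on the diagonal of a square confusion matrix."""
    total = sum(sum(row) for row in confusion)
    correct = sum(confusion[i][i] for i in range(len(confusion)))
    return correct / total


def mean_misclassification_cost(confusion, cost):
    """Average cost per sample.

    confusion[i][j] counts samples of true class i labeled as class j;
    cost[i][j] is the (assumed) penalty for that confusion.
    """
    n = len(confusion)
    total = sum(sum(row) for row in confusion)
    weighted = sum(confusion[i][j] * cost[i][j] for i in range(n) for j in range(n))
    return weighted / total


# Hypothetical classes: forest, grassland, water. All numbers are made up.
# Classifier A confuses forest and grassland; classifier B mislabels land as water.
confusion_a = [[40, 10, 0], [10, 30, 0], [0, 0, 10]]
confusion_b = [[40, 0, 10], [0, 30, 10], [0, 0, 10]]

# Assumed costs: confusing two vegetation types is cheap, confusing land and water is not.
cost = [[0, 1, 5], [1, 0, 5], [5, 5, 0]]

for name, confusion in (("A", confusion_a), ("B", confusion_b)):
    print(name, overall_accuracy(confusion), mean_misclassification_cost(confusion, cost))
```

Both hypothetical classifiers label 80 of 100 samples correctly, yet B incurs a higher mean cost because its errors fall on the more expensive land/water confusion. In practice the cost matrix would have to come from the user's application.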
Pages: 503-515
Page count: 13