Evaluating the predictive performance of habitat models developed using logistic regression

被引:1557
作者
Pearce, J [1 ]
Ferrier, S [1 ]
机构
[1] NSW Natl Parks & Wildlife Serv, Armidale, NSW 2350, Australia
关键词
logistic regression; model evaluation; prediction; relative operating characteristic curve;
D O I
10.1016/S0304-3800(00)00322-7
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
The use of statistical models to predict the likely occurrence or distribution of species is becoming an increasingly important tool in conservation planning and wildlife management. Evaluating the predictive performance of models using independent data is a vital step in model development. Such evaluation assists in determining the suitability of a model for specific applications, facilitates comparative assessment of competing models and modelling techniques, and identities aspects of a model most in need of improvement. The predictive performance of habitat models developed using logistic regression needs to be evaluated in terms of two components: reliability or calibration (the agreement between predicted probabilities of occurrence and observed proportions of sites occupied), and discrimination capacity (the ability of a model to correctly distinguish between occupied and unoccupied sites). Lack of reliability can be attributed to two systematic sources, calibration bias and spread. Techniques are described for evaluating both of these sources of error. The discrimination capacity of logistic regression models is often measured by cross-classifying observations and predictions in a two-by-two table, and calculating indices of classification performance. However, this approach relies on the essentially arbitrary choice of a threshold probability to determine whether or not a site is predicted to be occupied. An alternative approach is described which measures discrimination capacity in terms of the area under a relative operating characteristic (ROC) curve relating relative proportions of correctly and incorrectly classified predictions over a wide and continuous range of threshold levels. Wider application of the techniques promoted in this paper could greatly improve understanding of the usefulness, and potential limitations, of habitat models developed for use in conservation planning and wildlife management. (C) 2000 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:225 / 245
页数:21
相关论文
共 42 条
  • [1] [Anonymous], 1981, Statistical Tables
  • [2] MEASUREMENT OF THE REALIZED QUALITATIVE NICHE - ENVIRONMENTAL NICHES OF 5 EUCALYPTUS SPECIES
    AUSTIN, MP
    NICHOLLS, AO
    MARGULES, CR
    [J]. ECOLOGICAL MONOGRAPHS, 1990, 60 (02) : 161 - 177
  • [3] AREA ABOVE ORDINAL DOMINANCE GRAPH AND AREA BELOW RECEIVER OPERATING CHARACTERISTIC GRAPH
    BAMBER, D
    [J]. JOURNAL OF MATHEMATICAL PSYCHOLOGY, 1975, 12 (04) : 387 - 415
  • [4] COMPARING INDICATORS OF HEALTH OR NUTRITIONAL-STATUS
    BROWNIE, C
    HABICHT, JP
    COGILL, B
    [J]. AMERICAN JOURNAL OF EPIDEMIOLOGY, 1986, 124 (06) : 1031 - 1044
  • [5] CLODE D, 1997, JOINT OLD GROWTH FOR
  • [6] Collett D, 1991, MODELLING BINARY DAT
  • [7] Conover W. J., 1980, PRACTICAL NONPARAMET
  • [8] COX DR, 1958, BIOMETRIKA, V45, P562, DOI 10.1093/biomet/45.3-4.562
  • [9] DIFFENBACH DR, 1989, J WILDLIFE MANAGE, V53, P383
  • [10] Efron B., 1993, INTRO BOOTSTRAP, V1st ed., DOI DOI 10.1201/9780429246593