The effects of species' range sizes on the accuracy of distribution models: ecological phenomenon or statistical artefact?

Cited by: 443
Authors
McPherson, JM
Jetz, W
Rogers, DJ
Affiliations
[1] Univ Oxford, Dept Zool, Oxford OX1 3PS, England
[2] Princeton Univ, Dept Ecol & Evolutionary Biol, Princeton, NJ 08544 USA
[3] Univ New Mexico, Dept Biol, Albuquerque, NM 87131 USA
Keywords
discriminant analysis; kappa; logistic regression; prevalence; ROC plots; sample size; satellite imagery
DOI
10.1111/j.0021-8901.2004.00943.x
Chinese Library Classification (CLC) number
X176 [Biodiversity conservation]
Discipline classification code
090705
Abstract
1. Conservation scientists and resource managers increasingly employ empirical distribution models to aid decision-making. However, such models are not equally reliable for all species, and range size can affect their performance. We examined to what extent this effect reflects statistical artefacts arising from the influence of range size on the sample size and sampling prevalence (proportion of samples representing species presence) of data used to train and test models.
2. Our analyses used both simulated data and empirical distribution models for 32 bird species endemic to South Africa, Lesotho and Swaziland. Models were built with either logistic regression or non-linear discriminant analysis, and assessed with four measures of model accuracy: sensitivity, specificity, Cohen's kappa and the area under the curve (AUC) of receiver-operating characteristic (ROC) plots. Environmental indices derived from Fourier-processed satellite imagery served as predictors.
3. We first followed conventional modelling practice to illustrate how range size might influence model performance, when sampling prevalence reflects species' natural prevalences. We then demonstrated that this influence is primarily artefactual. Statistical artefacts can arise during model assessment, because Cohen's kappa responds systematically to changes in prevalence. AUC, in contrast, is largely unaffected, and thus a more reliable measure of model performance. Statistical artefacts also arise during model fitting. Both logistic regression and discriminant analysis are sensitive to the sample size and sampling prevalence of training data. Both perform best when sample size is large and prevalence intermediate.
4. Synthesis and applications. Species' ecological characteristics may influence the performance of distribution models. Statistical artefacts, however, can confound results in comparative studies seeking to identify these characteristics. To mitigate artefactual effects, we recommend careful reporting of sampling prevalence, AUC as the measure of accuracy, and fixed, intermediate levels of sampling prevalence in comparative studies.
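The contrast the abstract draws between Cohen's kappa and AUC can be illustrated with a minimal sketch. This is not the authors' analysis: the function names `kappa_from_rates` and `rank_auc`, the fixed sensitivity and specificity of 0.8, and the score lists are all hypothetical choices made for illustration. The sketch holds a classifier's behaviour constant and varies only the proportion of presences in the test sample.

```python
def kappa_from_rates(sensitivity, specificity, prevalence):
    """Cohen's kappa expected for a classifier with fixed sensitivity and
    specificity when the test sample has the given prevalence.
    (Hypothetical helper, works on expected proportions, not counts.)"""
    tp = sensitivity * prevalence                # true positives
    tn = specificity * (1 - prevalence)          # true negatives
    fp = (1 - specificity) * (1 - prevalence)    # false positives
    observed = tp + tn                           # observed agreement
    predicted_pos = tp + fp
    # Agreement expected by chance, from the marginal proportions.
    expected = (predicted_pos * prevalence
                + (1 - predicted_pos) * (1 - prevalence))
    return (observed - expected) / (1 - expected)


def rank_auc(pos_scores, neg_scores):
    """AUC as the probability that a presence outscores an absence,
    counting ties as one half. (Hypothetical helper.)"""
    wins = 0.0
    for p in pos_scores:
        for n in neg_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos_scores) * len(neg_scores))


# Kappa at several sampling prevalences, classifier held fixed (illustrative).
for prevalence in (0.05, 0.25, 0.5, 0.75, 0.95):
    print(prevalence, round(kappa_from_rates(0.8, 0.8, prevalence), 3))

# AUC with the same score distributions but different prevalence:
# replicating the absences lowers prevalence from 0.5 to 0.1.
presences = [0.9, 0.8, 0.6, 0.4]   # illustrative model scores
absences = [0.7, 0.5, 0.3, 0.2]
print(rank_auc(presences, absences))
print(rank_auc(presences, absences * 9))
```

Under these assumptions kappa peaks at intermediate prevalence and falls towards the extremes even though sensitivity and specificity never change, whereas the rank-based AUC is identical for both samples because it depends only on how presences score relative to absences.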
Pages: 811-823
Number of pages: 13