Classification in conservation biology: A comparison of five machine-learning methods

被引:135
作者
Kampichler, Christian [1 ,2 ]
Wieland, Ralf [3 ]
Calme, Sophie [4 ,5 ]
Weissenberger, Holger [4 ]
Arriaga-Weiss, Stefan [2 ]
机构
[1] NIOO KNAW, Vogeltrekstat, Dutch Ctr Avian Migrat & Demog, NL-6666 ZG Heteren, Netherlands
[2] Univ Juarez Autonoma Tabasco, Div Ciencias Biol, Villahermosa 86150, Tabasco, Mexico
[3] ZALF Leibniz Zentrum Agrarlandschaftsforsch, Inst Landschaftssyst Anal, D-15374 Muncheberg, Germany
[4] El Colegio Frontera Sur, Unidad Chetumal, Chetmal 77900, Quintana Roo, Mexico
[5] Univ Sherbrooke, Dept Biol, Sherbrooke, PQ J1K 2R1, Canada
关键词
Artificial neural networks; Classification trees; Fuzzy logic; Meleagris ocellata; Random forests; Support vector machines; GENERALIZED ADDITIVE-MODELS; RANDOM FORESTS; NEURAL-NETWORKS; FUZZY; DISTRIBUTIONS; CLASSIFIERS; INFORMATION; PERFORMANCE; TREES; AREA;
D O I
10.1016/j.ecoinf.2010.06.003
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Classification is one of the most widely applied tasks in ecology. Ecologists have to deal with noisy, high-dimensional data that often are non-linear and do not meet the assumptions of conventional statistical procedures. To overcome this problem, machine-learning methods have been adopted as ecological classification methods. We compared five machine-learning based classification techniques (classification trees, random forests, artificial neural networks, support vector machines, and automatically induced rule-based fuzzy models) in a biological conservation context. The study case was that of the ocellated turkey (Meleagris ocellata), a bird endemic to the Yucatan peninsula that has suffered considerable decreases in local abundance and distributional area during the last few decades. On a grid of 10x 10 km cells that was superimposed to the peninsula we analysed relationships between environmental and social explanatory variables and ocellated turkey abundance changes between 1980 and 2000. Abundance was expressed in three (decrease, no change, and increase) and 14 more detailed abundance change classes, respectively. Modelling performance varied considerably between methods with random forests and classification trees being the most efficient ones as measured by overall classification error and the normalised mutual information index. Artificial neural networks yielded the worst results along with linear discriminant analysis, which was included as a conventional statistical approach. We not only evaluated classification accuracy but also characteristics such as time effort, classifier comprehensibility and method intricacy aspects that determine the success of a classification technique among ecologists and conservation biologists as well as for the communication with managers and decision makers. We recommend the combined use of classification trees and random forests due to the easy interpretability of classifiers and the high comprehensibility of the method. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:441 / 450
页数:10
相关论文
共 61 条
[1]   Predicting habitat suitability with machine learning models:: The potential area of Pinus sylvestris L. in the Iberian Peninsula [J].
Benito Garzon, Marta ;
Blazek, Radim ;
Neteler, Markus ;
Sanchez de Dios, Rut ;
Sainz Ollero, Helios ;
Furlanello, Cesare .
ECOLOGICAL MODELLING, 2006, 197 (3-4) :383-393
[2]   SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation [J].
Blewitt, Marnie E. ;
Gendrel, Anne-Valerie ;
Pang, Zhenyi ;
Sparrow, Duncan B. ;
Whitelaw, Nadia ;
Craig, Jeffrey M. ;
Apedaile, Anwyn ;
Hilton, Douglas J. ;
Dunwoodie, Sally L. ;
Brockdorff, Neil ;
Kay, Graham F. ;
Whitelaw, Emma .
NATURE GENETICS, 2008, 40 (05) :663-669
[3]  
Bouchon-Meunier B, 2007, STUD FUZZ SOFT COMP, V215, P219
[4]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[5]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[6]   Application and evaluation of classification trees for screening unwanted plants [J].
Caley, Peter ;
Kuhnert, Petra M. .
AUSTRAL ECOLOGY, 2006, 31 (05) :647-655
[7]  
CALME S, STUDIES AVI IN PRESS
[8]   Hierarchical tree classifiers to find suitable sites for sandplain grasslands and heathlands on Martha's Vineyard Island, Massachusetts [J].
Chase, Tom ;
Rothley, Kristina D. .
BIOLOGICAL CONSERVATION, 2007, 136 (01) :65-75
[9]   Random forests for classification in ecology [J].
Cutler, D. Richard ;
Edwards, Thomas C., Jr. ;
Beard, Karen H. ;
Cutler, Adele ;
Hess, Kyle T. .
ECOLOGY, 2007, 88 (11) :2783-2792
[10]  
De'ath G, 2000, ECOLOGY, V81, P3178, DOI 10.1890/0012-9658(2000)081[3178:CARTAP]2.0.CO