Spatial prediction of landslides using a hybrid machine learning approach based on Random Subspace and Classification and Regression Trees

被引:200
作者
Binh Thai Pham [1 ,2 ]
Prakash, Indra [3 ]
Dieu Tien Bui [1 ,2 ]
机构
[1] Ton Duc Thang Univ, Geog Informat Sci Res Grp, Ho Chi Minh City, Vietnam
[2] Ton Duc Thang Univ, Fac Environm & Labour Safety, Ho Chi Minh City, Vietnam
[3] Govt Gujarati, Bhaskarcharya Inst Space Applicat & Geoinformat B, Dept Sci & Technol, Gandhinagar, India
关键词
Landslide susceptibility map; Machine learning; Emsembles; Random Subspace; Classification and Regression Trees; ARTIFICIAL NEURAL-NETWORKS; SUPPORT VECTOR MACHINES; RIVER FAULT ZONE; LOGISTIC-REGRESSION; SUSCEPTIBILITY ASSESSMENT; AREA; GIS; ENSEMBLES; INTEGRATION; TECTONICS;
D O I
10.1016/j.geomorph.2017.12.008
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
A hybrid machine learning approach of Random Subspace (RSS) and Classification And Regression Trees (CART) is proposed to develop a model named RSSCART for spatial prediction of landslides. This model is a combination of the RSS method which is known as an efficient ensemble technique and the CART which is a state of the art classifier. The Luc Yen district of Yen Bai province, a prominent landslide prone area of Viet Nam, was selected for the model development. Performance of the RSSCART model was evaluated through the Receiver Operating Characteristic (ROC) curve, statistical analysis methods, and the Chi Square test. Results were compared with other benchmark landslide models namely Support Vector Machines (SVM), single CART, Nave Bayes Trees (NBT), and Logistic Regression (LR). In the development of model, ten important landslide affecting factors related with geomorphology, geology and geo-environment were considered namely slope angles, elevation, slope aspect, curvature, lithology, distance to faults, distance to rivers, distance to roads, and rainfall. Performance of the RSSCART model (AUC = 0.841) is the best compared with other popular landslide models namely SVM (0.835), single CART (0.822), NBT (0.821), and LR (0.723). These results indicate that performance of the RSSCART is a promising method for spatial landslide prediction. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:256 / 270
页数:15
相关论文
共 68 条
[31]   Landslide susceptibility assessment in Lianhua County (China): A comparison between a random forest data mining technique and bivariate and multivariate statistical models [J].
Hong, Haoyuan ;
Pourghasemi, Hamid Reza ;
Pourtaghi, Zohre Sadat .
GEOMORPHOLOGY, 2016, 259 :105-118
[32]   INTERPOLATING MEAN RAINFALL USING THIN-PLATE SMOOTHING SPLINES [J].
HUTCHINSON, MF .
INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SYSTEMS, 1995, 9 (04) :385-403
[33]   A Geographical Information System (GIS) Based Probabilistic Certainty Factor Approach in Assessing Landslide Susceptibility: The Case Study of Kimi, Euboea, Greece [J].
Ilia, Ioanna ;
Koumantakis, Ioannis ;
Rozos, Dimitrios ;
Koukis, Georgios ;
Tsangaratos, Paraskevas .
ENGINEERING GEOLOGY FOR SOCIETY AND TERRITORY, VOL 2: LANDSLIDE PROCESSES, 2015, :1199-1204
[34]   Evaluation and validation of landslide-susceptibility maps obtained by a GIS matrix method:: examples from the Betic Cordillera (southern Spain) [J].
Irigaray, C. ;
Fernandez, T. ;
El Hamdouni, R. ;
Chacon, J. .
NATURAL HAZARDS, 2007, 41 (01) :61-79
[35]   A comparative study of conventional, ANN black box, fuzzy and combined neural and fuzzy weighting procedures for landslide susceptibility zonation in Darjeeling Himalayas [J].
Kanungo, D. P. ;
Arora, M. K. ;
Sarkar, S. ;
Gupta, R. P. .
ENGINEERING GEOLOGY, 2006, 85 (3-4) :347-366
[36]  
Kohavi R., 1996, P 2 INT C KNOWL DISC, P202, DOI DOI 10.1007/978-3-642-16530-6_42
[37]   Random Subspace Ensembles for fMRI Classification [J].
Kuncheva, Ludmila I. ;
Rodriguez, Juan J. ;
Plumpton, Catrin O. ;
Linden, David E. J. ;
Johnston, Stephen J. .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2010, 29 (02) :531-542
[38]   Landslide susceptibility mapping in the Damrei Romel area, Cambodia using frequency ratio and logistic regression models [J].
Lee, Saro ;
Sambath, Touch .
ENVIRONMENTAL GEOLOGY, 2006, 50 (06) :847-855
[39]   Mining the customer credit using classification and regression tree and multivariate adaptive regression splines [J].
Lee, TS ;
Chiu, CC ;
Chou, YC ;
Lu, CJ .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2006, 50 (04) :1113-1130
[40]  
Liem N. V., 2016, Vietnam Journal of Earth Sciences, V38, P202