A novel ensemble decision tree-based CHi-squared Automatic Interaction Detection (CHAID) and multivariate logistic regression models in landslide susceptibility mapping

被引:148
作者
Althuwaynee, Omar F. [1 ]
Pradhan, Biswajeet [1 ]
Park, Hyuck-Jin [2 ]
Lee, Jung Hyun [2 ]
机构
[1] Univ Putra Malaysia, Fac Engn, Dept Civil Engn, Serdang 43400, Selangor Darul, Malaysia
[2] Sejong Univ, Dept Geoinformat Engn, Seoul 143747, South Korea
基金
新加坡国家研究基金会;
关键词
Landslide; Ensemble modeling; Decision tree; CHAID.LR; GIS; Remote sensing; South Korea; HOA BINH PROVINCE; ANALYTICAL HIERARCHY PROCESS; EVIDENTIAL BELIEF FUNCTIONS; FUZZY-LOGIC; WENCHUAN EARTHQUAKE; SPATIAL PREDICTION; LIKELIHOOD RATIO; GIS; HAZARD; MOUNTAINS;
D O I
10.1007/s10346-014-0466-0
中图分类号
P5 [地质学];
学科分类号
0709 ; 081803 ;
摘要
An ensemble algorithm of data mining decision tree (DT)-based CHi-squared Automatic Interaction Detection (CHAID) is widely used for prediction analysis in variety of applications. CHAID as a multivariate method has an automatic classification capacity to analyze large numbers of landslide conditioning factors. Moreover, it results two or more nodes for each independent variable, where every node contains numbers of presence or absence of landslides (dependent variable). Other DT methods such as Quick, Unbiased, Efficient Statistic Tree (QUEST) and Classification and Regression Trees (CRT) are not able to produce multi branches based tree. Thus, the main objective of this paper is to use CHAID method to perform the best classification fit for each conditioning factors, then, combined it with logistic regression (LR) to find the corresponding coefficients of best fitting function that assess the optimal terminal nodes. In the first step, a landslide inventory map with 296 landslide locations were extracted from various sources over the Pohang-Kyeong Joo catchment (South Korea). Then, the inventory was randomly split into two datasets, 70% was used for training the models, and the remaining 30% was used for validation purpose. Thirteen landslide conditioning factors were used for the susceptibility modeling. Then, CHAID was applied and revealed that some conditioning factors such as altitude, soil drain, soil texture and TWI, as terminal nodes and reflected the best classification fit. Then, a proposed ensemble technique was applied and the interpretations of the coefficients showed that the relationship between the decision tree branch nodes distance from drain, soil drain, and TWI, respectively, leads to better consequences assessment of landslides in the current study area. The validation results showed that both success and prediction rates, 75 and 79%, respectively. This study proved the efficiency and reliability of ensemble DT and LR model in landslide susceptibility mapping.
引用
收藏
页码:1063 / 1078
页数:16
相关论文
共 67 条
  • [1] An easy-to-use MATLAB program (MamLand) for the assessment of landslide susceptibility using a Mamdani fuzzy algorithm
    Akgun, A.
    Sezer, E. A.
    Nefeslioglu, H. A.
    Gokceoglu, C.
    Pradhan, B.
    [J]. COMPUTERS & GEOSCIENCES, 2012, 38 (01) : 23 - 34
  • [2] A comparison of landslide susceptibility maps produced by logistic regression, multi-criteria decision, and likelihood ratio methods: a case study at Izmir, Turkey
    Akgun, Aykut
    [J]. LANDSLIDES, 2012, 9 (01) : 93 - 106
  • [3] Althuwaynee Omar F., 2012, 2012 IEEE Colloquium on Humanities, Science and Engineering Research (CHUSER 2012), P362, DOI 10.1109/CHUSER.2012.6504340
  • [4] A novel ensemble bivariate statistical evidential belief function with knowledge-based analytical hierarchy process and multivariate statistical logistic regression for landslide susceptibility mapping
    Althuwaynee, Omar F.
    Pradhan, Biswajeet
    Park, Hyuck-Jin
    Lee, Jung Hyun
    [J]. CATENA, 2014, 114 : 21 - 36
  • [5] Application of an evidential belief function model in landslide susceptibility mapping
    Althuwaynee, Omar F.
    Pradhan, Biswajeet
    Lee, Saro
    [J]. COMPUTERS & GEOSCIENCES, 2012, 44 : 120 - 135
  • [6] [Anonymous], INV FOR LANDSL OCC K
  • [7] [Anonymous], EGU GEN ASS C
  • [8] [Anonymous], 2014, C4. 5: programs for machine learning
  • [9] [Anonymous], 2012, ENSEMBLE BASED LANDS
  • [10] [Anonymous], ANSW TREE 2 0 US GUI