A method for mineral prospectivity mapping integrating C4.5 decision tree, weights-of-evidence and m-branch smoothing techniques: a case study in the eastern Kunlun Mountains, China

被引:35
作者
Chen, Cuihua [1 ]
He, Binbin [2 ]
Zeng, Ze [2 ]
机构
[1] Chengdu Univ Technol, Coll Geosci, Chengdu 610059, Sichuan, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Resources & Environm, Chengdu 611731, Sichuan, Peoples R China
基金
中国国家自然科学基金;
关键词
Eastern Kunlun Mountains; Mineral prospectivity mapping; C4.5 decision tree; M-branch smoothing; Weights-of-evidence model; CLASSIFICATION; SYSTEMS;
D O I
10.1007/s12145-013-0128-0
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this study, a novel method that integrates C4.5 decision tree, weights-of-evidence and m-branch smoothing techniques was proposed for mineral prospectivity mapping. First, a weights-of-evidence model was used to rank the importance of each evidential map and determine the optimal buffer distance. Second, a classification technique that uses a C4.5 decision tree in data mining was used to construct a decision tree classifier for the grid dataset. Finally, an m-branch smoothing technique was used as a predictor, which transformed the decision tree into a probability evaluation tree. The method makes no conditional independence assumption and can be applied for class imbalanced datasets like those collected during mineral exploration for prospectivity mapping of an area. The traits of comprehensibility, accuracy and efficiency were derived from the C4.5 decision tree. In addition, a case study for iron prospectivity mapping was performed in the eastern Kunlun Mountains, China. Sixty-two Skarn iron deposits and eight evidential maps related to iron mineralization were studied. On the final map, areas of low, moderate and high potential for iron deposit occurrence covered areas of 71,491, 14,298, and 9,532 km(2), respectively. For the goodness-of-fit test, 91.94 % of the total 62 iron deposits were within a high-potential area, 8.06 % were within a moderate-potential area and 0 % were within a low-potential area. For ten-fold cross-validation, 82.26 % were within a high-potential area, 14.52 % were within a moderate-potential area and 3.22 % were within a low-potential area. To evaluate the predictive accuracy, Receiver Operating Characteristic (ROC) curves and Area Under ROC Curve (AUC) were employed. The accuracy of the goodness-of-fit test reached 97.07 %, and the accuracy of the ten-fold cross-validation was 95.10 %. The majority of the iron deposits were within high-potential and moderate-potential areas, which covered a small proportion of the study area.
引用
收藏
页码:13 / 24
页数:12
相关论文
共 32 条
[1]   Support vector machine for multi-classification of mineral prospectivity areas [J].
Abedi, Maysam ;
Norouzi, Gholam-Hossain ;
Bahroudi, Abbas .
COMPUTERS & GEOSCIENCES, 2012, 46 :272-283
[2]  
Agterberg F.P., 1993, COMPUTERS GEOLOGY 25, P13
[3]  
Agterberg FP., 1992, NONRENEWABLE RESOURC, V1, P39, DOI [10.1007/BF01782111, DOI 10.1007/BF01782111]
[4]   APPLICATIONS OF FUZZY EXPERTS SYSTEMS IN INTEGRATED OIL-EXPLORATION [J].
AMINZADEH, F .
COMPUTERS & ELECTRICAL ENGINEERING, 1994, 20 (02) :89-97
[5]  
[Anonymous], P 14 EUR C ART INT
[6]  
[Anonymous], 2014, C4. 5: programs for machine learning
[7]  
[Anonymous], 2011, Pei. data mining concepts and techniques
[8]  
Binbin He, 2011, Proceedings of the 2011 IEEE International Conference on Spatial Data Mining and Geographical Knowledge Services (ICSDM 2011), P96, DOI 10.1109/ICSDM.2011.5969012
[9]  
Bonham-Carter G F., 1990, Geological Survey of Canada Paper, V89, P171
[10]  
Carranza E.J.M., 2004, NAT RESOUR RES, V13, P173, DOI [DOI 10.1023/B:NARR.0000046919.87758.F5, 10.1023/B:NARR.0000046919.87758.f5]