Predictive lithologic mapping of South Korea from geochemical data using decision trees

被引:18
作者
Bacal, Ma Chrizelle Joyce Orillo [1 ]
Hwang, SangGi [1 ]
Guevarra-Segura, Ivy [1 ]
机构
[1] Pai Chai Univ, Dept Civil Environm & Railrd Engn, Daejeon, South Korea
关键词
Multivariate classification; Decision trees; Geochemical pattern; Digital geologic mapping; South Korea; REMOTE-SENSING DATA; SUPPORT-VECTOR-MACHINE; MINERAL PROSPECTIVITY; RANDOM FORESTS; AIRBORNE GEOPHYSICS; ANOMALIES; CLASSIFICATION; IDENTIFICATION; DEPOSITS; PROVINCE;
D O I
10.1016/j.gexplo.2019.06.008
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Two machine learning algorithms, C4.5 and random forest, collectively known as decision trees were utilized to directly establish the relationship between geochemical maps of South Korea and its geology. Using a large database containing geochemical and lithologic properties, inconsistencies in the a priori lithologic information were fixed using confusion matrix analysis and F-measure comparison via iterative C4.5 implementation. This corrective method resulted in eighteen rock classes but the succeeding C4.5 and random forest application only focused on classifying the 10 most common rock units. Geologic age was included as an attribute at such stage. Results were assessed using accuracy, precision, recall, kappa statistics, and F-measure. Average concentration of major oxides using records of correctly classified rock units were evaluated through Z-score normalization. C4.5 classification successfully predicted the spatial distribution of key lithologic units at 87% whereas random forest classification was at 96%. For both decision tree models, average standardized concentration of major oxides in each lithology adhere to perceived geologic knowledge, thereby proving the validity of the results. Rock age is determined as the most important predictor whereas major elements Al2O3, Na2O, and MgO together with trace elements Cr, Ni, and Cu are the strongest numeric predictors. Misinterpreted data points are mainly due to interpolation errors at or near map polygon boundaries, especially where map polygons are less than 50 km(2), and/or natural and anthropogenic contamination. Despite the misclassifications, decision trees are proven to be effective techniques in classifying lithologic units, and thus can reproduce a reliable geologic map from geochemical data of South Korea.
引用
收藏
页数:15
相关论文
共 81 条
[11]   Weathering reactions and isometric log-ratio coordinates: Do they speak to each other? [J].
Buccianti, Antonella ;
Zuo, Renguang .
APPLIED GEOCHEMISTRY, 2016, 75 :189-199
[12]  
Carneiro CD, 2012, GEOPHYSICS, V77, pK17, DOI [10.1190/GEO2011-0302.1, 10.1190/geo2011-0302.1]
[13]   Data-Driven Predictive Modeling of Mineral Prospectivity Using Random Forests: A Case Study in Catanduanes Island (Philippines) [J].
Carranza, Emmanuel John M. ;
Laborte, Alice G. .
NATURAL RESOURCES RESEARCH, 2016, 25 (01) :35-50
[14]   Data-driven predictive mapping of gold prospectivity, Baguio district, Philippines: Application of Random Forests algorithm [J].
Carranza, Emmanuel John M. ;
Laborte, Alice G. .
ORE GEOLOGY REVIEWS, 2015, 71 :777-787
[15]   Random forest predictive modeling of mineral prospectivity with small number of prospects and data with missing values in Abra (Philippines) [J].
Carranza, Emmanuel John M. ;
Laborte, Alice G. .
COMPUTERS & GEOSCIENCES, 2015, 74 :60-70
[16]   Catchment basin modelling of stream sediment anomalies revisited: incorporation of EDA and fractal analysis [J].
Carranza, Emmanuel John M. .
GEOCHEMISTRY-EXPLORATION ENVIRONMENT ANALYSIS, 2010, 10 (04) :365-381
[17]   A method for mineral prospectivity mapping integrating C4.5 decision tree, weights-of-evidence and m-branch smoothing techniques: a case study in the eastern Kunlun Mountains, China [J].
Chen, Cuihua ;
He, Binbin ;
Zeng, Ze .
EARTH SCIENCE INFORMATICS, 2014, 7 (01) :13-24
[18]   Application of classical statistics and multifractals to delineate Au mineralization-related geochemical anomalies from stream sediment data: a case study in Xinghai-Zeku, Qinghai, China [J].
Chen, Xin ;
Zheng, Youye ;
Xu, Rongke ;
Wang, Huimin ;
Jiang, Xiaojia ;
Yan, Hongze ;
Cai, Pengjie ;
Guo, Xianzheng .
GEOCHEMISTRY-EXPLORATION ENVIRONMENT ANALYSIS, 2016, 16 (3-4) :253-264
[19]   Application of one-class support vector machine to quickly identify multivariate anomalies from geochemical exploration data [J].
Chen, Yongliang ;
Wu, Wei .
GEOCHEMISTRY-EXPLORATION ENVIRONMENT ANALYSIS, 2017, 17 (03) :231-238
[20]   Mapping mineral prospectivity using an extreme learning machine regression [J].
Chen, Yongliang ;
Wu, Wei .
ORE GEOLOGY REVIEWS, 2017, 80 :200-213