Random forest predictive modeling of mineral prospectivity with small number of prospects and data with missing values in Abra (Philippines)

被引:275
作者
Carranza, Emmanuel John M. [1 ]
Laborte, Alice G. [2 ]
机构
[1] James Cook Univ, Townsville, Qld 4811, Australia
[2] Int Rice Res Inst, Los Banos, Laguna, Philippines
关键词
Data-driven modeling; Ensemble of regression trees; Porphyry copper; GIS; ARTIFICIAL NEURAL-NETWORKS; QUANTITATIVE ESTIMATION; INTEGRATION; EXPLORATION; DISTRICT; DEPOSITS; PROVINCE; CLASSIFICATION; REGRESSION; MACHINE;
D O I
10.1016/j.cageo.2014.10.004
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Machine learning methods that have been used in data-driven predictive modeling of mineral prospectivity (e.g., artificial neural networks) invariably require large number of training prospect/locations and are unable to handle missing values in certain evidential data. The Random Forests (RF) algorithm, which is a machine learning method, has recently been applied to data-driven predictive mapping of mineral prospectivity, and so it is instructive to further study its efficacy in this particular field. This case study, carried out using data from Abra (Philippines), examines (a) if RF modeling can be used for data-driven modeling of mineral prospectivity in areas with a few (i.e., <20) mineral occurrences and (b) if RF modeling can handle evidential data with missing values. We found that RF modeling outperforms weights-of-evidence (WofE) modeling of porphyry-Cu prospectivity in the Abra area, where 12 porphyry-Cu prospects are known to exist. Moreover, just like WofE modeling, RF modeling allows analysis of the spatial associations of known prospects with individual layers of evidential data. Furthermore, RF modeling can handle missing values in evidential data through an RF-based imputation technique whereas in WofE modeling values are simply represented by zero weights. Therefore, the RF algorithm is potentially more useful than existing methods that are currently used for data-driven predictive mapping of mineral prospectivity. In particular, it is not a purely black-box method like artificial neural networks in the context of data-driven predictive modeling of mineral prospectivity. However, further testing of the method in other areas with a few mineral occurrences is needed to fully investigate its usefulness in data-driven predictive modeling of mineral prospectivity. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:60 / 70
页数:11
相关论文
共 92 条
[1]   Support vector machine for multi-classification of mineral prospectivity areas [J].
Abedi, Maysam ;
Norouzi, Gholam-Hossain ;
Bahroudi, Abbas .
COMPUTERS & GEOSCIENCES, 2012, 46 :272-283
[2]   Integration of various geophysical data with geological and geochemical data to determine additional drilling for copper exploration [J].
Abedi, Maysam ;
Norouzi, Gholam-Hossain .
JOURNAL OF APPLIED GEOPHYSICS, 2012, 83 :35-45
[3]  
Agterberg F.P., 1993, COMPUTERS GEOLOGY 25, P13
[4]  
Agterberg F.P., 2005, Nat. Resour. Res., V14, P1, DOI DOI 10.1007/S11053-005-4674-0
[5]  
AGTERBERG FP, 1990, COMPUT GEOL, V7, P1
[6]  
Agterberg FP., 2002, NAT RESOUR RES, V11, P249, DOI [10.1023/A:1021193827501, DOI 10.1023/A:1021193827501]
[7]  
An P., 1994, Nonrenewable Resour, V3, P60, DOI [10.1007/BF02261716, DOI 10.1007/BF02261716]
[8]  
[Anonymous], R PROJ STAT COMP
[9]  
[Anonymous], NAT RESOUR RES
[10]  
[Anonymous], 2010, Nat. Resour. Res.