Parallel MARS algorithm based on B-splines

被引:5
作者
Bakin, S [1 ]
Hegland, M
Osborne, MR
机构
[1] Australian Natl Univ, Sch Math Sci, Canberra, ACT 0200, Australia
[2] Australian Natl Univ, Comp Sci Lab, Canberra, ACT 0200, Australia
关键词
MARS; B-splines; data mining; parallel algorithms;
D O I
10.1007/PL00022715
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We investigate one of the possible ways for improving Friedman's Multivariate Adaptive Regression Splines (MARS) algorithm designed for flexible modelling of high-dimensional data. In our version of MARS called BMARS we use B-splines instead of truncated power basis functions. The fact that B-splines have compact support allows us to introduce the notion of a "scale" of a basis function. The algorithm starts building up models by using large-scale basis functions and switches over to a smaller scale after the fitting ability of the large scale splines has been exhausted. The process is repeated until the prespecified number of basis functions has been produced. In addition, we discuss a parallelisation of BMARS as well as an application of the algorithm to processing of a large commercial data set. The results demonstrate the computational efficiency of our algorithm and its ability to generate models competitive with those of the original MARS.
引用
收藏
页码:463 / 484
页数:22
相关论文
共 14 条
[1]  
[Anonymous], 1990, SUBSET SELECTION REG, DOI DOI 10.1007/978-1-4899-2939-6
[2]  
Breiman L., 1984, BIOMETRICS, DOI DOI 10.2307/2530946
[3]  
CHEN Z, 1990, SMS00990 AUSTR NAT U
[4]  
COX MG, 1981, TOPICS NUMERICAL ANA, P79
[5]  
Fayyad U. M., 1996, ADV KNOWLEDGE DISCOV, P1, DOI DOI 10.1609/AIMAG.V17I3.1230
[6]   PROJECTION PURSUIT REGRESSION [J].
FRIEDMAN, JH ;
STUETZLE, W .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1981, 76 (376) :817-823
[7]   MULTIVARIATE ADAPTIVE REGRESSION SPLINES [J].
FRIEDMAN, JH .
ANNALS OF STATISTICS, 1991, 19 (01) :1-67
[8]  
FRIEDMAN JH, 1981, 108 STANF U
[9]  
Geist A, 1994, PVM PARALLEL VIRTUAL
[10]   VARIABLE SELECTION VIA GIBBS SAMPLING [J].
GEORGE, EI ;
MCCULLOCH, RE .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1993, 88 (423) :881-889