Data preparation process for construction knowledge generation through knowledge discovery in databases

被引:110
作者
Soibelman, L [1 ]
Kim, H [1 ]
机构
[1] Univ Illinois, Dept Civil Engn, Urbana, IL 61801 USA
关键词
data processing; databases; neural networks; construction industry; data analysis;
D O I
10.1061/(ASCE)0887-3801(2002)16:1(39)
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
As the construction industry is adapting to new computer technologies in terms of hardware and software, computerized construction data are becoming increasingly available. The explosive growth of many business, government, and scientific databases has begun to far outpace our ability to interpret and digest the data. Such volumes of data clearly overwhelm the traditional methods of data analysis such as spreadsheets and ad-hoc queries. The traditional methods can create informative reports from data, but cannot analyze the contents of those reports, A significant need exists for a new generation of techniques and tools with the ability to automatically assist humans in analyzing the mountains of data for useful knowledge. Knowledge discovery in databases (KDD) and data mining (DM) are tools that allow identification of valid, useful, and previously unknown patterns so that the construction manager may analyze the large amount of construction project data. These technologies combine techniques from machine learning, artificial intelligence, pattern recognition, statistics, databases, and visualization to automatically extract concepts, interrelationships, and patterns of interest from large databases. This paper presents the necessary steps such as (1) identification of problems, (2) data preparation, (3) data mining, (4) data analysis, and (5) refinement process required for the implementation of KDD. In order to test the feasibility of the proposed approach, a prototype of the KDD system was developed and tested with a construction management database, RMS (Resident Management System), provided by the U. S. Corps of Engineers. In this paper, the KDD process was applied to identify the cause(s) of construction activity delays. However, its possible applications can be extended to identify cause(s) of cost overrun and quality control/assurance among other construction problems. Predictable patterns may be revealed in construction data that were previously thought to be chaotic.
引用
收藏
页码:39 / 48
页数:10
相关论文
共 38 条
[1]  
Anand SS, 1998, LECT NOTES ARTIF INT, V1394, P25
[2]  
Anand T., 1992, Proceedings of the Eighth Conference on Artificial Intelligence for Applications (Cat. No.92CH3122-9), P2, DOI 10.1109/CAIA.1992.200003
[3]  
[Anonymous], ADV NEURAL INFORM PR
[4]  
[Anonymous], 1989, NeurIPS
[5]   Nonlinear structural control using neural networks [J].
Bani-Hani, K ;
Ghaboussi, J .
JOURNAL OF ENGINEERING MECHANICS-ASCE, 1998, 124 (03) :319-327
[6]  
BARRIE D, 1985, PROFESSIONAL CONSTRU
[7]  
BHANDARI I, 1995, INT J DATA MINING KN, V1, P121
[8]  
BLANCHARD D, 1993, NEURAL BASED FRAUD D, V2, P28
[9]  
BRACHMAN, 1997, DATA MIN KNOWL DISC, V1, P33
[10]  
BUCHHEIT RB, 2000, P COMP CIV ENG STANF, P914