Applying text and data mining techniques to forecasting the trend of petitions filed to e-People

被引:22
作者
Suh, Jong Hwan [1 ]
Park, Chung Hoon [2 ]
Jeon, Si Hyun [3 ]
机构
[1] SAMSUNG SDS, Informat & Commun Technol Res & Dev Ctr, Seoul 135798, South Korea
[2] SAMSUNG SDS, Consulting Div, Seoul 135918, South Korea
[3] ACRC, Seoul 125705, South Korea
关键词
Text mining; Data mining; Petition; Keyword extracting; Document clustering; Forecasting; e-Government; Open Innovation; e-People; RADIAL BASIS FUNCTION; K-MEANS ALGORITHM; NEURAL-NETWORK; DECISION TREE; MACHINE; INTEGRATION; ENSEMBLE; MODEL;
D O I
10.1016/j.eswa.2010.04.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As the Internet has been the virtual place where citizens are united and their opinions are promptly shifted into the action, two way communications between the government sector and the citizen have been more important among activities of e-Government. Hence, Anti-corruption and Civil Rights Commission (ACRC) in the Republic of Korea has constructed the online petition portal system named e-People. In addition, the nation's Open Innovation through e-People has gained increasing attention. That is because e-People can be applied for the virtual space where citizens participate in improving the national law and policy by simply filing petitions to e-People as the voice of the nation. However, currently there are problems and challenging issues to be solved until e-People can function as the virtual space for the nation's Open Innovation based on petitions collected from citizens. First, there is no objective and systematic method for analyzing a large number of petitions filed to e-People without a lot of manual works of petition inspectors. Second, e-People is required to forecast the trend of petitions filed to e-People more accurately and quickly than petition inspectors for making a better decision on the national law and policy strategy. Therefore, in this paper, we propose the framework of applying text and data mining techniques not only to analyze a large number of petitions filed to e-People but also to predict the trend of petitions. In detail, we apply text mining techniques to unstructured data of petitions to elicit keywords from petitions and identify groups of petitions with the elicited keywords. Moreover, we apply data mining techniques to structured data of the identified petition groups on purpose to forecast the trend of petitions. Our approach based on applying text and data mining techniques decreases time-consuming manual works on reading and classifying a large number of petitions, and contributes to increasing accuracy in evaluating the trend of petitions. Eventually, it helps petition inspectors to give more attention on detecting and tracking important groups of petitions that possibly grow as nationwide problems. Further, the petitions ordered by their petition groups' trend values can be used as the baseline for making a better decision on the national law and policy strategy. (C) 2010 Elsevier Ltd. All rights reserved.
引用
收藏
页码:7255 / 7268
页数:14
相关论文
共 59 条
[1]  
*ACRC, 2008, DEM CONT SESS E CHAL
[2]   Comparing radial basis function and feed-forward neural networks assisted by linear discriminant or principal component analysis for simultaneous spectrophotometric quantification of mercury and copper [J].
Akhlaghi, Y ;
Kompany-Zareh, M .
ANALYTICA CHIMICA ACTA, 2005, 537 (1-2) :331-338
[3]   Bankruptcy forecasting:: An empirical comparison of AdaBoost and neural networks [J].
Alfaro, Esteban ;
Garcia, Noelia ;
Gamez, Matias ;
Elizondo, David .
DECISION SUPPORT SYSTEMS, 2008, 45 (01) :110-122
[4]   Clustering of document collection - A weighting approach [J].
Aliguliyev, Ramiz M. .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (04) :7904-7916
[5]  
[Anonymous], 2005, Discovering Knowledge in Data: An Introduction to Data Mining
[6]  
Atkinson R.D., 2008, DIGITAL QUALITY LIFE
[7]  
Berry M.J., 2000, MASTERING DATA MININ
[8]   Partitioning-based clustering for Web document categorization [J].
Boley, D ;
Gini, M ;
Gross, R ;
Han, EH ;
Hastings, K ;
Karypis, G ;
Kumar, V ;
Mobasher, B ;
Moore, J .
DECISION SUPPORT SYSTEMS, 1999, 27 (03) :329-341
[9]   Business data mining - a machine learning perspective [J].
Bose, I ;
Mahapatra, RK .
INFORMATION & MANAGEMENT, 2001, 39 (03) :211-225
[10]   Applying decision tree and neural network to increase quality of dermatologic diagnosis [J].
Chang, Chun-Lang ;
Chen, Chih-Hao .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) :4035-4041