Applying text and data mining techniques to forecasting the trend of petitions filed to e-People

被引:22
作者
Suh, Jong Hwan [1 ]
Park, Chung Hoon [2 ]
Jeon, Si Hyun [3 ]
机构
[1] SAMSUNG SDS, Informat & Commun Technol Res & Dev Ctr, Seoul 135798, South Korea
[2] SAMSUNG SDS, Consulting Div, Seoul 135918, South Korea
[3] ACRC, Seoul 125705, South Korea
关键词
Text mining; Data mining; Petition; Keyword extracting; Document clustering; Forecasting; e-Government; Open Innovation; e-People; RADIAL BASIS FUNCTION; K-MEANS ALGORITHM; NEURAL-NETWORK; DECISION TREE; MACHINE; INTEGRATION; ENSEMBLE; MODEL;
D O I
10.1016/j.eswa.2010.04.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As the Internet has been the virtual place where citizens are united and their opinions are promptly shifted into the action, two way communications between the government sector and the citizen have been more important among activities of e-Government. Hence, Anti-corruption and Civil Rights Commission (ACRC) in the Republic of Korea has constructed the online petition portal system named e-People. In addition, the nation's Open Innovation through e-People has gained increasing attention. That is because e-People can be applied for the virtual space where citizens participate in improving the national law and policy by simply filing petitions to e-People as the voice of the nation. However, currently there are problems and challenging issues to be solved until e-People can function as the virtual space for the nation's Open Innovation based on petitions collected from citizens. First, there is no objective and systematic method for analyzing a large number of petitions filed to e-People without a lot of manual works of petition inspectors. Second, e-People is required to forecast the trend of petitions filed to e-People more accurately and quickly than petition inspectors for making a better decision on the national law and policy strategy. Therefore, in this paper, we propose the framework of applying text and data mining techniques not only to analyze a large number of petitions filed to e-People but also to predict the trend of petitions. In detail, we apply text mining techniques to unstructured data of petitions to elicit keywords from petitions and identify groups of petitions with the elicited keywords. Moreover, we apply data mining techniques to structured data of the identified petition groups on purpose to forecast the trend of petitions. Our approach based on applying text and data mining techniques decreases time-consuming manual works on reading and classifying a large number of petitions, and contributes to increasing accuracy in evaluating the trend of petitions. Eventually, it helps petition inspectors to give more attention on detecting and tracking important groups of petitions that possibly grow as nationwide problems. Further, the petitions ordered by their petition groups' trend values can be used as the baseline for making a better decision on the national law and policy strategy. (C) 2010 Elsevier Ltd. All rights reserved.
引用
收藏
页码:7255 / 7268
页数:14
相关论文
共 59 条
[31]   Integration of self-organizing feature maps neural network and genetic K-means algorithm for market segmentation [J].
Kuo, RJ ;
An, YL ;
Wang, HS ;
Chung, WJ .
EXPERT SYSTEMS WITH APPLICATIONS, 2006, 30 (02) :313-324
[32]   Integration of self-organizing feature map and K-means algorithm for market segmentation [J].
Kuo, RJ ;
Ho, LM ;
Hu, CM .
COMPUTERS & OPERATIONS RESEARCH, 2002, 29 (11) :1475-1493
[33]   Neural network techniques for financial performance prediction: integrating fundamental and technical analysis [J].
Lam, M .
DECISION SUPPORT SYSTEMS, 2004, 37 (04) :567-581
[34]  
LEE JA, 2004, STRATEGY IMPLEMENTIN
[35]   Back-propagation neural network for long-term tidal predictions [J].
Lee, TL .
OCEAN ENGINEERING, 2004, 31 (02) :225-238
[36]   Text document clustering based on frequent word meaning sequences [J].
Li, Yanjun ;
Chung, Soon M. ;
Holt, John D. .
DATA & KNOWLEDGE ENGINEERING, 2008, 64 (01) :381-404
[37]  
Marchionini G, 2003, COMMUN ACM, V46, P24
[38]  
Palvia S.C., 2007, International Conference on E-governance, P1
[39]  
PANDA SS, 2007, APPL SOFT COMPUT, V29, P927
[40]   Document clustering for electronic meetings: an experimental comparison of two techniques [J].
Roussinov, DG ;
Chen, HC .
DECISION SUPPORT SYSTEMS, 1999, 27 (1-2) :67-79