CUSTOMER SEGMENTATION AND CLASSIFICATION FROM BLOGS BY USING DATA MINING: AN EXAMPLE OF VOIP PHONE

被引:7
作者
Chen, Long-Sheng [1 ]
Hsu, Chun-Chin [2 ]
Chen, Mu-Chen [3 ]
机构
[1] Chaoyang Univ Technol, Dept Informat Management, Wufong Township 41349, Taichung County, Taiwan
[2] Chaoyang Univ Technol, Dept Ind Engn & Management, Wufong Township 41349, Taichung County, Taiwan
[3] Natl Chiao Tung Univ, Inst Traff & Transportat, Taipei, Taiwan
关键词
Back-propagation neural network; Blog; Data mining; Self-organizing map; Sparse data; Support vector machines; DECISION TREE; WEB;
D O I
10.1080/01969720903152593
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Blogs have been considered the 4th Internet application that can cause radical changes in the world, after e-mail, instant messaging, and Bulletin Board System (BBS). Many Internet users rely heavily on them to express their emotions and personal comments on whatever topics interest them. Nowadays, blogs have become the popular media and could be viewed as new marketing channels. Depending on the blog search engine, Technorati, we tracked about 94 million blogs in August 2007. It also reported that a whole new blog is created every 7.4 seconds and 275,000 blogs are updated daily. These figures can be used to illustrate the reason why more and more companies attempt to discover useful knowledge from this vast number of blogs for business purposes. Therefore, blog mining could be a new trend of web mining. The major objective of this study is to present a structure that includes unsupervised (self-organizing map) and supervised learning methods (back-propagation neural networks, decision tree, and support vector machines) for extracting knowledge from blogs, namely, a blog mining (BM) model. Moreover, a real case regarding VoIP (Voice over Internet Protocol) phone products is provided to demonstrate the effectiveness of the proposed method.
引用
收藏
页码:608 / 632
页数:25
相关论文
共 51 条
[1]  
[Anonymous], 2009, A practical guide to support vector classification
[2]   A public outreach in epilepsy surgery using a serial novel on BLOG: A preliminary report [J].
Asano, Eishi .
BRAIN & DEVELOPMENT, 2007, 29 (02) :102-104
[3]   The use of data mining to predict web performance [J].
Borzemski, Leszek .
CYBERNETICS AND SYSTEMS, 2006, 37 (06) :587-608
[4]  
CHAKRABARTI S, 2000, SIGKDD EXPLORATIONS, V1, P1
[5]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[6]   Mining communities and their relationships in blogs: A study of online hate groups [J].
Chau, Michael ;
Xu, Jennifer .
INTERNATIONAL JOURNAL OF HUMAN-COMPUTER STUDIES, 2007, 65 (01) :57-70
[7]   Web mining: Machine learning for Web applications [J].
Chen, HC ;
Chau, M .
ANNUAL REVIEW OF INFORMATION SCIENCE AND TECHNOLOGY, 2004, 38 :289-329
[8]   A personalized recommender system based on web usage mining and decision tree induction [J].
Cho, YH ;
Kim, JK ;
Kim, SH .
EXPERT SYSTEMS WITH APPLICATIONS, 2002, 23 (03) :329-342
[9]   A short walk in the Blogistan [J].
Cohen, E ;
Krishnamurthy, B .
COMPUTER NETWORKS, 2006, 50 (05) :615-630
[10]  
Cristianini N., 2000, An Introduction to Support Vector Machines and Other Kernel-based Learning Methods, DOI DOI 10.1017/CB09780511801389