A framework for building web mining applications in the world of blogs: A case study in product sentiment analysis

被引:16
作者
Costa, Evandro [1 ]
Ferreira, Rafael [1 ]
Brito, Patrick [1 ]
Bittencourt, Ig Ibert [1 ]
Holanda, Olavo [1 ]
Machado, Aydano [1 ]
Marinho, Tarsis [1 ]
机构
[1] Univ Fed Alagoas, Computat Inst, GrOW Grp Otimizacao Web, BR-57072970 Maceio, AL, Brazil
关键词
Semantic web; Social web; Blogs; Sentiment analysis;
D O I
10.1016/j.eswa.2011.09.135
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently there has been much interest in electronic commerce applications that use data mining techniques to explore datasets in the social media context. However, most of the applications have already been developed in an ad hoc manner, mainly, due to the lack of adequate tools, yielding difficulties in customizing applications and requesting high time consuming for constructing and maintaining these applications. This work addresses these problems and proposes a software framework for building Web mining applications in the blog world. The architecture of the proposed framework combines the use of blog crawling and data mining algorithms, in order to provide a complete and flexible solution for building general-purpose Web mining applications. The framework flexibility allows some important customizations, such as the construction of adapters for reading text from different blogs, and the use of different pre-processing techniques and data mining algorithms. In order to improve the efficacy of information extraction from blogs, ontology is used in the blog's description. For this, there are software agents responsible for tracking and indexing blogs related to a specific tag and for mining blog datasets. Moreover, web services are used for encapsulating existing tools and maximize reuse. This framework has been instantiated in order to be applied for helping the blog users to effectively find out relevant information in the blog world. The focus of this paper is on describing the novel software architecture of the general framework (blog crawling and data mining) providing detailed information about the data mining sub-framework, which uses the semantic web services technology for automating service composition and consists on the main research contribution. A case study of an e-commerce application for analyzing the user's sentiment regarding specific products is reported and its results considers the effort reduction when creating a web mining application by using the proposed integrated frameworks and existing data mining tools, as well as a qualitative analysis related to quality aspects of the developed application, such as the evolution impact. (C) 2011 Elsevier Ltd. All rights reserved.
引用
收藏
页码:4813 / 4834
页数:22
相关论文
共 33 条
  • [1] Agrawal R., 1994, P 20 INT C VER LARG, P487, DOI DOI 10.5555/645920.672836
  • [2] [Anonymous], 2005, WSDL S SUBMISSION RE
  • [3] [Anonymous], 2006, Business Process Execution Language for Web Services BPEL and BPEL4WS
  • [4] [Anonymous], 1997, MACHINE LEARNING, MCGRAW-HILL SCIENCE/ENGINEERING/MATH
  • [5] Bass L., 2003, Software Architecture in Practice. Software Engineering
  • [6] Boley H., 2005, SEMANTIC WEB SERVICE
  • [7] Buschmann F., 1996, Pattern-Oriented Software Architecture, Volume 1: A System of Patterns
  • [8] CALADO I, 2009, P S APPL COMP HON HI, P694
  • [9] Web Data Mining System Based on Web Services
    Chen, Chunying
    Zhou, Xiongwei
    Zhang, Jianzhong
    [J]. HIS 2009: 2009 NINTH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS, VOL 3, PROCEEDINGS, 2009, : 216 - +
  • [10] Social media and search
    Cho, Junghoo
    Tomkins, Andrew
    [J]. IEEE INTERNET COMPUTING, 2007, 11 (06) : 13 - 15