Multi-aspect sentiment analysis for Chinese online social reviews based on topic modeling and HowNet lexicon

被引:202
作者
Fu Xianghua [1 ]
Liu Guo [1 ]
Guo Yanyan [1 ]
Wang Zhiqiang [1 ]
机构
[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Aspect detection; Sentiment analysis; Social reviews; Topic modeling; HowNet lexicon; CLASSIFICATION;
D O I
10.1016/j.knosys.2012.08.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
User-generated reviews on the Web reflect users' sentiment about products, services and social events. Existing researches mostly focus on the sentiment classification of the product and service reviews in document level. Reviews of social events such as economic and political activities, which are called social reviews, have specific characteristics different to the reviews of products and services. In this paper, we propose an unsupervised approach to automatically discover the aspects discussed in Chinese social reviews and also the sentiments expressed in different aspects. The approach is called Multi-aspect Sentiment Analysis for Chinese Online Social Reviews (MSA-COSRs). We first apply the Latent Dirichlet Allocation (LDA) model to discover multi-aspect global topics of social reviews, and then extract the local topic and associated sentiment based on a sliding window context over the review text. The aspect of the local topic is identified by a trained LDA model, and the polarity of the associated sentiment is classified by HowNet lexicon. The experiment results show that MSA-COSR cannot only obtain good topic partitioning results, but also help to improve sentiment analysis accuracy. It helps to simultaneously discover multi-aspect fine-grained topics and associated sentiment. (C) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:186 / 195
页数:10
相关论文
共 42 条
[1]  
[Anonymous], 2010, Jointly modeling aspects and opinions with a MaxEnt-LDA hybrid
[2]  
[Anonymous], 2008, P ACL 08 HLT ASS COM
[3]  
[Anonymous], 2005, P 14 ACM INT C INF
[4]  
[Anonymous], 2006, P 10 C COMPUTATIONAL
[5]  
[Anonymous], 2009, P 2009 C EMPIRICAL M, DOI 10.3115/1699648.1699700
[6]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[7]  
Bo Pang, 2008, Foundations and Trends in Information Retrieval, V2, P1, DOI 10.1561/1500000001
[8]   Protein-to-Protein Interactions: Technologies, Databases, and Algorithms [J].
Cannataro, Mario ;
Guzzi, Pietro H. ;
Veltri, Pierangelo .
ACM COMPUTING SURVEYS, 2010, 43 (01)
[9]  
Cao Juan, 2008, Chinese Journal of Computers, V31, P1780
[10]  
[杜伟夫 Du Weifu], 2009, [计算机研究与发展, Journal of Computer Research and Development], V46, P1713