QPLSA: Utilizing quad-tuples for aspect identification and rating

被引:16
作者
Luo, Wenjuan [1 ,2 ]
Zhuang, Fuzhen [1 ]
Zhao, Weizhong [3 ]
He, Qing [1 ]
Shi, Zhongzhi [1 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Xiangtan Univ, Coll Informat Engn, Xiangtan 411105, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
Quad-tuple PLSA; Aspect mining; Sentiment analysis;
D O I
10.1016/j.ipm.2014.08.004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
080201 [机械制造及其自动化];
摘要
Aspect level sentiment analysis is important for numerous opinion mining and market analysis applications. In this paper, we study the problem of identifying and rating review aspects, which is the fundamental task in aspect level sentiment analysis. Previous review aspect analysis methods seldom consider entity or rating but only 2-tuples, i.e., head and modifier pair, e.g., in the phrase "nice room", "room" is the head and "nice" is the modifier. To solve this problem, we novelly present a Quad-tuple Probability Latent Semantic Analysis (QPLSA), which incorporates entity and its rating together with the 2-tuples into the PLSA model. Specifically, QPLSA not only generates fine-granularity aspects, but also captures the correlations between words and ratings. We also develop two novel prediction approaches, the Quad-tuple Prediction (from the global perspective) and the Expectation Prediction (from the local perspective). For evaluation, systematic experiments show that: Quad-tuple PLSA outperforms 2-tuple PLSA significantly on both aspect identification and aspect rating prediction for publication datasets. Moreover, for aspect rating prediction, QPLSA shows significant superiority over state-of-the-art baseline methods. Besides, the Quad-tuple Prediction and the Expectation Prediction also show their strong ability in aspect rating on different datasets. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:25 / 41
页数:17
相关论文
共 38 条
[1]
[Anonymous], 2011, P 2011 SIAM INT C DA
[2]
[Anonymous], 2005, Proceedings of HLT/EMNLP on Interactive Demonstrations
[3]
[Anonymous], 2011, PROCEEDINGS OF THE 2
[4]
[Anonymous], 1999, NAT LANG ENG, DOI DOI 10.1017/S1351324999002181
[5]
[Anonymous], 2010, HUMAN LANGUAGE TECHN
[6]
[Anonymous], P 10 ACM SIGKDD INT
[7]
[Anonymous], 2008, P 17 INT C WORLD WID, DOI DOI 10.1145/1367497.1367513
[8]
[Anonymous], NIPS
[9]
[Anonymous], 2005, P C HUM LANG TECHN E
[10]
[Anonymous], 2009, P INT C WORLD WIDE W, DOI DOI 10.1145/1526709.1526728