Sentiment analysis: A combined approach

被引:385
作者
Prabowo, Rudy [1 ]
Thelwall, Mike [1 ]
机构
[1] Wolverhampton Univ, Sch Comp & Informat Technol, Wolverhampton WV1 1SB, England
关键词
Sentiment analysis; Unsupervised learning; Machine learning; Hybrid classification; WEB;
D O I
10.1016/j.joi.2009.01.003
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Sentiment analysis is an important current research area. This paper combines rule-based classification, supervised learning and machine learning into a new combined method. This method is tested on movie reviews, product reviews and MySpace comments. The results show that a hybrid classification can improve the classification effectiveness in terms of micro- and macro-averaged F-1. F-1 is a measure that takes both the precision and recall of a classifier's effectiveness into account. In addition, we propose a semi-automatic, complementary approach in which each classifier can contribute to other classifiers to achieve a good level of effectiveness. (C) 2009 Elsevier Ltd. All rights reserved.
引用
收藏
页码:143 / 157
页数:15
相关论文
共 47 条
[1]   Data collection methods on the Web for informetric purposes - A review and analysis [J].
Bar-Ilan, J .
SCIENTOMETRICS, 2001, 50 (01) :7-32
[2]  
Bar-Ilan J., 1999, Cybermetrics, V3, P1
[3]  
Belew R.K., 2000, Finding Out About: A Cognitive Perspective on Search Engine Technology and the WWW
[4]  
Calvo R. A., 2000, Intelligent Data Analysis, V4, P411
[5]  
Choi Y., 2005, P C HUMAN LANGUAGE T, P355
[6]  
CHURCH KW, 1990, 27TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, P76
[7]   SOME METHODS FOR STRENGTHENING THE COMMON X2 TESTS [J].
COCHRAN, WG .
BIOMETRICS, 1954, 10 (04) :417-451
[8]  
Cohen W. W., 1995, P 12 INT C MACH LEAR, P115, DOI DOI 10.1016/B978-1-55860-377-6.50023-2
[9]  
CONRAD JG, 1994, P 17 ANN INT ACM SIG, P260
[10]  
Dave K., 2003, Proceedings of the 12th international conference on world wide web, P519, DOI [DOI 10.1145/775152.775226, 10.1145/775152.775226]