Use of negation phrases in automatic sentiment classification of product reviews

被引:25
作者
Na, JC [1 ]
Khoo, C [1 ]
Wu, PHJ [1 ]
机构
[1] Nanyang Technol Univ, Sch Commun & Informat, Div Informat Studies, Singapore 637718, Singapore
关键词
D O I
10.1016/j.lcats.2005.04.007
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
This paper reports a study in automatic sentiment classification, i.e., automatically classifying documents as expressing positive or negative sentiments. The Study investigates the effectiveness of using a machine-learning algorithm, support vector machine (SVM), on various text features to classify on-line product reviews into recommended (positive sentiment) and not recommended (negative sentiment). In the first part of this study, several approaches, unigrams (individual words), selected words (such as verb, adjective, and adverb), and words labeled with part-of-speech tags were investigated. Using SVM, the unigram approach obtained an accuracy rate of around 76%. Error analysis suggests various approaches for improving classification accuracy: handling of negation phrases, inferencing from superficial words, and handling the problem of comments on parts of the product. The second part of the study investigated the use of negation phrase n-grams to improve classification accuracy. This approach increased the accuracy rate to 79.33%. Compared with traditional subject classification which mainly uses unigrams, syntactic and semantic processing of text appear more important for sentiment classification. We expect that deeper linguistic processing will help increase accuracy for sentiment classification. (c) 2005 Elsevier Inc. All rights reserved.
引用
收藏
页码:180 / 191
页数:12
相关论文
共 24 条
[1]  
[Anonymous], 1997, Proceedings of the fourteenth international conference on machine learning, DOI DOI 10.1016/J.ESWA.2008.05.026
[2]  
[Anonymous], AM HERITAGE DICT
[3]  
Dave Kushal, 2003, Proceedings of WWW-03, 12th International Conference on the World Wide Web, P519, DOI [DOI 10.1145/775152.775226, 10.1145/775152.775226]
[4]  
Downes W., 2000, LANG LIT, V9, P99, DOI DOI 10.1177/096394700000900201
[5]  
Finn A, 2002, LECT NOTES COMPUT SC, V2291, P353
[6]   Predicting the semantic orientation of adjectives [J].
Hatzivassiloglou, V ;
McKeown, KR .
35TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 8TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 1997, :174-181
[7]  
Hatzivassiloglou V., 2000, Proceedings of the 18th conference on Computational linguistics-Volume, P299, DOI DOI 10.3115/990820.990864
[8]  
Hearst M.A., 1992, TEXT BASED INTELLIGE
[9]  
Joachims T, 1999, MACHINE LEARNING, PROCEEDINGS, P200
[10]  
Joachims T., 1998, Lecture Notes in Computer Science, P137, DOI DOI 10.1007/BFB0026683