Sentiment Analyzer: Extracting sentiments about a given topic using natural language processing techniques

被引:242
作者
Yi, JH [1 ]
Nasukawa, T [1 ]
Bunescu, R [1 ]
Niblack, W [1 ]
机构
[1] IBM Corp, Almaden Res Ctr, San Jose, CA 95120 USA
来源
THIRD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS | 2003年
关键词
D O I
10.1109/ICDM.2003.1250949
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present Sentiment Analyzer (SA) that extracts sentiment (or opinion) about a subject from online text documents. Instead of classifying the sentiment of an entire document about a subject, SA detects all references to the given subject, and determines sentiment in each of the references using natural language processing (NLP) techniques. Our sentiment analysis consists of 1) a topic specific feature term extraction, 2) sentiment extraction, and 3) (subject, sentiment) association by relationship analysis. SA utilizes two linguistic resources for the analysis: the sentiment lexicon and the sentiment pattern database. The performance of the algorithms was verified on online product review articles ("digital camera" and "music" reviews), and more general documents including general webpages and news articles.
引用
收藏
页码:427 / 434
页数:8
相关论文
共 25 条
[1]  
[Anonymous], COMPUTATIONAL LINGUI
[2]  
[Anonymous], COMPUTATIONAL LINGUI
[3]  
Berland Matthew, 1999, Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics, P57, DOI DOI 10.3115/1034678.1034697
[4]  
DAS S, 2001, P 8 APFA
[5]  
DAS S, 2001, P 37 ACL C
[6]  
DAVE K, 2003, P 12 INT WWW C
[7]   Predicting the semantic orientation of adjectives [J].
Hatzivassiloglou, V ;
McKeown, KR .
35TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 8TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 1997, :174-181
[8]  
Hearst M.A., 1992, TEXT BASED INTELLIGE
[9]  
KATZ B, 1997, P AAAI SPRING S NLP
[10]  
LI H, 2001, P 7 ACM SIGKDD C