Lexicon-Based Methods for Sentiment Analysis

被引:1742
作者
Taboada, Maite [1 ]
Brooke, Julian [2 ]
Tofiloski, Milan [3 ]
Voll, Kimberly [4 ]
Stede, Manfred [5 ]
机构
[1] Simon Fraser Univ, Dept Linguist, Burnaby, BC V5A 1S6, Canada
[2] Univ Toronto, Dept Comp Sci, Toronto, ON M5S 3G4, Canada
[3] Simon Fraser Univ, Sch Comp Sci, Burnaby, BC V5A 1S6, Canada
[4] Univ British Columbia, Dept Comp Sci, Vancouver, BC V6T 1Z4, Canada
[5] Univ Potsdam, Dept Linguist, D-14476 Golm, Germany
基金
加拿大自然科学与工程研究理事会;
关键词
CLASSIFICATION; EXPRESSIONS; LANGUAGE;
D O I
10.1162/COLI_a_00049
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a lexicon-based approach to extracting sentiment from text. The Semantic Orientation CALculator (SO-CAL) uses dictionaries of words annotated with their semantic orientation (polarity and strength), and incorporates intensification and negation. SO-CAL is applied to the polarity classification task, the process of assigning a positive or negative label to a text that captures the text's opinion towards its main subject matter. We show that SO-CAL's performance is consistent across domains and on completely unseen data. Additionally, we describe the process of dictionary creation, and our use of Mechanical Turk to check dictionaries for consistency and reliability.
引用
收藏
页码:267 / 307
页数:41
相关论文
共 120 条
[21]  
[Anonymous], P INT C REC ADV NAT
[22]  
[Anonymous], THESIS BRANDEIS U WA
[23]  
Asher N., 2008, Coling 2008' Companion volume: Posters. Manchester, P7
[24]  
Asher N, 2009, LINGUIST INVESTIG, V32, P279
[25]  
BARTLETT J, 2008, SAS GLOBAL FORUM 200
[26]  
Batson D., 1992, EMOTION, P294
[27]   Language patterns and ATTITUDE [J].
Bednarek, Monika .
FUNCTIONS OF LANGUAGE, 2009, 16 (02) :165-192
[28]  
Benamara Farah., 2007, P INT C WEBLOGS SOCI
[29]   ADVERBIAL STANCE TYPES IN ENGLISH [J].
BIBER, D ;
FINEGAN, E .
DISCOURSE PROCESSES, 1988, 11 (01) :1-34
[30]  
Blitzer John., 2007, Annual Meeting-Association For Computational Linguistics, V45, P440