Learning subjective language

Cited by: 268
Authors
Wiebe, J [1]
Wilson, T
Bruce, R
Bell, M
Martin, M
Affiliations
[1] Univ Pittsburgh, Dept Comp Sci, Pittsburgh, PA 15260 USA
[2] Univ Pittsburgh, Intelligent Syst Program, Pittsburgh, PA 15260 USA
[3] Univ N Carolina, Dept Comp Sci, Asheville, NC 28804 USA
[4] New Mexico State Univ, Dept Comp Sci, Las Cruces, NM 88003 USA
Keywords
Character recognition - Natural language processing systems;
DOI
10.1162/0891201041850885
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Subjectivity in natural language refers to aspects of language used to express opinions, evaluations, and speculations. There are numerous natural language processing applications for which subjectivity analysis is relevant, including information extraction and text categorization. The goal of this work is learning subjective language from corpora. Clues of subjectivity are generated and tested, including low-frequency words, collocations, and adjectives and verbs identified using distributional similarity. The features are also examined working in concert. The features, generated from different data sets using different procedures, perform consistently: they all do better on the same data sets and worse on the same data sets. In addition, this article shows that the density of subjectivity clues in the surrounding context strongly affects how likely it is that a word is subjective, and it provides the results of an annotation study assessing the subjectivity of sentences with high-density features. Finally, the clues are used to perform opinion piece recognition (a type of text categorization and genre detection) to demonstrate the utility of the knowledge acquired in this article.
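The abstract's notion of clue density in the surrounding context can be illustrated with a minimal sketch. This is not the authors' procedure: the function name `clue_density`, the fixed symmetric window, the lowercase matching, and the toy clue set are all assumptions made here for illustration only.

```python
from typing import List, Set


def clue_density(tokens: List[str], clues: Set[str], window: int = 3) -> List[int]:
    """For each token, count the clue tokens within `window` positions
    on either side, excluding the token itself.

    Assumption: `clues` is a hypothetical, lowercase set of subjectivity
    clues; the record does not specify how density is actually computed.
    """
    # Mark which positions hold a clue (case-insensitive match).
    flags = [tok.lower() in clues for tok in tokens]
    densities = []
    for i in range(len(tokens)):
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        # Clues in the window, minus the token's own flag.
        densities.append(sum(flags[lo:hi]) - flags[i])
    return densities


# Toy usage with an invented clue set.
tokens = ["the", "plan", "is", "absurd", "and", "frankly", "outrageous"]
clues = {"absurd", "frankly", "outrageous"}
print(clue_density(tokens, clues, window=2))
```

A word surrounded by many clues gets a high count; under the abstract's claim, such a word would be more likely to be subjective than the same word in a clue-free context.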
Pages: 277-308
Page count: 32