Stylistic text classification using functional lexical features

Cited by: 92
Authors
Argamon, Shlomo
Whitelaw, Casey
Chase, Paul
Hota, Sobhan Raj
Garg, Navendu
Levitan, Shlomo
Affiliations
[1] IIT, Dept Comp Sci, Linguist Cognit Lab, Chicago, IL 60616 USA
[2] Univ Sydney, Sch Informat Technol, Sydney, NSW 2006, Australia
Source
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY | 2007 / Vol. 58 / No. 6
DOI
10.1002/asi.20553
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Most text analysis and retrieval work to date has focused on the topic of a text; that is, what it is about. However, a text also contains much useful information in its style, or how it is written. This includes information about its author, its purpose, feelings it is meant to evoke, and more. This article develops a new type of lexical feature for use in stylistic text classification, based on taxonomies of various semantic functions of certain choice words or phrases. We demonstrate the usefulness of such features for the stylistic text classification tasks of determining author identity and nationality, the gender of literary characters, a text's sentiment (positive/negative evaluation), and the rhetorical character of scientific journal articles. We further show how the use of functional features aids in gaining insight about stylistic differences among different kinds of texts.
Pages: 802-822
Page count: 21