Sentiment analysis in financial texts

Cited by: 86
Authors
Chan, Samuel W. K. [1 ]
Chong, Mickey W. C. [1 ]
Affiliations
[1] Chinese Univ Hong Kong, Dept Decis Sci, Shatin, Hong Kong, Peoples R China
Keywords
Text analysis; Financial time series; Decision support systems; INVESTOR SENTIMENT; NEWS; INFORMATION; VOLATILITY; MANAGEMENT;
DOI
10.1016/j.dss.2016.10.006
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The growth of financial texts in the wake of big data has challenged most organizations and brought escalating demands for analysis tools. In general, text streams are more challenging to handle than numeric data streams. Text streams are unstructured by nature, but they represent collective expressions that are of value in any financial decision. It can be both daunting and necessary to make sense of unstructured textual data. In this study, we address key questions related to the explosion of interest in how to extract insight from unstructured data and how to determine if such insight provides any hints concerning the trends of financial markets. A sentiment analysis engine (SAE) is proposed that takes advantage of linguistic analyses based on grammars. This engine extends sentiment analysis not only at the word token level, but also at the phrase level within each sentence. An assessment heuristic is applied to extract the collective expressions shown in the texts. Also, three evaluations are presented to assess the performance of the engine. First, several standard parsing evaluation metrics are applied to two treebanks. Second, a benchmark evaluation using a dataset of English movie reviews is conducted. Results show our SAE outperforms the traditional bag-of-words approach. Third, a financial text stream with twelve million words that aligns with a stock market index is examined. The evaluation results and their statistical significance provide strong evidence of a long persistence in the mood time series generated by the engine. In addition, our approach establishes grounds for belief that the sentiments expressed through text streams are helpful for analyzing the trends in a stock market index, although such sentiments and market indices are normally considered to be completely uncorrelated. (C) 2016 Elsevier B.V. All rights reserved.
Pages: 53-64
Page count: 12