Evaluating and understanding text-based stock price prediction models

被引:69
作者
de Fortuny, Enric Junque [1 ]
De Smedt, Tom [2 ]
Martens, David [1 ]
Daelemans, Walter [2 ]
机构
[1] Univ Antwerp, Fac Appl Econ, B-2000 Antwerp, Belgium
[2] Univ Antwerp, Fac Arts, B-2000 Antwerp, Belgium
关键词
Stock; Prediction; Support Vector Machine; Sentiment mining; Evaluation; SYSTEMS; NEWS;
D O I
10.1016/j.ipm.2013.12.002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Despite the fact that both the Efficient Market Hypothesis and Random Walk Theory postulate that it is impossible to predict future stock prices based on currently available information, recent advances in empirical research have been proving the opposite by achieving what seems to be better than random prediction performance. We discuss some of the (dis)advantages of the most widely used performance metrics and conclude that is difficult to assess the external validity of performance using some of these measures. Moreover, there remain many questions as to the real-world applicability of these empirical models. In the first part of this study we design novel stock price prediction models, based on state-of-the-art text-mining techniques to assert whether we can predict the movement of stock prices more accurately by including indicators of irrationality. Along with this, we discuss which metrics are most appropriate for which scenarios in order to evaluate the models. Finally, we discuss how to gain insight into text-mining-based stock price prediction models in order to evaluate, validate and refine the models. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:426 / 441
页数:16
相关论文
共 36 条
[1]  
[Anonymous], NATURE STAT LEARNING
[2]  
[Anonymous], 2010, P 7 INT C LANG RES E
[3]  
[Anonymous], ACM COMPUTING SURVEY
[4]  
[Anonymous], DATA MINING EVOLUTIO
[5]  
[Anonymous], 4 INT S NEUR NETW DY
[6]  
[Anonymous], P 9 INT C INF KNOWL
[7]  
[Anonymous], PATTERN RECOGN LETT
[8]  
[Anonymous], 2011, J COMPUT SCI-NETH, DOI DOI 10.1016/j.jocs.2010.12.007
[9]  
[Anonymous], COMPUTATIONAL LINGUI
[10]  
[Anonymous], BRIEFINGS BIOINFORMA