Evaluating and understanding text-based stock price prediction models

被引:69
作者
de Fortuny, Enric Junque [1 ]
De Smedt, Tom [2 ]
Martens, David [1 ]
Daelemans, Walter [2 ]
机构
[1] Univ Antwerp, Fac Appl Econ, B-2000 Antwerp, Belgium
[2] Univ Antwerp, Fac Arts, B-2000 Antwerp, Belgium
关键词
Stock; Prediction; Support Vector Machine; Sentiment mining; Evaluation; SYSTEMS; NEWS;
D O I
10.1016/j.ipm.2013.12.002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Despite the fact that both the Efficient Market Hypothesis and Random Walk Theory postulate that it is impossible to predict future stock prices based on currently available information, recent advances in empirical research have been proving the opposite by achieving what seems to be better than random prediction performance. We discuss some of the (dis)advantages of the most widely used performance metrics and conclude that is difficult to assess the external validity of performance using some of these measures. Moreover, there remain many questions as to the real-world applicability of these empirical models. In the first part of this study we design novel stock price prediction models, based on state-of-the-art text-mining techniques to assert whether we can predict the movement of stock prices more accurately by including indicators of irrationality. Along with this, we discuss which metrics are most appropriate for which scenarios in order to evaluate the models. Finally, we discuss how to gain insight into text-mining-based stock price prediction models in order to evaluate, validate and refine the models. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:426 / 441
页数:16
相关论文
共 36 条
[11]  
Bodie Z., 2008, INVESTMENTS, V7th
[12]  
Breiman L, 1984, OLSHEN STONE CLASSIF, DOI 10.1201/9781315139470
[13]   An overview and framework for PD backtesting and benchmarking [J].
Castermans, G. ;
Martens, D. ;
Van Gestel, T. ;
Hamers, B. ;
Baesens, B. .
JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2010, 61 (03) :359-373
[14]   Media coverage in times of political crisis: A text mining approach [J].
de Fortuny, Enric Junque ;
De Smedt, Tom ;
Martens, David ;
Daelemans, Walter .
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (14) :11616-11622
[15]  
De Smedt T, 2012, LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, P3568
[16]  
De Smedt T, 2012, J MACH LEARN RES, V13, P2063
[17]  
Esuli Andrea., 2006, LREC 2006 Proceedings, 2006, S, P417
[18]   THE BEHAVIOR OF STOCK-MARKET PRICES [J].
FAMA, EF .
JOURNAL OF BUSINESS, 1965, 38 (01) :34-105
[19]  
Fayyad U, 1996, AI MAG, V17, P37
[20]   Stock prediction: Integrating text mining approach using real-time news [J].
Fung, GPC ;
Yu, JX ;
Lam, W .
2003 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE FOR FINANCIAL ENGINEERING, PROCEEDINGS, 2003, :395-402