A hybrid model based on rough sets theory and genetic algorithms for stock price forecasting

被引:169
作者
Cheng, Ching-Hsue [2 ]
Chen, Tai-Liang [3 ]
Wei, Liang-Ying [1 ]
机构
[1] Yuanpei Univ, Dept Informat Management, Hsinchu 30015, Taiwan
[2] Natl Yunlin Univ Sci & Technol, Dept Informat Management, Touliu 640, Yunlin, Taiwan
[3] Wenzao Ursuline Coll Languages, Dept Informat Management & Commun, Kaohsiung 807, Taiwan
关键词
Rough set theory; Genetic algorithms; Cumulative probability distribution approach; Minimize entropy principle approach; Technical indicators; NEURAL-NETWORKS; PREDICTION; DISCRETIZATION;
D O I
10.1016/j.ins.2010.01.014
中图分类号
TP [自动化技术、计算机技术];
学科分类号
080201 [机械制造及其自动化];
摘要
In the stock market, technical analysis is a useful method for predicting stock prices. Although, professional stock analysts and fund managers usually make subjective judgments, based on objective technical indicators, it is difficult for non-professionals to apply this forecasting technique because there are too many complex technical indicators to be considered. Moreover, two drawbacks have been found in many of the past forecasting models: (1) statistical assumptions about variables are required for time series models, such as the autoregressive moving average model (ARMA) and the autoregressive conditional heteroscedasticity (ARCH), to produce forecasting models of mathematical equations, and these are not easily understood by stock investors; and (2) the rules mined from some artificial intelligence (AI) algorithms, such as neural networks (NN), are not easily realized. In order to overcome these drawbacks, this paper proposes a hybrid forecasting model, using multi-technical indicators to predict stock price trends. Further, it includes four proposed procedures in the hybrid model to provide efficient rules for forecasting, which are evolved from the extracted rules with high support value, by using the toolset based on rough sets theory (RST): (1) select the essential technical indicators, which are highly related to the future stock price, from the popular indicators based on a correlation matrix; (2) use the cumulative probability distribution approach (CDPA) and minimize the entropy principle approach (MEPA) to partition technical indicator value and daily price fluctuation into linguistic values, based on the characteristics of the data distribution; (3) employ a RST algorithm to extract linguistic rules from the linguistic technical indicator dataset; and (4) utilize genetic algorithms (GAs) to refine the extracted rules to get better forecasting accuracy and stock return. The effectiveness of the proposed model is verified with two types of performance evaluations, accuracy and stock return, and by using a six-year period of the TAIEX (Taiwan Stock Exchange Capitalization Weighted Stock Index) as the experiment dataset. The experimental results show that the proposed model is superior to the two listed forecasting models (RST and GAs) in terms of accuracy, and the stock return evaluations have revealed that the profits produced by the proposed model are higher than the three listed models (Buy-and-Hold, RST and GAs). (C) 2010 Elsevier Inc. All rights reserved.
引用
收藏
页码:1610 / 1629
页数:20
相关论文
共 50 条
[1]
Acklam P.J., 2004, ALGORITHM COMPUTING
[2]
Using genetic algorithms to find technical trading rules [J].
Allen, F ;
Karjalainen, R .
JOURNAL OF FINANCIAL ECONOMICS, 1999, 51 (02) :245-271
[3]
ANNA B, 2007, COMPUTER INFORM SYST, P3
[4]
[Anonymous], LECT NOTES ARTIFICIA, DOI DOI 10.1007/3-540-69115-4_85
[5]
[Anonymous], TECHNICAL ANAL
[6]
[Anonymous], 2020, PROC INT JOINT C NEU
[7]
AZO ME, 1994, NEURAL NETWORK TIME
[8]
BLICKLE T, 1995, P 6 INT C GEN ALG, P506
[9]
GENERALIZED AUTOREGRESSIVE CONDITIONAL HETEROSKEDASTICITY [J].
BOLLERSLEV, T .
JOURNAL OF ECONOMETRICS, 1986, 31 (03) :307-327
[10]
Box GEP, 1976, Time Series Analysis: Forecasting and Control, V2nd