A MIDAS modelling framework for Chinese inflation index forecast incorporating Google search data

Cited by: 51
Authors
Li, Xin [1]
Shang, Wei [2]
Wang, Shouyang [2]
Ma, Jian [3]
Affiliations
[1] Univ Chinese Acad Sci, Sch Management, Beijing, Peoples R China
[2] Chinese Acad Sci, Acad Math & Syst Sci, Beijing, Peoples R China
[3] City Univ Hong Kong, Dept Informat Syst, Kowloon, Hong Kong, Peoples R China
Funding
US National Science Foundation;
Keywords
Inflation index forecast; Consumer price index; MIDAS modelling framework; User generated content; Google search data; SENTIMENT; GROWTH; MEDIA;
DOI
10.1016/j.elerap.2015.01.001
Chinese Library Classification number
F [Economics];
Discipline classification code
02;
Abstract
Increased internet penetration makes it possible for user generated content (UGC) to reflect people's insights and expectations on economic activities. As representative and easily accessible UGC data that reflect public opinion on economic issues, Google search data have been used to forecast macroeconomic indicators in the existing literature. However, very little empirical research has directly used Google search data to improve forecast accuracy. This paper proposes an integrated framework that constructs a keyword base, extracts search data accordingly, and then incorporates the search data into a mixed data sampling (MIDAS) model. Five groups of search data are extracted based on the constructed keywords and are then used in a MIDAS model to forecast the Chinese consumer price index (CPI) from 2004 to 2012. The empirical results indicate that the search data are strongly correlated with the CPI, which is officially released by the National Bureau of Statistics of China; the MIDAS model including the search data outperforms the benchmark models, with an average reduction in root mean square error (RMSE) of 32.9%. This research provides a rigorous and generalizable framework for macroeconomic trend prediction using Google search data, and has great potential for supporting business decisions by eliciting relevant information from UGC data on the Internet. © 2015 Elsevier B.V. All rights reserved.
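The abstract names two computations without spelling them out: weighting higher-frequency search data into a lower-frequency regressor (the core of MIDAS), and the percentage reduction in RMSE against a benchmark. The sketch below illustrates both under stated assumptions. The exponential Almon lag polynomial is one common MIDAS weighting scheme and is assumed here, not confirmed by this record; all function names and parameters are hypothetical and are not taken from the paper.

```python
import math


def exp_almon_weights(theta1, theta2, n_lags):
    """Exponential Almon lag weights, normalised to sum to 1.

    Assumption: this weighting scheme is a common MIDAS choice; the
    record does not state which scheme the paper uses.
    """
    raw = [math.exp(theta1 * k + theta2 * k * k) for k in range(1, n_lags + 1)]
    total = sum(raw)
    return [r / total for r in raw]


def midas_aggregate(high_freq_obs, weights):
    """Collapse high-frequency observations into one low-frequency regressor.

    high_freq_obs[0] is the most recent observation (lag 1).
    """
    if len(high_freq_obs) != len(weights):
        raise ValueError("need exactly one observation per lag weight")
    return sum(w * x for w, x in zip(weights, high_freq_obs))


def rmse(actual, predicted):
    """Root mean square error between two equal-length sequences."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("sequences must be non-empty and of equal length")
    squared = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared) / len(squared))


def rmse_reduction_pct(benchmark_rmse, model_rmse):
    """Percentage by which the model's RMSE falls below the benchmark's."""
    return 100.0 * (benchmark_rmse - model_rmse) / benchmark_rmse
```

In a full MIDAS regression the parameters `theta1` and `theta2` would be estimated jointly with the regression coefficients, typically by nonlinear least squares; here they are plain inputs so the weighting step can be inspected on its own.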
Pages: 112-125
Page count: 14