Predicting consumer behavior with Web search

被引:340
作者
Goel, Sharad [1 ]
Hofman, Jake M. [1 ]
Lahaie, Sebastien [1 ]
Pennock, David M. [1 ]
Watts, Duncan J. [1 ]
机构
[1] Yahoo Res, Microecon & Social Syst, New York, NY 10018 USA
关键词
culture; predictions;
D O I
10.1073/pnas.1005962107
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Recent work has demonstrated that Web search volume can "predict the present," meaning that it can be used to accurately track outcomes such as unemployment levels, auto and home sales, and disease prevalence in near real time. Here we show that what consumers are searching for online can also predict their collective future behavior days or even weeks in advance. Specifically we use search query volume to forecast the opening weekend box-office revenue for feature films, first-month sales of video games, and the rank of songs on the Billboard Hot 100 chart, finding in all cases that search counts are highly predictive of future outcomes. We also find that search counts generally boost the performance of baseline models fit on other publicly available data, where the boost varies from modest to dramatic, depending on the application in question. Finally, we reexamine previous work on tracking flu trends and show that, perhaps surprisingly, the utility of search data relative to a simple autoregressive model is modest. We conclude that in the absence of other data sources, or where small improvements in predictive performance are material, search queries provide a useful guide to the near future.
引用
收藏
页码:17486 / 17490
页数:5
相关论文
共 15 条
  • [1] [Anonymous], 2005, Proceedings 11th International Conference Knowledge Discovery in Data Mining, DOI DOI 10.1145/1081870.1081883
  • [2] [Anonymous], 2010, ARXIV10035699
  • [3] Askitas, 2009, APPL EC Q, V55, P107, DOI [DOI 10.3790/AEQ.55.2.107, 10.2139/ssrn.1465341, 10.3790/aeq.55.2.107]
  • [4] Choi H, 2009, PREDICTING INITIAL C, P1
  • [5] Choi H., 2009, Predicting the Present with Google Trends
  • [6] Cancer Internet search activity on a major search engine, United States 2001-2003
    Cooper, CP
    Mallon, KP
    Leadbetter, S
    Pollack, LA
    Peipins, LA
    [J]. JOURNAL OF MEDICAL INTERNET RESEARCH, 2005, 7 (03)
  • [7] Using Web-based search data to predict macroeconomic statistics
    Ettredge, M
    Gerdes, J
    Karuga, G
    [J]. COMMUNICATIONS OF THE ACM, 2005, 48 (11) : 87 - 92
  • [8] Eysenbach Gunther, 2006, AMIA Annu Symp Proc, P244
  • [9] Detecting influenza epidemics using search engine query data
    Ginsberg, Jeremy
    Mohebbi, Matthew H.
    Patel, Rajan S.
    Brammer, Lynnette
    Smolinski, Mark S.
    Brilliant, Larry
    [J]. NATURE, 2009, 457 (7232) : 1012 - U4
  • [10] Web Queries as a Source for Syndromic Surveillance
    Hulth, Anette
    Rydevik, Gustaf
    Linde, Annika
    [J]. PLOS ONE, 2009, 4 (02):