Beyond the hype: Big data concepts, methods, and analytics

被引:1999
作者
Gandomi, Amir [1 ]
Haider, Murtaza [1 ]
机构
[1] Ryerson Univ, Ted Rogers Sch Management, Toronto, ON M5B 2K3, Canada
关键词
Big data analytics; Big data definition; Unstructured data analytics; Predictive analytics; BUSINESS INTELLIGENCE; CHALLENGES;
D O I
10.1016/j.ijinfomgt.2014.10.007
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
Size is the first, and at times, the only dimension that leaps out at the mention of big data. This paper attempts to offer a broader definition of big data that captures its other unique and defining characteristics. The rapid evolution and adoption of big data by industry has leapfrogged the discourse to popular outlets, forcing the academic press to catch up. Academic journals in numerous disciplines, which will benefit from a relevant discussion of big data, have yet to cover the topic. This paper presents a consolidated description of big data by integrating definitions from practitioners and academics. The paper's primary focus is on the analytic methods used for big data. A particular distinguishing feature of this paper is its focus on analytics related to unstructured data, which constitute 95% of big data. This paper highlights the need to develop appropriate and efficient analytical methods to leverage massive volumes of heterogeneous data in unstructured text, audio, and video formats. This paper also reinforces the need to devise new tools for predictive analytics for structured big data. The statistical methods in practice were devised to infer from sample data. The heterogeneity, noise, and the massive size of structured big data calls for developing computationally efficient algorithms that may avoid big data pitfalls, such as spurious correlation. (C) 2014 The Authors. Published by Elsevier Ltd.
引用
收藏
页码:137 / 144
页数:8
相关论文
共 35 条
  • [1] Aggarwal CC, 2011, SOCIAL NETWORK DATA ANALYTICS, P1
  • [2] Al Hasan M, 2011, SOCIAL NETWORK DATA ANALYTICS, P243
  • [3] [Anonymous], 2010, OSDI
  • [4] Sentiment analysis and opinion mining
    [J]. Liu, Bing, 1600, Morgan and Claypool Publishers (05): : 1 - 184
  • [5] [Anonymous], 2011, BIG DATA NEXT FRONTI
  • [6] [Anonymous], 2012, ANAL REAL WORLD USE
  • [7] [Anonymous], 2010, Synthesis Lectures on Data Mining and Knowledge Discovery, DOI [DOI 10.2200/S00298ED1V01Y201009DMK003, 10.2200/s00298ed1v01y201009dmk003]
  • [8] [Anonymous], 2010, ADV SPEECH RECOGNITI
  • [9] [Anonymous], 2012, Small and midsize companies look to make big gains with "big data"
  • [10] Barbier G, 2011, SOCIAL NETWORK DATA ANALYTICS, P327