Big Data: A Survey

被引:1639
作者
Chen, Min [1 ]
Mao, Shiwen [2 ]
Liu, Yunhao [3 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan 430074, Peoples R China
[2] Auburn Univ, Dept Elect & Comp Engn, Auburn, AL 36849 USA
[3] Tsinghua Univ, Sch Software, TNLIST, Beijing 100084, Peoples R China
基金
美国国家科学基金会;
关键词
Big data; Cloud computing; Internet of things; Data center; Hadoop; Smart grid; Big data analysis; OF-THE-ART; MAP-REDUCE; ENERGY-EFFICIENT; SENSOR NETWORKS; PERFORMANCE; SCALE; CHALLENGES; SYSTEM; FUTURE; TOP;
D O I
10.1007/s11036-013-0489-0
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we review the background and state-of-the-art of big data. We first introduce the general background of big data and review related technologies, such as could computing, Internet of Things, data centers, and Hadoop. We then focus on the four phases of the value chain of big data, i.e., data generation, data acquisition, data storage, and data analysis. For each phase, we introduce the general background, discuss the technical challenges, and review the latest advances. We finally examine the several representative applications of big data, including enterprise management, Internet of Things, online social networks, medial applications, collective intelligence, and smart grid. These discussions aim to provide a comprehensive overview and big-picture to readers of this exciting area. This survey is concluded with a discussion of open problems and future directions.
引用
收藏
页码:171 / 209
页数:39
相关论文
共 155 条
  • [51] SCOPE: Easy and Efficient Parallel Processing of Massive Data Sets
    Chaiken, Ronnie
    Jenkins, Bob
    Larson, Per-Ake
    Ramsey, Bill
    Shakib, Darren
    Weaver, Simon
    Zhou, Jingren
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2008, 1 (02): : 1265 - 1276
  • [52] Focused crawling: a new approach to topic-specific Web resource discovery
    Chakrabarti, S
    van den Berg, M
    Dom, B
    [J]. COMPUTER NETWORKS-THE INTERNATIONAL JOURNAL OF COMPUTER AND TELECOMMUNICATIONS NETWORKING, 1999, 31 (11-16): : 1623 - 1640
  • [53] Chakrabarti Soumen., 2000, ACM SIGKDD Explorations, P1, DOI [10.1145/846183.846187, DOI 10.1145/846183.846187]
  • [54] Chandramohan V, 2002, CONF LOCAL COMPUT NE, P728, DOI 10.1109/LCN.2002.1181851
  • [55] Bigtable: A distributed storage system for structured data
    Chang, Fay
    Dean, Jeffrey
    Ghemawat, Sanjay
    Hsieh, Wilson C.
    Wallach, Deborah A.
    Burrows, Mike
    Chandra, Tushar
    Fikes, Andrew
    Gruber, Robert E.
    [J]. ACM TRANSACTIONS ON COMPUTER SYSTEMS, 2008, 26 (02):
  • [56] An Overview of Business Intelligence Technology
    Chaudhuri, Surajit
    Dayal, Umeshwar
    Narasayya, Vivek
    [J]. COMMUNICATIONS OF THE ACM, 2011, 54 (08) : 88 - 98
  • [57] Crockford D., 2006, APPL JSON MEDIA TYPE
  • [58] Cukier K, 2010, ECONOMIST NEWSPAPER
  • [59] Das T., 2012, P 9 USENIX C NETW SY, P2, DOI DOI 10.1111/J.1095-8649.2005.00662.X
  • [60] Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137