ASTERIX: towards a scalable, semistructured data platform for evolving-world models

Cited by: 45
Authors
Behm, Alexander [1]
Borkar, Vinayak R. [1]
Carey, Michael J. [1]
Grover, Raman [1]
Li, Chen [1]
Onose, Nicola [1]
Vernica, Rares [1]
Deutsch, Alin [2]
Papakonstantinou, Yannis [2]
Tsotras, Vassilis J. [3]
Affiliations
[1] Univ Calif Irvine, Irvine, CA 92697 USA
[2] Univ Calif San Diego, San Diego, CA 92103 USA
[3] Univ Calif Riverside, Riverside, CA 92521 USA
Funding
U.S. National Science Foundation
Keywords
Data-intensive computing; Cloud computing; Semistructured data; ASTERIX; Hyracks; Performance; MapReduce
DOI
10.1007/s10619-011-7082-y
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
ASTERIX is a new data-intensive storage and computing platform project spanning UC Irvine, UC Riverside, and UC San Diego. In this paper we provide an overview of the ASTERIX project, starting with its main goal: the storage and analysis of data pertaining to evolving-world models. We describe the requirements and associated challenges, and explain how the project is addressing them. We provide a technical overview of ASTERIX, covering its architecture, its user model for data and queries, and its approach to scalable query processing and data management. ASTERIX utilizes a new scalable runtime computational platform called Hyracks, which is also discussed at an overview level; we have recently made Hyracks available as open source for use by other interested parties. We also relate our work on ASTERIX to the current state of the art and describe the research challenges that we are currently tackling as well as those that lie ahead.
Pages: 185-216
Page count: 32