RDFFrames: Knowledge Graph Access for Machine Learning Tools

被引:3
作者
Mohamed, Aisha [1 ]
Abuoda, Ghadeer [2 ]
Ghanem, Abdurrahman [3 ]
Kaoudi, Zoi [4 ]
Aboulnaga, Ashraf [1 ]
机构
[1] HBKU, Qatar Comp Res Inst, Ar Rayyan, Qatar
[2] HBKU, Coll Sci & Engn, Ar Rayyan, Qatar
[3] Bluescape, San Carlos, CA USA
[4] Tech Univ Berlin, Berlin, Germany
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2020年 / 13卷 / 12期
关键词
Graphic methods - Information management - Knowledge graph - Query languages - Query processing - Resource Description Framework (RDF);
D O I
10.14778/3415478.3415501
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Knowledge graphs represented in RDF are becoming increasingly popular and are essential to many machine learning applications. A rich ecosystem of RDF data management systems and tools has evolved over the years, most notably RDF database management systems that support the SPARQL query language. Surprisingly, machine learning tools for knowledge graphs typically do not use SPARQL despite the obvious advantages of using a database system. This is due to the mismatch between SPARQL and machine learning tools in terms of expected data model and interface style. Machine learning tools work on data in tabular format and process it using imperative relational API calls, while SPARQL matches graph patterns to RDF triples. To access knowledge graphs for machine learning, we observe that it is more natural to use a navigational paradigm based on graph traversal rather than the SPARQL paradigm based on triple patterns. We demonstrate RDFFrames, a framework that bridges the gap between machine learning tools and RDF database systems by offering the usability and flexibility of machine learning tools together with the performance of a database system. RDFFrames enables the user to make a sequence of Python calls to define the data to be extracted from a knowledge graph stored in an RDF database system, and it translates these calls into a compact SPARQL query, executes it on the database system, and returns the results in a standard tabular format.
引用
收藏
页码:2889 / 2892
页数:4
相关论文
共 4 条
  • [1] Mohamed A., 2020, ARXIV200203614
  • [2] Pirahesh Hamid., 1992, SIGMOD
  • [3] Wang Yunhe, 2018, 32 AAAI C ART INT
  • [4] The Split-Apply-Combine Strategy for Data Analysis
    Wickham, Hadley
    [J]. JOURNAL OF STATISTICAL SOFTWARE, 2011, 40 (01): : 1 - 29