A Review of Relational Machine Learning for Knowledge Graphs

被引:891
作者
Nickel, Maximilian [1 ,2 ]
Murphy, Kevin [3 ]
Tresp, Volker [4 ,5 ]
Gabrilovich, Evgeniy [3 ]
机构
[1] MIT, Lab Computat & Stat Learning, Cambridge, MA 02139 USA
[2] Ist Italiano Tecnol, I-16163 Genoa, Italy
[3] Google Inc, Mountain View, CA 94043 USA
[4] Siemens AG, Corp Technol, D-81739 Munich, Germany
[5] Univ Munich, D-80539 Munich, Germany
关键词
Graph-based models; knowledge extraction; knowledge graphs; latent feature models; statistical relational learning; LINK-PREDICTION; SEMANTIC WEB; BASE;
D O I
10.1109/JPROC.2015.2483592
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Relational machine learning studies methods for the statistical analysis of relational, or graph-structured, data. In this paper, we provide a review of how such statistical models can be "trained'' on large knowledge graphs, and then used to predict new facts about the world (which is equivalent to predicting new edges in the graph). In particular, we discuss two fundamentally different kinds of statistical relational models, both of which can scale to massive data sets. The first is based on latent feature models such as tensor factorization and multiway neural networks. The second is based on mining observable patterns in the graph. We also show how to combine these latent and observable models to get improved modeling power at decreased computational cost. Finally, we discuss how such statistical models of graphs can be combined with text-based information extraction methods for automatically constructing knowledge graphs from the Web. To this end, we also discuss Google's knowledge vault project as an example of such combination.
引用
收藏
页码:11 / 33
页数:23
相关论文
共 147 条
  • [91] Kimmig A., 2012, P NIPS WORKSH PROB P
  • [92] Klyne G., 2004, Resource description framework (RDF): Concepts and abstract syntax
  • [93] Tensor Decompositions and Applications
    Kolda, Tamara G.
    Bader, Brett W.
    [J]. SIAM REVIEW, 2009, 51 (03) : 455 - 500
  • [94] Higher-order web link analysis using multilinear algebra
    Kolda, TG
    Bader, BW
    Kenny, JP
    [J]. FIFTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2005, : 242 - 249
  • [95] Koller D, 2009, Probabilistic graphical models: principles and techniques
  • [96] Koren Y, 2008, P 14 ACM SIGKDD INT, P426
  • [97] MATRIX FACTORIZATION TECHNIQUES FOR RECOMMENDER SYSTEMS
    Koren, Yehuda
    Bell, Robert
    Volinsky, Chris
    [J]. COMPUTER, 2009, 42 (08) : 30 - 37
  • [98] Krompass D, 2014, LECT NOTES COMPUT SC, V8797, P114, DOI 10.1007/978-3-319-11915-1_8
  • [99] Lao N, 2011, C EMP METH NAT LANG
  • [100] Relational retrieval using a combination of path-constrained random walks
    Lao, Ni
    Cohen, William W.
    [J]. MACHINE LEARNING, 2010, 81 (01) : 53 - 67