Temporal Link Prediction Using Matrix and Tensor Factorizations

被引:352
作者
Dunlavy, Daniel M. [1 ]
Kolda, Tamara G. [2 ]
Acar, Evrim
机构
[1] Sandia Natl Labs, Albuquerque, NM 87123 USA
[2] Sandia Natl Labs, Livermore, CA 94551 USA
基金
美国能源部;
关键词
Link mining; link prediction; evolution; tensor factorization; CANDECOMP; PARAFAC;
D O I
10.1145/1921632.1921636
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The data in many disciplines such as social networks, Web analysis, etc. is link-based, and the link structure can be exploited for many different data mining tasks. In this article, we consider the problem of temporal link prediction: Given link data for times 1 through T, can we predict the links at time T + 1? If our data has underlying periodic structure, can we predict out even further in time, i.e., links at time T + 2, T + 3, etc.? In this article, we consider bipartite graphs that evolve over time and consider matrix-and tensor-based methods for predicting future links. We present a weight-based method for collapsing multiyear data into a single matrix. We show how the well-known Katz method for link prediction can be extended to bipartite graphs and, moreover, approximated in a scalable way using a truncated singular value decomposition. Using a CANDECOMP/PARAFAC tensor decomposition of the data, we illustrate the usefulness of exploiting the natural three-dimensional structure of temporal link data. Through several numerical experiments, we demonstrate that both matrix-and tensor-based techniques are effective for temporal link prediction despite the inherent difficulty of the problem. Additionally, we show that tensor-based techniques are particularly effective for temporal data with varying periodic patterns.
引用
收藏
页数:27
相关论文
共 39 条
[1]   Unsupervised Multiway Data Analysis: A Literature Survey [J].
Acar, Evrim ;
Yener, Buelent .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (01) :6-20
[2]  
Acar E, 2006, LECT NOTES COMPUT SC, V3975, P213
[3]   Link Prediction on Evolving Data using Matrix and Tensor Factorizations [J].
Acar, Evrim ;
Dunlavy, Daniel M. ;
Kolda, Tamara G. .
2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009), 2009, :262-+
[4]  
[Anonymous], J CHEMOMETR IN PRESS
[5]  
[Anonymous], 2006, P SIAM DAT MIN WORKS
[6]  
[Anonymous], LECT NOTES COMPUTER
[7]  
[Anonymous], 1992, Numerical Methods for Large Eigenvalue Problems
[8]  
[Anonymous], 1989, Multiway data analysis
[9]  
[Anonymous], 2005, SIGKDD Explor. Newsl
[10]  
[Anonymous], 2009, P SIAM INT C DAT MIN