Decoding Binary Node Labels from Censored Edge Measurements: Phase Transition and Efficient Recovery

被引:55
作者
Abbe, Emmanuel [1 ,2 ]
Bandeira, Afonso S. [1 ]
Bracher, Annina [3 ]
Singer, Amit [1 ]
机构
[1] Princeton Univ, Program Appl & Computat Math, Princeton, NJ 08544 USA
[2] Princeton Univ, Dept Elect Engn, Princeton, NJ 08544 USA
[3] Swiss Fed Inst Technol, Dept Elect Engn, CH-8092 Zurich, Switzerland
来源
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING | 2014年 / 1卷 / 01期
关键词
Synchronization problem; Information theoretic bounds; Stochastic block model; Semidefinite relaxations; graph-based codes;
D O I
10.1109/TNSE.2014.2368716
中图分类号
T [工业技术];
学科分类号
08 [工学];
摘要
We consider the problem of clustering a graph G into two communities by observing a subset of the vertex correlations. Specifically, we consider the inverse problem with observed variables Y = B(G)x circle plus Z, where B-G is the incidence matrix of a graph G, x is the vector of unknown vertex variables (with a uniform prior) and Z is a noise vector with Bernoulli(epsilon) i.i.d. entries. All variables and operations are Boolean. This model is motivated by coding, synchronization, and community detection problems. In particular, it corresponds to a stochastic block model or a correlation clustering problem with two communities and censored edges. Without noise, exact recovery (up to global flip) of x is possible if and only the graph G is connected, with a sharp threshold at the edge probability log(n)/n for Erdos-Renyi random graphs. The first goal of this paper is to determine how the edge probability p needs to scale to allow exact recovery in the presence of noise. Defining the degree (oversampling) rate of the graph by alpha = np/log(n), it is shown that exact recovery is possible if and only if alpha > 2/(1 - 2 epsilon)(2) + o(1/(1 - 2 epsilon)(2)). In other words, 2/(1 - 2 epsilon)(2) is the information theoretic threshold for exact recovery at low-SNR. In addition, an efficient recovery algorithm based on semidefinite programming is proposed and shown to succeed in the threshold regime up to twice the optimal rate. For a deterministic graph G, defining the degree rate as alpha = d/log(n), where d is the minimum degree of the graph, it is shown that the proposed method achieves the rate alpha > 4((1 + lambda)/(1 - lambda)(2))/(1 - 2 epsilon)(2) + o(1/(1 - 2 epsilon)(2)), where 1 - lambda is the spectral gap of the graph G.
引用
收藏
页码:10 / 22
页数:13
相关论文
共 36 条
[1]
Abbe Emmanuel, 2013, Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques. Algorithms and Techniques. 16th International Workshop, APPROX 2013 and 17th International Workshop, RANDOM 2013. Proceedings: LNCS 8096, P332, DOI 10.1007/978-3-642-40328-6_24
[2]
Abbe E., 2014, ARXIV14053267CSSI
[3]
Abbe E, 2014, IEEE INT SYMP INFO, P1251, DOI 10.1109/ISIT.2014.6875033
[4]
Phase Retrieval with Polarization [J].
Alexeev, Boris ;
Bandeira, Afonso S. ;
Fickus, Matthew ;
Mixon, Dustin G. .
SIAM JOURNAL ON IMAGING SCIENCES, 2014, 7 (01) :35-66
[5]
LAMBDA-1, ISOPERIMETRIC-INEQUALITIES FOR GRAPHS, AND SUPERCONCENTRATORS [J].
ALON, N ;
MILMAN, VD .
JOURNAL OF COMBINATORIAL THEORY SERIES B, 1985, 38 (01) :73-88
[6]
EIGENVALUES AND EXPANDERS [J].
ALON, N .
COMBINATORICA, 1986, 6 (02) :83-96
[7]
Alon N., 2004, P 36 ANN ACM S THEOR, P72, DOI DOI 10.1145/1007352.1007371
[8]
[Anonymous], 2003, P 35 ANN ACM S THEOR
[9]
[Anonymous], GENERALIZED BLOCKMOD
[10]
Bandeira A. S., 2013, ARXIV13085207CSDS