Testing k-wise and Almost k-wise Independence

被引:69
作者
Alon, Noga [1 ]
Andoni, Alexandr [1 ]
Kaufman, Tali [1 ]
Matulef, Kevin [1 ]
Rubinfeld, Ronitt [1 ]
Xie, Ning [1 ]
机构
[1] Tel Aviv Univ, Tel Aviv, Israel
来源
STOC 07: PROCEEDINGS OF THE 39TH ANNUAL ACM SYMPOSIUM ON THEORY OF COMPUTING | 2007年
关键词
k-wise independence; almost k-wise independence; property testing; Fourier analysis; hidden-clique;
D O I
10.1145/1250790.1250863
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this work, we consider the problems of testing whether a distribution over {0, 1}(n) is k-wise (resp. (epsilon, k)-wise) independent using samples drawn from that distribution. For the problem of distinguishing k-wise independent distributions from those that are delta-far from k-wise independence in statistical distance, we upper bound the number of required samples by (O) over tilde (n(k)/delta(2)) and lower bound it by Omega(n(k-1/2)/delta) (n f- (these bounds hold for constant k, and essentially the same bounds hold for general k). To achieve these bounds, we use Fourier analysis to relate a distribution's distance from k-wise independence to its biases, a measure of the parity imbalance it induces on a set of variables. The relationships we derive are tighter than previously known, and may be of independent interest. To distinguish (epsilon, k)-wise independent distributions from those that are delta-far from (epsilon, k)-wise independence in statistical distance, we upper bound the number of required samples by O (klogn/delta(2)epsilon(2)) and lower bound it by Omega(root klogn/2(k)(epsilon+delta)root log1/2(k)(epsilon+delta). Although these bounds are an exponential improvement (in terms of n and k) over the corresponding bounds for testing k-wise independence, we give evidence that the time complexity of testing (epsilon, k)-wise independence is unlikely to be poly (n, 1/epsilon, 1/delta) for k = circle minus (log n), since this would disprove a plausible conjecture concerning the hardness of finding hidden cliques in random graphs. Under the conjecture, our result implies that for, say, k = log n and epsilon = 1/n(0.99), there is a set of (epsilon, k)-wise independent distributions, and a set of distributions at distance delta = 1/n(0.51) from (epsilon, k)-wise independence, which are indistinguishable by polynomial time algorithms.
引用
收藏
页码:496 / 505
页数:10
相关论文
共 29 条
[1]   SIMPLE CONSTRUCTIONS OF ALMOST K-WISE INDEPENDENT RANDOM-VARIABLES [J].
ALON, N ;
GOLDREICH, O ;
HASTAD, J ;
PERALTA, R .
RANDOM STRUCTURES & ALGORITHMS, 1992, 3 (03) :289-304
[2]   Problems and results in extremal combinatorics - I [J].
Alon, N .
DISCRETE MATHEMATICS, 2003, 273 (1-3) :31-53
[3]   Almost k-wise independence versus k-wise independence [J].
Alon, N ;
Goldreich, O ;
Mansour, Y .
INFORMATION PROCESSING LETTERS, 2003, 88 (03) :107-110
[4]   A FAST AND SIMPLE RANDOMIZED PARALLEL ALGORITHM FOR THE MAXIMAL INDEPENDENT SET PROBLEM [J].
ALON, N ;
BABAI, L ;
ITAI, A .
JOURNAL OF ALGORITHMS, 1986, 7 (04) :567-583
[5]  
Alon N, 1998, RANDOM STRUCT ALGOR, V13, P457, DOI 10.1002/(SICI)1098-2418(199810/12)13:3/4<457::AID-RSA14>3.0.CO
[6]  
2-W
[7]  
Alon N., 2004, The probabilistic method
[8]  
[Anonymous], 2000, EL C COMP COMPL
[9]   Testing random variables for independence and identity [J].
Batu, T ;
Fischer, E ;
Fortnow, L ;
Kumar, R ;
Rubinfeld, R ;
White, P .
42ND ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, PROCEEDINGS, 2001, :442-451
[10]  
BATU T, 2004, P 36 ANN ACM S THEOR, P381