Testing random variables for independence and identity

被引:112
作者
Batu, T [1 ]
Fischer, E [1 ]
Fortnow, L [1 ]
Kumar, R [1 ]
Rubinfeld, R [1 ]
White, P [1 ]
机构
[1] Cornell Univ, Dept Comp Sci, Ithaca, NY 14853 USA
来源
42ND ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, PROCEEDINGS | 2001年
关键词
D O I
10.1109/SFCS.2001.959920
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Given access to independent samples of a distribution A over [n] x [m], we show how to test whether the distributions formed by projecting A to each coordinate are in dependent, i.e., whether A is epsilon -close in the L-1 norm to the product distribution A(1) x A(2) for some distributions A(1) over [n] and A(2) over [m]. The sample complexity of our test is (O) over tilde (n(2/3)m(1/3)poly(epsilon (-1))), assuming without loss of generality that m less than or equal to n. We also give a matching lower bound, up to poly(log n, epsilon (-1)) factors. Furthermore, given access to samples of a distribution X over [n], we show how to test if X is epsilon -close in L-1 norm to an explicitly specified distribution Y. Our test uses (O) over tilde (n(1/2)poly(epsilon (-1))) samples, which nearly matches the known tight bounds for the case when Y is uniform.
引用
收藏
页码:442 / 451
页数:10
相关论文
共 8 条
[1]   Testing that distributions are close [J].
Batu, T ;
Fortnow, L ;
Rubinfeld, R ;
Smith, WD ;
White, P .
41ST ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, PROCEEDINGS, 2000, :259-269
[2]  
FELLER W, 1968, INTRO PROBABILITY AP, V1
[3]   Property testing and its connection to learning and approximation [J].
Goldreich, O ;
Goldwasser, S ;
Ron, D .
JOURNAL OF THE ACM, 1998, 45 (04) :653-750
[4]  
Goldreich O., 2000, LECT NOTES COMPUTER, V6650
[5]  
Lehmann E.L., 1986, Testing Statistical Hypotheses
[6]  
Ron D., 2001, HDB RANDOMIZED COMPU, V9
[7]   Robust characterizations of polynomials with applications to program testing [J].
Rubinfeld, R ;
Sudan, M .
SIAM JOURNAL ON COMPUTING, 1996, 25 (02) :252-271
[8]  
SAHAI A, 1999, DIMACS SERIES DISCRE, V43, P251