Empirical geometry of multivariate data: A deconvolution approach

被引:24
作者
Koltchinskii, VI [1 ]
机构
[1] Univ New Mexico, Dept Math & Stat, Albuquerque, NM 87131 USA
关键词
support of probability distribution; metric entropy; entropy dimension; clusters; deconvolving estimators;
D O I
10.1214/aos/1016218232
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Let {Y (j): j = 1,..., n} be independent observations in R-m, m greater than or equal to 1 with common distribution Q. Suppose that Y (j) = X (j) + xi (j), j = 1,...,n, where {X (j), xi (j), j = 1,...,n} are independent, X (j), j = 1,..., n have common distribution P and xi (j), j = 1,...,n have common distribution mu, so that Q = P * mu. The problem is to recover hidden geometric structure of the support of P based an the independent observations Y (j). Assuming that the distribution of the errors mu is known, deconvolution statistical estimates of the fractal dimension and the hierarchical cluster tree of the support that converge with exponential rates are suggested. Moreover, the exponential rates of convergence hold for adaptive versions of the estimates even in the case of normal noise xi (j) with unknown covariance. In the case of the dimension estimation, though, the exponential rate holds only when the set of all possible values of the dimension is finite (e.g., when the dimension is known to be integer). If this set is infinite, the optimal convergence rate of the estimator becomes very slow (typically, logarithmic), even when there is no noise in the observations.
引用
收藏
页码:591 / 629
页数:39
相关论文
共 30 条
[1]  
[Anonymous], CLUSTERING ALGORITHM
[2]  
[Anonymous], 1996, Clustering and Classification Ed. by, DOI DOI 10.1142/1930
[3]  
[Anonymous], 1996, Clustering and classification
[4]  
[Anonymous], 1979, GRAPH THEORY INTRO C, DOI DOI 10.1007/978-1-4612-9967-7
[5]  
[Anonymous], 1978, CLASSIFICATION AUTOM
[6]  
Bhattacharya RN., 1976, NORMAL APPROXIMATION
[7]   OPTIMAL RATES OF CONVERGENCE FOR DECONVOLVING A DENSITY [J].
CARROLL, RJ ;
HALL, P .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1988, 83 (404) :1184-1186
[8]  
CHENCOV NN, 1972, STAT DECISION RULES
[9]  
Cuevas A, 1997, ANN STAT, V25, P2300
[10]   The estimation of the order of a mixture model [J].
DacunhaCastelle, D ;
Gassiat, E .
BERNOULLI, 1997, 3 (03) :279-299