Geodesic entropic graphs for dimension and entropy estimation in manifold learning

被引:178
作者
Costa, JA [1 ]
Hero, AO [1 ]
机构
[1] Univ Michigan, Dept Elect Engn & Comp Sci, Ann Arbor, MI 48109 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
conformal embedding; intrinsic dimension; intrinsic entropy; manifold learning; minimal spanning tree; nonlinear dimensionality reduction;
D O I
10.1109/TSP.2004.831130
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In the manifold learning problem, one seeks to discover a smooth low dimensional surface, i.e., a manifold embedded in a higher dimensional linear vector space, based on a set of measured sample points on the surface. In this paper, we consider the closely related problem of estimating the manifold's intrinsic dimension and the intrinsic entropy of the sample points. Specifically, we view the sample points as realizations of an unknown multivariate density supported on an unknown smooth manifold. We introduce a novel geometric approach based on entropic graph methods. Although the theory presented applies to this general class of graphs, we focus on the geodesic-minimal-spanning-tree (GMST) to obtaining asymptotically consistent estimates of the manifold dimension and the Renyi alpha-entropy of the sample density on the manifold. The GMST, approach is striking in its simplicity and does not require reconstruction of the manifold or estimation of the multivariate density of the samples. The GMST method simply constructs a minimal spanning tree (MST) sequence using a geodesic edge matrix and uses the overall lengths of the MSTs to simultaneously estimate manifold dimension and entropy. We illustrate the GMST approach on standard synthetic manifolds as well as on real data sets consisting of images of faces.
引用
收藏
页码:2210 / 2221
页数:12
相关论文
共 38 条
  • [1] Alexander KS, 1996, ANN APPL PROBAB, V6, P466
  • [2] [Anonymous], P CAMB PHILO SOC, DOI DOI 10.1017/S0305004100034095
  • [3] AVRAM F, 1990, ANN APPL PROBAB, V9, P223
  • [4] BELKIN M, 2002, ADV NEURAL INFORMATI, V14
  • [5] BERNSTEIN M., 2000, Technical Report
  • [6] BERTSIMAS D, 1992, OPER RES LET, V2, P113
  • [7] Boothby W.M., 2003, An Introduction to Differentiable Manifolds and Riemannian Geometry
  • [8] Estimating the intrinsic dimension of data with a fractal-based method
    Camastra, F
    Vinciarelli, A
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (10) : 1404 - 1407
  • [9] Carmo M.P., 1992, RIEMANNIAN GEOMETRY
  • [10] Carmo M. P. D., 1976, DIFFERENTIAL GEOMETR