Principal curves of oriented points: Theoretical and computational improvements

Cited by: 24
Authors
Delicado, P [1]
Huerta, M
Affiliations
[1] Univ Politecn Cataluna, Dept Estadist & Invest Operat, E-08028 Barcelona, Spain
[2] Univ Politecn Cataluna, Dept Llenguatjes & Sistemes Informat, E-08028 Barcelona, Spain
Keywords
bandwidth choice; clustering; Minimum Spanning Tree; object oriented programming; population pyramids;
DOI
10.1007/s001800300145
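Chinese Library Classification (CLC) number
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics];
Discipline classification code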
020208 ; 070103 ; 0714 ;
Abstract
Principal curves were introduced by Hastie & Stuetzle (1989) as smooth parametric curves passing through the middle of a multidimensional data set. Delicado (2001) defines Principal Curves of Oriented Points, based on the fixed points of a function from $\mathbb{R}^p$ into itself. This definition is nonparametric, and smoothing methods are used to find the principal curves of a data set. Here we extend this work in two directions. First, we propose a bandwidth choice method based on the Minimum Spanning Tree of the data set. Second, we present an object-oriented application that implements the principal curves computation for any dimension in a flexible, recursive way. Examples on synthetic and real data are included.
Pages: 293-315
Number of pages: 23