Asymptotics for trimmed k-means and associated tolerance zones

被引:4
作者
García-Escudero, LA [1 ]
Gordaliza, A [1 ]
Matrán, C [1 ]
机构
[1] Univ Valladolid, Fac Ciencias, Dept Estadist & Investigac, Valladolid 47002, Spain
关键词
asymptotics; clustering methods; distribution freeness; robustness; tolerance zones; trimmed k-means;
D O I
10.1016/S0378-3758(98)00196-7
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Impartial trimming procedures with respect to general 'penalty' functions, di, have been recently introduced in Cuesta-Albertos et al. (1997. Arm. Statist. 25, 553-576) in the (generalized) k-means framework. Under regularity assumptions, for real-valued samples, we obtain the asymptotic normality both of the impartial trimmed k-mean estimator (Phi(x) = x(2)) and of the impartial trimmed k-median estimator (Phi(x) = x). In spite of the additional complexity coming from the several groups setting, the empirical quantile methodology used in Butler (1982. Arm. Statist. 10, 197-204) for the LTS estimator (and subsequently in Tableman (1994. Statist. Probab. Lett. 19, 387-398) for the LTAD estimator) also works in our framework. Besides their relevance for the robust estimation of quantizers, our results open the possibility of considering asymptotic distribution-free tolerance regions, constituted by unions of intervals, for predicting a future observation, generalizing the idea in Butler (1982). (C) 1999 Elsevier Science B.V. All rights reserved. AMS classifications: Primary 62G20; 62G15; secondary 62G35.
引用
收藏
页码:247 / 262
页数:16
相关论文
共 26 条