Asymptotics for trimmed k-means and associated tolerance zones

被引：4

作者：

García-Escudero, LA ^{[1
]}

Gordaliza, A ^{[1
]}

Matrán, C ^{[1
]}

机构：

[1] Univ Valladolid, Fac Ciencias, Dept Estadist & Investigac, Valladolid 47002, Spain

来源：

JOURNAL OF STATISTICAL PLANNING AND INFERENCE | 1999年 / 77卷 / 02期

关键词：

asymptotics; clustering methods; distribution freeness; robustness; tolerance zones; trimmed k-means;

D O I：

10.1016/S0378-3758(98)00196-7

中图分类号：

O21 [概率论与数理统计]; C8 [统计学];

学科分类号：

020208 ; 070103 ; 0714 ;

摘要：

Impartial trimming procedures with respect to general 'penalty' functions, di, have been recently introduced in Cuesta-Albertos et al. (1997. Arm. Statist. 25, 553-576) in the (generalized) k-means framework. Under regularity assumptions, for real-valued samples, we obtain the asymptotic normality both of the impartial trimmed k-mean estimator (Phi(x) = x(2)) and of the impartial trimmed k-median estimator (Phi(x) = x). In spite of the additional complexity coming from the several groups setting, the empirical quantile methodology used in Butler (1982. Arm. Statist. 10, 197-204) for the LTS estimator (and subsequently in Tableman (1994. Statist. Probab. Lett. 19, 387-398) for the LTAD estimator) also works in our framework. Besides their relevance for the robust estimation of quantizers, our results open the possibility of considering asymptotic distribution-free tolerance regions, constituted by unions of intervals, for predicting a future observation, generalizing the idea in Butler (1982). (C) 1999 Elsevier Science B.V. All rights reserved. AMS classifications: Primary 62G20; 62G15; secondary 62G35.

引用

页码：247 / 262

页数：16

共 26 条

[1] BICKEL PJ, 1967, 5 P BERK S MATH STAT, V1, P575
[2] GENERALIZED MEANS AND ASSOCIATED FAMILIES OF DISTRIBUTIONS
BRONS, HK
BRUNK, HD
FRANCK, WE
HANSON, DL
[J]. ANNALS OF MATHEMATICAL STATISTICS, 1969, 40 (02): : 339 - &
[3] NONPARAMETRIC INTERVAL AND POINT PREDICTION USING DATA TRIMMED BY A GRUBBS-TYPE OUTLIER RULE
BUTLER, RW
[J]. ANNALS OF STATISTICS, 1982, 10 (01) : 197 - 204
[4] NOTE ON GROUPING
COX, DR
[J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1957, 52 (280) : 543 - 547
[5] THE STRONG LAW OF LARGE NUMBERS FOR K-MEANS AND BEST POSSIBLE NETS OF BANACH VALUED RANDOM-VARIABLES
CUESTA, JA
MATRAN, C
[J]. PROBABILITY THEORY AND RELATED FIELDS, 1988, 78 (04) : 523 - 534
[6] Cuesta-Albertos JA, 1997, ANN STAT, V25, P553
[7] ON GROUPING FOR MAXIMUM HOMOGENEITY
FISHER, WD
[J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1958, 53 (284) : 789 - 798
[8] BEST APPROXIMATIONS TO RANDOM-VARIABLES BASED ON TRIMMING PROCEDURES
GORDALIZA, A
[J]. JOURNAL OF APPROXIMATION THEORY, 1991, 64 (02) : 162 - 180
[9] ON THE BREAKDOWN POINT OF MULTIVARIATE LOCATION ESTIMATORS BASED ON TRIMMING PROCEDURES
GORDALIZA, A
[J]. STATISTICS & PROBABILITY LETTERS, 1991, 11 (05) : 387 - 394
[10] Hartigan J. A., 1975, CLUSTERING ALGORITHM

← 1 2 3 →