Nonparametric estimation of long-tailed density functions and its application to the analysis of World Wide Web traffic

被引:16
作者
Markovitch, NM
Krieger, UR
机构
[1] Russian Acad Sci, Inst Control Sci, Moscow 117806, Russia
[2] Tnova Deutsch Telekom, Technol Zentrum, D-64295 Darmstadt, Germany
[3] Goethe Univ Frankfurt, Dept Comp Sci, D-60054 Frankfurt, Germany
关键词
World Wide Web; nonparametric density estimation; Parzen-Rosenblatt estimate; polygram; long-tailed distribution; consistency;
D O I
10.1016/S0166-5316(00)00031-6
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The study of WWW-traffic measurements has shown that different traffic characteristics can be modeled by long-tail distributed random variables (r.v.s). In this paper we discuss the nonparametric estimation of the probability density function of long-tailed distributions. Two nonparametric estimates, a Parzen-Rosenblatt kernel estimate and a histogram with variable bin width called polygram, are considered. The consistency of these estimates for heavy-tailed densities is discussed. To provide the consistency of the estimates in the metric space L-1, the transformation of the initial r.v. to a new r.v. distributed on the interval [0, 1] is proposed. Then the proposed estimates are applied to analyze real data of WWW-sessions. The latter are characterized by the sizes of the responses and inter-response intervals as well as the sizes and durations of sub-sessions. By these means the effectiveness of the nonparametric procedures in comparison to parametric models of the WWW-traffic characteristics is demonstrated. (C) 2000 Published by Elsevier Science B.V.
引用
收藏
页码:205 / 222
页数:18
相关论文
共 20 条
  • [1] ABOUJAOUDE S, 1976, ANN I H POINCARE B, V12, P299
  • [2] [Anonymous], 1982, ESTIMATION DEPENDENC
  • [3] BARFORD P, 1999, PERFORM EVAL REV AUG
  • [4] DISTRIBUTION ESTIMATION CONSISTENT IN TOTAL VARIATION AND IN 2 TYPES OF INFORMATION DIVERGENCE
    BARRON, AR
    GYORFI, L
    VANDERMEULEN, EC
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 1992, 38 (05) : 1437 - 1454
  • [5] Bol'shev L. N., 1968, TABLES MATH STAT
  • [6] CONSISTENT CROSS-VALIDATED DENSITY-ESTIMATION
    CHOW, YS
    GEMAN, S
    WU, LD
    [J]. ANNALS OF STATISTICS, 1983, 11 (01) : 25 - 38
  • [7] Crovella ME, 1998, PRACTICAL GUIDE TO HEAVY TAILS, P3
  • [8] Estimating the tail index
    Csörgö, S
    Viharos, L
    [J]. ASYMPTOTIC METHODS IN PROBABILITY AND STATISTICS: A VOLUME IN HONOUR OF MIKLOS CSORGO, 1998, : 833 - 881
  • [9] DEVROYE L, 1985, NONPARAMETRIC DENIST
  • [10] BEST OBTAINABLE ASYMPTOTIC RATES OF CONVERGENCE IN ESTIMATION OF A DENSITY-FUNCTION AT A POINT
    FARRELL, RH
    [J]. ANNALS OF MATHEMATICAL STATISTICS, 1972, 43 (01): : 170 - &