Analysis of Web page image tag distribution characteristics

Cited by: 4
Authors
Ajiferuke, I [1]
Wolfram, D [2]
Affiliations
[1] Univ Western Ontario, Middlesex Coll, Fac Informat & Media Studies, London, ON N6A 5B7, Canada
[2] Univ Wisconsin, Sch Informat Studies, Milwaukee, WI 53201 USA
Keywords
informetric modeling; cybermetrics; image tag distributions
DOI
10.1016/j.ipm.2004.01.003
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The authors investigate the frequency distribution of the use of image tags in Web pages. Using data sampled from top level Web pages across five top level domains and from sample pages within individual websites, the authors model observed patterns in the frequency of image tag usage by fitting collected data distributions to different theoretical models used in informetrics. Models tested include the modified power law (MPL), Mandelbrot (MDB), generalized Waring (GW), generalized inverse Gaussian-Poisson (GIGP), and generalized negative binomial (GNB) distributions. The GIGP provided the best fit for data sets for top level pages across the top level domains tested. The poor fits of the models to the observed data distributions from specific websites were due to the multimodal nature of the observed data sets. Mixtures of the tested models for the data sets provided better fits. The ability to effectively model Web page attributes, such as the distribution of the number of image tags used per page, is needed for accurate simulation models of Web page content, and makes it possible to estimate the number of requests needed to display the complete content of Web pages. © 2004 Elsevier Ltd. All rights reserved.
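The quantity the abstract models, the number of image tags per page, can be tabulated with a few lines of Python. The following is only a minimal sketch under stated assumptions, not the authors' data collection procedure: it assumes pages are already available as HTML strings, counts only literal `<img>` start tags, and the names `ImgTagCounter`, `count_img_tags`, and `img_tag_distribution` are hypothetical. Fitting the MPL, MDB, GW, GIGP, or GNB models to the resulting frequencies is not shown.

```python
from collections import Counter
from html.parser import HTMLParser


class ImgTagCounter(HTMLParser):
    """Counts <img> start tags in one HTML document (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.count = 0

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag names, so <IMG> is counted as well.
        # Self-closing <img ... /> also reaches this method by default.
        if tag == "img":
            self.count += 1


def count_img_tags(html):
    """Return the number of <img> tags found in an HTML string."""
    parser = ImgTagCounter()
    parser.feed(html)
    parser.close()
    return parser.count


def img_tag_distribution(pages):
    """Map 'image tags on a page' to 'number of pages with that many tags'."""
    return Counter(count_img_tags(page) for page in pages)
```

Under the simplifying assumption that each image tag triggers one extra request, `1 + count_img_tags(page)` gives a rough lower bound on the requests needed to display a page; real pages also load stylesheets, scripts, and other resources.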
Pages: 987-1002
Number of pages: 16