Word co-occurrences on Webpages as a measure of the relatedness of organizations: A new Webometrics concept

Cited by: 52
Authors
Vaughan, Liwen [1]
You, Justin [2]
Affiliations
[1] Univ Western Ontario, Fac Informat & Media Studies, London, ON N6A 5B7, Canada
[2] ApacBridge Consulting, Ottawa, ON K2G 6C7, Canada
Keywords
Web co-link analysis; Web co-word analysis; Webometrics; Competitive intelligence; INFORMATION-SCIENCE; SEARCH ENGINES; WEB
DOI
10.1016/j.joi.2010.04.005
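Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
080201 [Mechanical manufacturing and automation]
Abstract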
Web hyperlink analysis has been a key topic of Webometric research. However, inlink data collection from commercial search engines has been limited to only one source in recent years, which is not a promising prospect for the future development of the field. We need to tap into other Web data sources and to develop new methods. Toward this end, we propose a new Webometrics concept that is based on words rather than inlinks on Webpages. We propose that word co-occurrences on Webpages can be a measure of the relatedness of organizations. Word co-occurrence data can be collected from both general search engines and blog search engines, which expands data sources greatly. The proposed concept is tested in a group of companies in the LTE and WiMax sectors of the telecommunications industry. Data on the co-occurrences of company names on Webpages were collected from Google and Google Blog. The co-occurrence matrices were analyzed using MDS. The resulting MDS maps were compared with industry reality and with the MDS maps from co-link analysis. Results show that Web co-word analysis could potentially be as useful as Web co-link analysis. Google Blog seems to be a better source than Google for co-word data collection. (C) 2010 Elsevier Ltd. All rights reserved.
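As a rough illustration of the data preparation the abstract describes, the sketch below turns page-hit counts for organization names into a symmetric similarity matrix and then into the dissimilarity matrix that an MDS routine would take as input. The Jaccard normalization, the function names, the placeholder organization names, and the hit counts are all assumptions made for illustration; the abstract does not say how the co-occurrence counts were normalized, and the numbers are not data from the study.

```python
from itertools import combinations


def jaccard_matrix(names, single_hits, pair_hits):
    """Build a symmetric similarity matrix from page-hit counts.

    single_hits[name]  -> number of pages mentioning that name
    pair_hits[(a, b)]  -> number of pages mentioning both names,
                          keyed in the same order as `names`
    Jaccard normalization is an assumption, not the paper's stated method.
    """
    n = len(names)
    sim = [[0.0] * n for _ in range(n)]
    for i in range(n):
        sim[i][i] = 1.0  # every name co-occurs fully with itself
    for i, j in combinations(range(n), 2):
        both = pair_hits.get((names[i], names[j]), 0)
        union = single_hits[names[i]] + single_hits[names[j]] - both
        value = both / union if union > 0 else 0.0
        sim[i][j] = sim[j][i] = value
    return sim


def to_dissimilarity(sim):
    """Convert similarities in [0, 1] to dissimilarities for MDS input."""
    return [[1.0 - v for v in row] for row in sim]


# Hypothetical organizations and invented hit counts, for illustration only.
names = ["OrgA", "OrgB", "OrgC"]
single_hits = {"OrgA": 100, "OrgB": 80, "OrgC": 50}
pair_hits = {("OrgA", "OrgB"): 60, ("OrgA", "OrgC"): 10, ("OrgB", "OrgC"): 5}

sim = jaccard_matrix(names, single_hits, pair_hits)
dis = to_dissimilarity(sim)
```

In this toy input, OrgA and OrgB share many pages and so end up with a small dissimilarity, which an MDS map would render as two nearby points.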
Pages: 483-491
Number of pages: 9