Type/Token-Taken informetrics

Cited by: 9
Author
Egghe, L
Affiliations
[1] Limburgs Univ Ctr, B-3590 Diepenbeek, Belgium
[2] Univ Instelling Antwerp, B-2610 Wilrijk, Belgium
Source
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY | 2003 / Vol. 54 / No. 07
Keywords
INFORMATION; SYSTEMS;
DOI
10.1002/asi.10247
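Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract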
Type/Token-Taken informetrics is a new part of informetrics that studies the use of items rather than the items themselves. Here, items are the objects produced by sources (e.g., journals producing articles, authors producing papers). In linguistics, a source is also called a type (e.g., a word) and an item a token (e.g., the use of words in texts). In informetrics, types that occur often, for example in a database, will also be requested often, for example in information retrieval. The relative use of these occurrences will be higher than their relative occurrence itself; hence the name Type/Token-Taken informetrics. This article studies the frequency distribution of Type/Token-Taken informetrics, starting from that of Type/Token informetrics (i.e., source-item relationships). We also study the average number μ* of item uses in Type/Token-Taken informetrics and compare it with the classical average number μ in Type/Token informetrics. We show that μ* ≥ μ always, and that μ* is an increasing function of μ. A method is presented to calculate μ* from μ and a given a, the exponent in Lotka's frequency distribution of Type/Token informetrics. We leave open the problem of developing non-Lotkaian Type/Token-Taken informetrics.
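The inequality between the two averages can be illustrated numerically. The sketch below is not taken from the article; it rests on one assumed reading of the abstract, namely that in the Type/Token-Taken view every item is weighted by the size of its source, so that the Type/Token-Taken distribution is the size-biased version of the Type/Token distribution. Under that assumption the classical mean is Σ j·f(j) / Σ f(j) and the use-weighted mean is Σ j²·f(j) / Σ j·f(j). The function names and the truncation point `j_max` are hypothetical and chosen only for illustration.

```python
def lotka_weights(a, j_max):
    """Unnormalized Lotka weights f(j) = 1 / j**a for j = 1..j_max.

    Assumption: a discrete Lotka law truncated at j_max; the truncation
    point is a hypothetical parameter, not something the abstract specifies.
    """
    return {j: 1.0 / j ** a for j in range(1, j_max + 1)}


def mean_type_token(f):
    """Classical average: items per source, sum(j * f(j)) / sum(f(j))."""
    total_sources = sum(f.values())
    total_items = sum(j * w for j, w in f.items())
    return total_items / total_sources


def mean_type_token_taken(f):
    """Use-weighted average under the assumed size-biased reading:
    sum(j**2 * f(j)) / sum(j * f(j))."""
    total_items = sum(j * w for j, w in f.items())
    total_uses = sum(j * j * w for j, w in f.items())
    return total_uses / total_items


if __name__ == "__main__":
    f = lotka_weights(a=2.0, j_max=100)
    print("classical mean:   ", mean_type_token(f))
    print("use-weighted mean:", mean_type_token_taken(f))
```

Under this reading the gap between the two means equals the variance of the source sizes divided by the classical mean, so the use-weighted mean can never fall below the classical one, and the two coincide only when every source has the same number of items.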
Pages: 603-610
Number of pages: 8
Related papers
19 in total
[1] Bradford SC., 1934, ENGINEERING, V137, P85
[2] BURRELL, QL; CANE, VR. THE ANALYSIS OF LIBRARY DATA [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 1982, 145: 439-471
[3] Egghe, L. The distribution of N-grams [J]. SCIENTOMETRICS, 2000, 47 (02): 237-252
[4] EGGHE, L. THEORY OF SEARCH KEYS AND APPLICATIONS IN RETRIEVAL TECHNIQUES USED BY CATALOGERS [J]. MATHEMATICAL AND COMPUTER MODELLING, 1992, 16 (04): 69-90
[5] EGGHE, L. THE DUALITY OF INFORMETRIC SYSTEMS WITH APPLICATIONS TO THE EMPIRICAL LAWS [J]. JOURNAL OF INFORMATION SCIENCE, 1990, 16 (01): 17-27
[6] Egghe L., 1990, INTRO INFORMETRICS Q, V14, P251
[7] EGGHE L, 2003, SOURCE ITEM COVERAGE
[8] EGGHE L, 1989, THESIS CITY U LONDON
[9] Herdan G., 1960, Type-token Mathematics: A Textbook of Mathematical Linguistics
[10] Herdan G, 1964, Quantitative Linguistics