The relationship between Zipf's law and the distribution of first digits

被引:12
作者
Irmay, S [1 ]
机构
[1] TECHNION ISRAEL INST TECHNOL,HAIFA,ISRAEL
关键词
D O I
10.1080/02664769723594
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Zipf's experimental law states that, for a given large piece of text, the product of the relative frequency of a word and its order in descending frequency order is a constant, shown to be equal to 1 divided by the natural logarithm of the number of different words. It is shown to be approximately equal to Benford's logarithmic distribution of first significant digits in tables of numbers. Eleven samples allow comparison of observed and theoretical frequencies.
引用
收藏
页码:383 / 393
页数:11
相关论文
共 34 条
  • [1] *AC HEBR LANG, 1973, BOOK BEN SIR
  • [2] [Anonymous], FRENCH WORD BOOK BAS
  • [3] [Anonymous], 1959, WORD COUNT MODERN AR
  • [4] Balgur R, 1968, LIST BASIC WORDS SCH
  • [5] Benford F., 1938, P AM PHILOS SOC, P551, DOI DOI 10.2307/984802
  • [6] BRILL Moshe, 1940, BASIC WORD LIST ARAB
  • [7] Carroll J. B., 1971, Word frequency book
  • [8] CHEYDLEUR, 1928, FRENCH IDIOM LIST
  • [9] Estoup J.-B., 1916, GAMMES STENOGRAPHIQU
  • [10] ON PROBABILITY THAT A RANDOM INTEGER HAS INITIAL DIGIT 4
    FLEHINGER, BJ
    [J]. AMERICAN MATHEMATICAL MONTHLY, 1966, 73 (10) : 1056 - +