Can analysis of word frequency distinguish between writings of different authors?

被引:10
作者
Vilensky, B
机构
[1] Department of Physics, Bar-Ilan University
关键词
D O I
10.1016/0378-4371(96)00109-4
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Various literature writings are compared by the ''rank distance'', d, between two word frequency Zipf plots introduced by S. Havlin (Physica A 216 (1995) 148). We studied 22 books written by six authors. For this ensemble of books we find that the mean distance between books written by the same authors ([d] = 15.2 +/- 2.6) is considerably smaller than that between books written by different authors ([d] = 21.8 +/- 3.2), in good agreement with earlier results on a smaller sample of books. Our results suggest that the distribution of the rank difference of the same words in different books decays exponentially.
引用
收藏
页码:705 / 711
页数:7
相关论文
共 16 条
[11]   MOSAIC ORGANIZATION OF DNA NUCLEOTIDES [J].
PENG, CK ;
BULDYREV, SV ;
HAVLIN, S ;
SIMONS, M ;
STANLEY, HE ;
GOLDBERGER, AL .
PHYSICAL REVIEW E, 1994, 49 (02) :1685-1689
[12]   LONG-RANGE CORRELATIONS IN NUCLEOTIDE-SEQUENCES [J].
PENG, CK ;
BULDYREV, SV ;
GOLDBERGER, AL ;
HAVLIN, S ;
SCIORTINO, F ;
SIMONS, M ;
STANLEY, HE .
NATURE, 1992, 356 (6365) :168-170
[13]   LONG RANGE CORRELATION IN HUMAN WRITINGS [J].
Schenkel, Alain ;
Zhang, Jun ;
Zhang, Yi-Cheng .
FRACTALS-COMPLEX GEOMETRY PATTERNS AND SCALING IN NATURE AND SOCIETY, 1993, 1 (01) :47-57
[14]  
Stanley HE, 1995, LECT NOTES PHYS, V450, P331, DOI 10.1007/3-540-59222-9_44
[15]   ZIPF PLOTS AND THE SIZE DISTRIBUTION OF FIRMS [J].
STANLEY, MHR ;
BULDYREV, SV ;
HAVLIN, S ;
MANTEGNA, RN ;
SALINGER, MA ;
STANLEY, HE .
ECONOMICS LETTERS, 1995, 49 (04) :453-457
[16]  
ZIPF GK, 1949, HUMAN BEHAVIOR PRINC