Beyond the Zipf-Mandelbrot law in quantitative linguistics

被引:144
作者
Montemurro, MA [1 ]
机构
[1] Natl Univ Cordoba, Fac Matemat Astron & Fis, RA-5000 Cordoba, Argentina
来源
PHYSICA A | 2001年 / 300卷 / 3-4期
关键词
Zipf-Mandelbrot law; human language;
D O I
10.1016/S0378-4371(01)00355-7
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
In this paper the Zipf-Mandelbrot law is revisited in the context of linguistics. Despite its widespread popularity the Zipf-Mandelbrot law can only describe the statistical behaviour of a rather restricted fraction of the total number of words contained in some given corpus. In particular, we focus our attention on the important deviations that become statistically relevant as larger corpora are considered and that ultimately could be understood as salient features of the underlying complex process of language generation. Finally, it is shown that all the different observed regimes can be accurately encompassed within a single mathematical framework recently introduced by C. Tsallis. (C) 2001 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:567 / 578
页数:12
相关论文
共 14 条
[1]  
[Anonymous], 1949, Human behaviour and the principle of least-effort
[2]   Numerical analysis of word frequencies in artificial and natural language texts [J].
Cohen, A ;
Mantegna, RN ;
Havlin, S .
FRACTALS-COMPLEX GEOMETRY PATTERNS AND SCALING IN NATURE AND SOCIETY, 1997, 5 (01) :95-104
[3]   Fractal binary sequences: Tsallis thermodynamics and the Zipf law [J].
Denisov, S .
PHYSICS LETTERS A, 1997, 235 (05) :447-451
[4]  
FERRER R, UNPUB J QUANTITATIVE
[5]  
FERRER R, 0103016 SANT F
[6]  
LI W, 1992, IEEE T INFORM THEORY, V38, P6
[7]  
MANDELBROT B, 1966, READINGS MATH SOC SC
[8]  
MANDELBROT B., 1983, FRACTAL STRUCTURE NA
[9]   Explaining the uneven distribution of numbers in nature: the laws of Benford and Zipf [J].
Pietronero, L ;
Tosatti, E ;
Tosatti, V ;
Vespignani, A .
PHYSICA A, 2001, 293 (1-2) :297-304
[10]  
SIMON HA, 1955, BIOMETRIKA, V42, P435