Indo-European languages tree by Levenshtein distance

Cited by: 79
Authors
Serva, M. [1]
Petroni, F. [2]
Affiliations
[1] Univ Aquila, Dipartimento Matemat, I-67010 Coppito, Italy
[2] GRAPES, B-4000 Liege, Belgium
DOI
10.1209/0295-5075/81/68005
Chinese Library Classification (CLC) number
O4 [Physics]
Discipline classification code
0702
Abstract
The evolution of languages closely resembles the evolution of haploid organisms. This similarity has recently been exploited (Gray R. D. and Atkinson Q. D., Nature, 426 (2003) 435; Gray R. D. and Jordan F. M., Nature, 405 (2000) 1052) to construct language trees. The key point is the definition of a distance between all pairs of languages that is the analogue of a genetic distance. Many methods have been proposed to define these distances; one of these, used by glottochronology, computes the distance from the percentage of shared "cognates". Cognates are words inferred to have a common historical origin, and subjective judgment plays a relevant role in the identification process. Here we push the analogy with evolutionary biology further and introduce a genetic distance between language pairs by considering a renormalized Levenshtein distance between words with the same meaning and averaging over all words contained in a Swadesh list (Swadesh M., Proc. Am. Philos. Soc., 96 (1952) 452). The subjectivity of the process is considerably reduced and reproducibility is greatly facilitated. We test our method on the Indo-European group, considering fifty different languages and the two hundred words of the Swadesh list for each of them. We obtain a tree that closely resembles the one published in Gray and Atkinson (2003), with some significant differences. Copyright (c) EPLA, 2008.
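The distance described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' code: the abstract says only that the Levenshtein distance is "renormalized", so dividing by the length of the longer word is an assumption made here, and the function names (`levenshtein`, `word_distance`, `language_distance`) are hypothetical. Each language is assumed to be given as a list of words aligned by meaning, one word per Swadesh-list entry.

```python
from typing import Sequence


def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions,
    or substitutions needed to turn `a` into `b`."""
    if len(a) < len(b):
        a, b = b, a  # keep the inner row as short as possible
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def word_distance(a: str, b: str) -> float:
    """Levenshtein distance scaled to [0, 1].
    Assumption: normalize by the length of the longer word."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 0.0
    return levenshtein(a, b) / longest


def language_distance(words_a: Sequence[str], words_b: Sequence[str]) -> float:
    """Average normalized word distance over two meaning-aligned word lists."""
    if len(words_a) != len(words_b) or not words_a:
        raise ValueError("word lists must be non-empty and of equal length")
    total = sum(word_distance(x, y) for x, y in zip(words_a, words_b))
    return total / len(words_a)
```

For example, `word_distance("night", "nacht")` is 2/5 = 0.4, since two substitutions are needed and the longer word has five characters. Applying `language_distance` to every pair of languages would give the distance matrix from which a tree can then be built by a clustering method of the reader's choice.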
Pages: 5
Related papers
6 in total
  • [1] Language-tree divergence times support the Anatolian theory of Indo-European origin
    Gray, RD
    Atkinson, QD
    [J]. NATURE, 2003, 426 (6965) : 435 - 439
  • [2] Kingman, JFC
    [J]. JOURNAL OF APPLIED PROBABILITY, 1982, 19 : 27. DOI: 10.2307/3213548
  • [3] On the genealogy of populations: trees, branches and offspring
    Serva, M
    [J]. JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2005, : 176 - 194
  • [4] Evolution of the most recent common ancestor of a population with no selection
    Simon, Damien
    Derrida, Bernard
    [J]. JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2006,
  • [5] Sneath, PHA
    NUMERICAL TAXONOMY
  • [6] Swadesh, M
    [J]. PROCEEDINGS OF THE AMERICAN PHILOSOPHICAL SOCIETY, 1952, 96 : 452. DOI: 10.2307/3143802