Using a Chinese treebank to measure dependency distance

被引:61
作者
Liu, Haitao [1 ]
Hudson, Richard [1 ]
Feng, Zhiwei [1 ]
机构
[1] CUC, Beijing, Peoples R China
关键词
Dependency syntax; Chinese treebank; dependency distance; COMPLEXITY;
D O I
10.1515/CLLT.2009.007
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
This article describes a method for calculating the 'dependency distance' between the words in a text - i.e. the number of words that separate each word from the word on which it depends syntactically - and reports the results of applying this method to a Chinese treebank. This study shows that Chinese dependencies tend strongly to be governor-final and that the mean dependency distance of words is much higher for Chinese than for other languages that have been studied including English, German and Japanese. It is unclear whether this difference means that Chinese is syntactically more difficult to process.
引用
收藏
页码:161 / 174
页数:14
相关论文
共 24 条
  • [1] Abeille A., 2003, Treebank: Building and using Parsed Corpora
  • [2] [Anonymous], UCL WORK PAERS LINGU
  • [3] Buch-Kromann M., 2006, Discontinuous grammar: A dependency-based model of human parsing and language learning
  • [4] Cancho RFI, 2004, PHYS REV E, V70, DOI 10.1103/PhysRevE.70.056135
  • [5] Collier AK, 1996, MAGN RESON CHEM, V34, P191
  • [6] EPPLER E, 2004, SYNTAX GERMAN UNPUB
  • [7] Linguistic complexity: locality of syntactic dependencies
    Gibson, E
    [J]. COGNITION, 1998, 68 (01) : 1 - 76
  • [8] Constraints on sentence comprehension
    Gibson, E
    Pearlmutter, NJ
    [J]. TRENDS IN COGNITIVE SCIENCES, 1998, 2 (07) : 262 - 268
  • [9] Consequences of the serial nature of linguistic input for sentenial complexity
    Grodner, D
    Gibson, E
    [J]. COGNITIVE SCIENCE, 2005, 29 (02) : 261 - 290
  • [10] Heringer H. J., 1980, Syntax. Fragen-Losungen-Alternativen