CHINESE SYNTACTIC AND TYPOLOGICAL PROPERTIES BASED ON DEPENDENCY SYNTACTIC TREEBANKS

Cited by: 7
Authors
Liu, Haitao [1]
Zhao, Yiyi [1]
Li, Wenwen [1]
Affiliation
[1] Communication University of China, Institute of Applied Linguistics, 100024 Beijing, People's Republic of China
Keywords
Chinese; dependency distance; dependency direction; dependency treebank; linguistic typology
DOI
10.2478/v10010-009-0025-3
Chinese Library Classification (CLC) number
H0 [Linguistics]
Discipline classification codes
030303; 0501; 050102
Abstract
This paper offers a quantitative analysis of the syntactic and typological properties of Chinese based on five Chinese dependency treebanks. The study shows that the mean dependency distance of Chinese is 2.84; that 40-50% of dependencies hold between non-adjacent words; that Chinese is a mixed language with a governor-final and SV-VO-AdjN preference; and that the mean dependency distance of governor-initial dependencies is greater than that of governor-final ones. Methodologically, the paper uses five treebanks with different text genres and annotation schemes as a resource for studying the syntactic features of a language. Drawing on several treebanks reduces the influence of any single corpus on the results, so the conclusions are more reliable and robust. If suitable treebanks are available, the method can easily be applied to other languages, which gives it a broad theoretical and cross-linguistic perspective.
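The three kinds of figures reported in the abstract (mean dependency distance, share of non-adjacent dependencies, share of governor-final dependencies) can be illustrated with a minimal sketch. It assumes that dependency distance is the absolute difference between the linear positions of a dependent and its governor, that the root is excluded, and that "governor-final" means the governor follows its dependent; the record itself does not spell out these definitions. The function name `summarize` and the toy sentence are hypothetical and are not taken from the paper or its treebanks.

```python
from typing import Dict, List, Tuple

# One dependency = (dependent_position, governor_position), 1-based word order.
# Assumption: the root word has no governor and is left out of the list.
Dependency = Tuple[int, int]


def summarize(deps: List[Dependency]) -> Dict[str, float]:
    """Compute simple dependency statistics for a list of dependencies."""
    if not deps:
        raise ValueError("at least one dependency is required")
    # Assumed definition: distance = |governor position - dependent position|.
    distances = [abs(gov - dep) for dep, gov in deps]
    n = len(deps)
    return {
        "mean_distance": sum(distances) / n,
        # Adjacent words have distance 1, so anything larger is non-adjacent.
        "non_adjacent_share": sum(1 for d in distances if d > 1) / n,
        # Governor-final (assumed): the governor comes after its dependent.
        "governor_final_share": sum(1 for dep, gov in deps if gov > dep) / n,
    }


# Hypothetical toy sentence "The boy has an apple", with "has" (word 3) as root:
# The->boy, boy->has, an->apple, apple->has.
toy = [(1, 2), (2, 3), (4, 5), (5, 3)]
stats = summarize(toy)
print(stats)
```

For a whole treebank, the same function would be applied to the pooled dependencies of all sentences; computing it separately per treebank is one way to see how far genre and annotation scheme move the figures.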
Pages: 509-523
Number of pages: 15