基于历史模型的蒙古文自动词性标注研究

被引：1

作者：

赵建东

高光来

飞龙

机构：

[1] 内蒙古大学计算机学院

来源：

中文信息学报 | 2013年 / 05期

关键词：

历史模型; lookahead; 蒙古文; 自动词性标注;

D O I：

暂无

中图分类号：

TP391.1 [文字信息处理];

学科分类号：

摘要：

蒙古文自动词性标注方面的研究工作较少,制约了对蒙古文的机器翻译、语法分析及语义分析等领域的深入研究。针对于此,提出了加入lookahead学习机制的基于历史模型的蒙古文自动词性标注方法。实验表明,加入lookahead学习机制的基于历史模型的蒙古文自动词性标注方法对蒙古文的未登录词、集内词、总体词自动词性标注的准确率分别达到了71.276 6%、99.148 2%、95.301 0%,说明此方法可以较好地进行蒙古文的自动词性标注。

引用

页码：156 / 159+165 +165

页数：5

共 13 条

[1] Fast and accurate part of speech tagging:The SVM approach. Gimenez J,Marquez I. Proceedings of the 4th International Conference on Recent Advances in Natural Language Processing . 2003
[2] Maximum Entropy Modeling Toolkit for Python and C++[CP]. Zhang Le. http://homepages.inf.ed.ac.uk/lzhang10/maxent_toolkit.html . 2011
[3] Lookahead Part-Of-Speech Tagger[CP]. Yoshimasa Tsuruoka. http://www.logos.ic.i.u-tokyo.ac.jp/-tsuruoka/lapos/ . 2012
[4] Discriminative training methods for hidden markov models:theory and experiments with perceptron algorithms. Michael Collins. Proceedings of EMNLP . 2002
[5] Learning with Lookahead:Can History-Based Models Rival Globally Optimized Models?. TSURUOKA Y,MIYAO Y,KAZAMA J. Proceedings of the Fif-teenth Conference on Computational Natural Language Learning . 2011
[6] A maximum entropy model of part-of-speech tagging. A. Ratnaparkhi. Proc. EMNLP . 1996
[7] Transformation-based error-driven learning and natural language processing: a case study in part of speech tagging. Brill E. Computational Linguistics . 1995
[8] 改进的基于转换方法的拉丁蒙文词性标注
胡冠龙
张建
李淼
[J]. 计算机应用, 2007, (04) : 963 - 965
[9] 基于HMM的蒙古文自动词性标注研究
艳红
王斯日古楞
[J]. 内蒙古师范大学学报(自然科学汉文版), 2010, 39 (02) : 206 - 209
[10] 融合形态特征的最大熵蒙古文词性标注模型
张贯虹
斯劳格劳
乌达巴拉
[J]. 计算机研究与发展, 2011, 48 (12) : 2385 - 2390

← 1 2 →