Biological Entity Recognition Combining Conditional Random Fields with Contextual Cues

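Cited by: 3
Authors
Yang Zhihao (杨志豪)
Lin Hongfei (林鸿飞)
Li Yanpeng (李彦鹏)
Affiliation
[1] Department of Computer Science and Engineering, Dalian University of Technology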
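Keywords
text mining; biological entity recognition; conditional random fields; contextual cues
DOI
Not available
Chinese Library Classification (CLC) number
TP391.4 [Pattern recognition and devices]
Discipline classification codes
0811; 081101; 081104; 1405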
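Abstract
This paper presents a method for recognizing biological entities such as genes and proteins in biomedical literature. The method is based on conditional random fields (CRFs): it selects appropriate features for entity recognition and then uses contextual cues to further improve recognition performance. Experimental results show that introducing contextual cues improves recognition performance by nearly 3% over the CRF-only baseline, yielding good final recognition results.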
Pages: 203-204, 208
Page count: 3
Related papers
5 in total
  • [1] Developing a Robust Part-of-Speech Tagger for Biomedical Text. Tsuruoka Y, Tateishi Y, Kim J D, et al. Proc. of the 10th Panhellenic Conference on Informatics. 2005
  • [2] Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data. Lafferty J, McCallum A, Pereira F. Proc. of the International Conference on Machine Learning. 2001
  • [3] Exploring Deep Knowledge Resources in Biomedical Name Recognition. Zhou Guodong, Su Jian. Proc. of the Joint Workshop on Natural Language Processing in Biomedicine and Its Applications. 2004
  • [4] Two-phase Biomedical NE Recognition Based on SVMs. Lee K J, Hwang Y S, Rim H C. Proc. of the Workshop on Natural Language Processing in Biomedicine. 2003
  • [5] Biomedical Named Entity Recognition Using Conditional Random Fields and Novel Feature Sets. Settles B. Proc. of the Joint Workshop on Natural Language Processing in Biomedicine and Its Applications. 2004