LSTM-CRF for Drug-Named Entity Recognition

被引:80
作者
Zeng, Donghuo [1 ]
Sun, Chengjie [1 ]
Lin, Lei [1 ]
Liu, Bingquan [1 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, 92 West Dazhi St, Harbin 150001, Peoples R China
基金
国家高技术研究发展计划(863计划); 中国国家自然科学基金;
关键词
drug name entity recognition; information extraction; long short-term memory; conditional random field;
D O I
10.3390/e19060283
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Drug-Named Entity Recognition (DNER) for biomedical literature is a fundamental facilitator of Information Extraction. For this reason, the DDIExtraction2011 (DDI2011) and DDIExtraction2013 (DDI2013) challenge introduced one task aiming at recognition of drug names. State-of-the-art DNER approaches heavily rely on hand-engineered features and domain-specific knowledge which are difficult to collect and define. Therefore, we offer an automatic exploring words and characters level features approach: a recurrent neural network using bidirectional long short-term memory (LSTM) with Conditional Random Fields decoding (LSTM-CRF). Two kinds of word representations are used in this work: word embedding, which is trained from a large amount of text, and character-based representation, which can capture orthographic feature of words. Experimental results on the DDI2011 and DDI2013 dataset show the effect of the proposed LSTM-CRF method. Our method outperforms the best system in the DDI2013 challenge.
引用
收藏
页数:12
相关论文
共 40 条
[1]  
[Anonymous], 2010, P PYTH SCI C
[2]  
[Anonymous], rmsprop: divide the gradient by a running average of its recent magnitude
[3]  
[Anonymous], 2013, ASS COMPUT LINGUIST
[4]  
[Anonymous], 2013, P 7 INT WORKSH SEM E
[5]  
[Anonymous], 2016, P NAACL HLT
[6]  
[Anonymous], 2005, P 43 ANN M ASS COMP
[7]  
[Anonymous], 2015, P 6 INT WORKSHOP HLT, DOI DOI 10.18653/V1/W15-2608
[8]   LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT [J].
BENGIO, Y ;
SIMARD, P ;
FRASCONI, P .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02) :157-166
[9]  
Chalapathy R., ARXIV160907585
[10]  
Collobert R, 2011, J MACH LEARN RES, V12, P2493