LSTM-CRF for Drug-Named Entity Recognition

Cited by: 80

Authors:

Zeng, Donghuo [1]

Sun, Chengjie [1]

Lin, Lei [1]

Liu, Bingquan [1]

Affiliations:

[1] Harbin Inst Technol, Sch Comp Sci & Technol, 92 West Dazhi St, Harbin 150001, Peoples R China

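Source:

ENTROPY | 2017 / Vol. 19 / No. 06

Funding:

National High Technology Research and Development Program of China (863 Program); National Natural Science Foundation of China;

Keywords: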

drug name entity recognition; information extraction; long short-term memory; conditional random field;

DOI:

10.3390/e19060283

Chinese Library Classification (CLC) number:

O4 [Physics];

Discipline classification code:

0702;

Abstract:

Drug-Named Entity Recognition (DNER) for biomedical literature is a fundamental facilitator of Information Extraction. For this reason, the DDIExtraction2011 (DDI2011) and DDIExtraction2013 (DDI2013) challenges each introduced a task aimed at recognizing drug names. State-of-the-art DNER approaches rely heavily on hand-engineered features and domain-specific knowledge, which are difficult to collect and define. Therefore, we propose an approach that automatically learns word-level and character-level features: a recurrent neural network using bidirectional long short-term memory (LSTM) with Conditional Random Field decoding (LSTM-CRF). Two kinds of word representations are used in this work: word embeddings, which are trained on a large amount of text, and character-based representations, which can capture orthographic features of words. Experimental results on the DDI2011 and DDI2013 datasets demonstrate the effectiveness of the proposed LSTM-CRF method. Our method outperforms the best system in the DDI2013 challenge.
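
To illustrate the Conditional Random Field decoding step mentioned in the abstract, here is a minimal sketch of Viterbi decoding over per-token tag scores. It is not the authors' implementation. The function name `viterbi_decode`, the three-tag scheme (O, B, I), and all score values are assumptions made up for this example; in an LSTM-CRF the emission scores would come from the bidirectional LSTM and the transition scores would be learned.

```python
from typing import List, Sequence, Tuple


def viterbi_decode(
    emissions: Sequence[Sequence[float]],
    transitions: Sequence[Sequence[float]],
) -> Tuple[List[int], float]:
    """Return the highest-scoring tag path and its score.

    emissions[t][j]   : score of tag j at token t (hypothetical BiLSTM output).
    transitions[i][j] : score of moving from tag i to tag j.
    """
    n_tags = len(transitions)
    score = list(emissions[0])  # best score of any path ending in each tag
    backpointers: List[List[int]] = []

    for emit in emissions[1:]:
        new_score, pointers = [], []
        for j in range(n_tags):
            # Best previous tag for reaching tag j at this token.
            best_i = max(range(n_tags), key=lambda i: score[i] + transitions[i][j])
            pointers.append(best_i)
            new_score.append(score[best_i] + transitions[best_i][j] + emit[j])
        score = new_score
        backpointers.append(pointers)

    # Follow the back-pointers from the best final tag.
    last = max(range(n_tags), key=lambda j: score[j])
    best_score = score[last]
    path = [last]
    for pointers in reversed(backpointers):
        last = pointers[last]
        path.append(last)
    path.reverse()
    return path, best_score


# Illustrative tag indices (assumed): 0 = O, 1 = B, 2 = I.
transitions = [
    [0.0, 0.0, -10000.0],  # O -> I is strongly penalized (invalid in BIO)
    [0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0],
]
# Made-up emission scores for a three-token sentence.
emissions = [
    [2.0, 1.0, 0.0],
    [0.0, 0.5, 2.0],
    [1.0, 0.0, 0.5],
]
path, best_score = viterbi_decode(emissions, transitions)
print(path, best_score)
```

With these made-up scores, picking the best tag for each token independently would give O, I, O, which is not a valid BIO sequence. Decoding with the transition scores selects B, I, O instead, which shows why sequence-level decoding can help for entity boundaries.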

Pages: 12