Multi-Level Attention Based BLSTM Neural Network for Biomedical Event Extraction

被引:20
作者
He, Xinyu [1 ]
Li, Lishuang [2 ]
Song, Xingchen [3 ]
Huang, Degen [2 ]
Ren, Fuji [4 ]
机构
[1] Dalian Univ Technol, Sch Comp Software & Theory, Dalian, Peoples R China
[2] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian, Peoples R China
[3] Dalian Univ Technol, Comp Sci & Technol, Dalian, Peoples R China
[4] Univ Tokushima, Tokushima 7708501, Japan
基金
中国国家自然科学基金;
关键词
event extraction; trigger detection; argument detection; BLSTM neural network; multi-level attention;
D O I
10.1587/transinf.2018EDP7268
中图分类号
TP [自动化技术、计算机技术];
学科分类号
080201 [机械制造及其自动化];
摘要
Biomedical event extraction is an important and challenging task in Information Extraction, which plays a key role for medicine research and disease prevention. Most of the existing event detection methods are based on shallow machine learning methods which mainly rely on domain knowledge and elaborately designed features. Another challenge is that some crucial information as well as the interactions among words or arguments may be ignored since most works treat words and sentences equally. Therefore, we employ a Bidirectional Long Short Term Memory (BLSTM) neural network for event extraction, which can skip handcrafted complex feature extraction. Furthermore, we propose a multi-level attention mechanism, including word level attention which determines the importance of words in a sentence, and the sentence level attention which determines the importance of relevant arguments. Finally, we train dependency word embeddings and add sentence vectors to enrich semantic information. The experimental results show that our model achieves an F-score of 59.61% on the commonly used dataset (MLEE) of biomedical event extraction, which outperforms other state-of-the-art methods.
引用
收藏
页码:1842 / 1850
页数:9
相关论文
共 19 条
[1]
[Anonymous], P INT JOINT WORKSH N
[2]
[Anonymous], 2014, Neural Information Processing Systems
[3]
Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
[4]
Bjorne J., 2014, TUCS DISSERTATIONS, V178, P1
[5]
Bjorne J., 2009, COMPUT INTELL-US, V27, P541, DOI [DOI 10.1111/J.1467-8640.2011.00399.X, 10.1111/j.1467-8640.2011.00399.x]
[6]
Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
[7]
Levy O., 2014, M ASS COMP LING BALT
[8]
Li L., 2016, CHIN NAT C CHIN COMP
[9]
Miwa M., 2010, Proceedings of the 23rd International Conference on Computational Linguistics, P779
[10]
Embedding assisted prediction architecture for event trigger identification [J].
Nie, Yifan ;
Rong, Wenge ;
Zhang, Yiyuan ;
Ouyang, Yuanxin ;
Xiong, Zhang .
JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2015, 13 (03)