Biomedical events extraction using the hidden vector state model

被引:14
作者
Zhou, Deyu [1 ]
He, Yulan [2 ]
机构
[1] Southeast Univ, Sch Comp Sci & Engn, Nanjing 210093, Jiangsu Provinc, Peoples R China
[2] Open Univ, Knowledge Media Inst, Milton Keynes MK7 6AA, Bucks, England
关键词
Abstract annotation; Hidden vector state model; Semantic parsing; Biomedical events extraction; PROTEIN-PROTEIN INTERACTIONS;
D O I
10.1016/j.artmed.2011.08.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Objective: Biomedical events extraction concerns about events describing changes on the state of biomolecules from literature. Comparing to the protein-protein interactions (PPIs) extraction task which often only involves the extraction of binary relations between two proteins, biomedical events extraction IS much harder since it needs to deal with complex events consisting of embedded or hierarchical relations among proteins, events, and their textual triggers. In this paper, we propose an information extraction system based on the hidden vector state (HVS) model, called HVS-BioEvent, for biomedical events extraction, and investigate its capability in extracting complex events. Methods and material: HVS has been previously employed for extracting PPIs. In HVS-BioEvent, we propose an automated way to generate abstract annotations for HVS training and further propose novel machine learning approaches for event trigger words identification, and for biomedical events extraction from the HVS parse results. Results: Our proposed system achieves an F-score of 49.57% on the corpus used in the BioNLP'09 shared task, which is only 2.38% lower than the best performing system by UTurku in the BioNLP'09 shared task. Nevertheless, HVS-BioEvent outperforms UTurku's system on complex events extraction with 36.57% vs. 30.52% being achieved for extracting regulation events, and 40.61% vs. 38.99% for negative regulation events. Conclusions: The results suggest that the HVS model with the hierarchical hidden state structure is indeed more;suitable for complex event extraction since it could naturally model embedded structural context in sentences. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:205 / 213
页数:9
相关论文
共 26 条
[1]  
Airola A., 2008, P WORKSH CURR TRENDS, P1, DOI DOI 10.3115/1572306.1572308
[2]  
[Anonymous], ADV NEURAL INFORM PR
[3]  
[Anonymous], P 3 INT S SEM MIN BI
[4]  
[Anonymous], P WORKSH BIONLP
[5]  
[Anonymous], 2009, P WORKSHOP BIONLP
[6]  
[Anonymous], P 43 ANN M ASS COMP
[7]  
[Anonymous], P WORKSH BIONLP
[8]  
Bjrne J., 2009, P BIONLP 09 SHARED T, P10
[9]  
Bunescu R.C., 2005, P C HUM LANG TECHN E, DOI DOI 10.3115/1220575.1220666
[10]   PreBIND and Textomy - mining the biomedical literature for protein-protein interactions using a support vector machine [J].
Donaldson, I ;
Martin, J ;
de Bruijn, B ;
Wolting, C ;
Lay, V ;
Tuekam, B ;
Zhang, SD ;
Baskin, B ;
Bader, GD ;
Michalickova, K ;
Pawson, T ;
Hogue, CWV .
BMC BIOINFORMATICS, 2003, 4 (1)