Using conditional random fields for result identification in biomedical abstracts

被引:11
作者
Lin, Ryan T. K. [2 ]
Dai, Hong-Jie [2 ,3 ]
Bow, Yue-Yang [2 ]
Chiu, Justin Liang-Te [4 ]
Tsai, Richard Tzong-Han [1 ]
机构
[1] Yuan Ze Univ, Dept Comp Sci & Engn, Chungli, Taiwan
[2] Acad Sinica, Inst Informat Sci, Taipei, Taiwan
[3] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu 30043, Taiwan
[4] Natl Taiwan Univ, Dept Comp Sci & Engn, Taipei 10764, Taiwan
关键词
Result identification; sequence labeling; conditional random fields; EXTRACTION; RETRIEVAL;
D O I
10.3233/ICA-2009-0321
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
The abstracts of biomedical papers usually contain three sections: objective, methods, and results-conclusion. The results-conclusion section is the most important because it usually describes the main contribution of a paper. Unfortunately, not all biomedical journals follow this three-section format. In this paper, we propose a machine learning (ML) based approach to automatically identify the results-conclusion section. The results-conclusion section identification problem is formulated as a sequence labeling task. Four feature sets, including Position, Named Entity, Tense, and Word Frequency, are employed with Conditional Random Fields (CRFs) as the underlying ML model. The experiment results show that the proposed approach can achieve F-measure, precision, and recall of 97.08%, 96.63% and 97.53%, respectively.
引用
收藏
页码:339 / 352
页数:14
相关论文
共 44 条
[1]
[Anonymous], 2008, P 3 INT JOINT C NAT
[2]
[Anonymous], 2007, Introduction to Statistical Relational Learning, DOI DOI 10.1677/JME-08-0087
[3]
[Anonymous], 2001, P 18 INT C MACH LEAR, DOI DOI 10.5555/645530.655813
[4]
*ANS I, 1979, AM NAT STAND WRIT AB
[5]
Besnard P, 2008, INTEGR COMPUT-AID E, V15, P351
[6]
Extraction of semantic biomedical relations from text using conditional random fields [J].
Bundschus, Markus ;
Dejori, Mathaeus ;
Stetter, Martin ;
Tresp, Volker ;
Kriegel, Hans-Peter .
BMC BIOINFORMATICS, 2008, 9 (1)
[7]
Chen QF, 2008, INTEGR COMPUT-AID E, V15, P369
[8]
Chien HL, 2002, EIGHTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-02)/FOURTEENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-02), PROCEEDINGS, P786
[9]
Dai H., 2007, P 2 BIOCREATIVE CHAL, P69
[10]
Fukuda K, 1998, Pac Symp Biocomput, P707