Citation Metadata Extraction via Deep Neural Network-based Segment Sequence Labeling

被引:5
作者
An, Dong [1 ]
Gao, Liangcai [1 ]
Jiang, Zhuoren [2 ]
Liu, Runtao [1 ]
Tang, Zhi [1 ]
机构
[1] Peking Univ, Inst Comp Sci & Technol, Beijing 100871, Peoples R China
[2] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510006, Guangdong, Peoples R China
来源
CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT | 2017年
基金
中国博士后科学基金;
关键词
Citation Metadata Extraction; Academic Information Extraction; Sequence Labeling; Information Retrieval;
D O I
10.1145/3132847.3133074
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Citation metadata extraction plays an important role in academic information retrieval and knowledge management. Current works on this task generally use rule-based, template-based or learning-based approaches but these methods usually either rely on handcrafted features or are limited with domains. Recently, neural networks have shown strong ability in addressing sequence labeling tasks. In this paper, we propose a sequence labeling model for citation metadata extraction, called segment sequence labeling. Instead of inferring at word level, the input sequence is first divided into segments, and then features of the segments are computed to infer the label sequence of the segments. We first run experiments to validate the effectiveness of different parts of the model by comparing it with a CRF-based model and a neural network-based model. Experimental results show our model beats both models on most fields. Besides, our model is evaluated on public datasets UMass [1] and Cora [12] and has achieved significant performance improvement. Our model was trained on the data which were generated from BibTeX files collected on the Web and annotated automatically.
引用
收藏
页码:1967 / 1970
页数:4
相关论文
共 12 条
[1]  
Anzaroot Sam, 2013, ICML WORKSH PEER REV
[2]   BibPro: A Citation Parser Based on Sequence Alignment [J].
Chen, Chien-Chih ;
Yang, Kai-Hsiang ;
Chen, Chuen-Liang ;
Ho, Jan-Ming .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (02) :236-250
[3]  
Chiu JP., 2016, T ASS COMPUTATIONAL, V4, P357, DOI [DOI 10.1162/TACLA00104, 10.1162/tacl_a_00104, DOI 10.1162/TACL_A_00104]
[4]  
Councill IG, 2008, SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, P661
[5]  
Han H, 2003, ACM-IEEE J CONF DIG, P37
[6]  
Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
[7]  
Huang Z, 2015, ARXIV150801991
[8]  
Lafferty John, 2001, INT C MACH LEARN ICM
[9]  
Lample G., 2016, P NAACL HLT, P260
[10]  
Ma XZ, 2016, PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, P1064