Reference metadata extraction using a hierarchical knowledge representation framework

被引:38
作者
Day, Min-Yuh
Tsai, Richard Tzong-Han
Sung, Cheng-Lung
Hsieh, Chiu-Chen
Lee, Cheng-Wei
Wu, Shih-Hung
Wu, Kun-Pin
ong, Chorng-Shy Ong
Hsu, Wen-Lian [1 ]
机构
[1] Acad Sinica, Inst Informat Sci, Taipei 115, Taiwan
[2] Natl Taiwan Univ, Dept Informat Management, Taipei 106, Taiwan
[3] Chaoyang Univ Technol, Dept CSIE, Taichung 413, Taiwan
关键词
reference extraction; metadata extraction; knowledge representation framework; INFOMAP;
D O I
10.1016/j.dss.2006.08.006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The integration of bibliographical information on scholarly publications available on the Internet is an important task in the academic community. Accurate reference metadata extraction from such publications is essential for the integration of metadata. from heterogeneous reference sources. In this paper, we propose a hierarchical template-based reference metadata extraction method for scholarly publications. We adopt a hierarchical knowledge representation framework called INFOMAP, which automatically extracts metadata. The experimental results show that, by using INFOMAP, we can extract author, title, journal, volume, number (issue), year, and page information from different kinds of reference styles with a high degree of precision. The overall average accuracy is 92.39% for the six major reference styles compared in this study. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:152 / 167
页数:16
相关论文
共 25 条
  • [1] Agichtein Eugene., 2004, P ACM SIGKDD INT C K, P20
  • [2] BORKAR VR, 2001, P ACM SIGMOD INT C M, P175
  • [3] BOUCKAERT RR, 2002, WORKSH TEXT LEARN TE
  • [4] Bradford S.C., 1934, ENGINEERING, V137, P85, DOI DOI 10.1177/016555158501000407
  • [5] Burnett K, 1999, J AM SOC INFORM SCI, V50, P1209, DOI 10.1002/(SICI)1097-4571(1999)50:13<1209::AID-ASI6>3.0.CO
  • [6] 2-Y
  • [7] Chowdhury GG, 1999, LIBR TRENDS, V48, P182
  • [8] Davenport TH, 1998, SLOAN MANAGE REV, V39, P43
  • [9] Day MY, 2005, 5TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED LEARNING TECHNOLOGIES, PROCEEDINGS, P318
  • [10] Ding Y, 1999, P 2 AS DIG LIB C TAI, P47