Duplicate detection algorithms of bibliographic descriptions

被引:8
作者
Sitas, Anestis [1 ,2 ]
Kapidakis, Sarantos [3 ]
机构
[1] Aristotle Univ Thessaloniki, Sch Philosophy, Thessaloniki, Greece
[2] Technol Inst Thessaloniki, Sch Lib Sci, Thessaloniki, Greece
[3] Ionian Univ, Arch & Lib Sci Dept, Paleo Anaktoro, Greece
关键词
cataloguing; algorithms; bibliographic systems; records management;
D O I
10.1108/07378830810880379
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
Purpose - The purpose of this paper is to focus on duplicate record detection algorithms used for detection in bibliographic databases. Design/methodology/approach - Individual algorithms, their application process for duplicate detection and their results are described based on available literature (published articles), information found at various library web sites and follow-up e-mail communications. Findings - Algorithms are categorized according to their application as a process of a single step or two consecutive steps. The results of deletion, merging, and temporary and virtual consolidation of duplicate records are studied. Originality/value - The paper presents an overview of the duplication detection algorithms and an up-to-date state of their application in different library systems.
引用
收藏
页码:287 / 301
页数:15
相关论文
共 14 条
[1]  
COUSINS S, 2006, COPAC SERVICE
[2]  
Cousins SA, 1998, J INFORM SCI, V24, P231, DOI 10.1177/016555159802400402
[3]  
COYLE K, 1992, 6 U CAL DLA
[4]  
COYLE K, 1985, ASIS P, V4, P77
[5]  
HICKEY TB, 1979, J LIB AUTOMATION, V2, P125
[6]  
Hunstad S., 1988, Cataloging & Classification Quarterly, V8, P239, DOI 10.1300/J104v08n03_17
[7]  
*ILCSO, 2004, US OCLC ILLINET ONL
[8]  
LAZINGER SS, 1994, INFORM TECHNOL LIBR, V13, P213
[9]  
Meir DD, 1998, INFORM TECHNOL LIBR, V17, P116
[10]  
ONEILL E, 1990, DUPLICATE RECORDS ON