Metadata and data structures for the historical newspaper digital library

Cited by: 12
Authors
Allen, RB [1]
Schalow, J [1]
Affiliation
[1] Univ Maryland, Coll Lib & Informat Serv, College Pk, MD 20742 USA
Source
PROCEEDINGS OF THE EIGHTH INTERNATIONAL CONFERENCE ON INFORMATION KNOWLEDGE MANAGEMENT, CIKM'99 | 1999
Keywords
digital libraries; history; metadata; newspapers; OCR
DOI
10.1145/319950.319971
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
We examine metadata and data-structure issues for the Historical Newspaper Digital Library. This project proposes to digitize several years' worth of historical newspapers and then apply OCR and linguistic processing to them. Newspapers are very complex information objects, so developing a rich description of their content is challenging. In addition to frameworks for the logical structure and physical layout, we propose metadata relevant to the image processing and to the historians who will use this collection. Finally, we consider how the metadata infrastructure might be managed as it evolves with improved text-processing capabilities and how an infrastructure might be developed to support a community of users.
Pages: 147-153
Page count: 7