Historical Document Layout Analysis Competition

Cited by: 43
Authors
Antonacopoulos, A. [1]
Clausner, C. [1]
Papadopoulos, C. [1]
Pletschacher, S. [1]
Affiliation
[1] Univ Salford, Pattern Recognit & Image Anal PRImA Res Lab, Sch Comp Sci & Engn, Manchester M5 4WT, Lancs, England
Source
11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011) | 2011
Keywords
layout analysis; performance evaluation; page segmentation; region classification; datasets; historical documents;
DOI
10.1109/ICDAR.2011.301
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification code
140502 [Artificial intelligence];
Abstract
This paper presents an objective comparative evaluation of layout analysis methods for scanned historical documents. It describes the competition (modus operandi, dataset and evaluation methodology) held in the context of ICDAR2011 and the International Workshop on Historical Document Imaging and Processing (HIP2011), presenting the results of the evaluation of four submitted methods. A commercial state-of-the-art system is also evaluated for comparison. Two scenarios are reported in this paper, one evaluating the ability of methods to accurately segment regions and the other evaluating the whole pipeline of segmentation and region classification (with a text extraction goal). The results indicate that there is a convergence to a certain methodology with some variations in the approach. However, there is still a considerable need to develop robust methods that deal with the idiosyncrasies of historical documents.
Pages: 1516-1520
Number of pages: 5