Holistic word recognition for handwritten historical documents

被引:134
作者
Lavrenko, V [1 ]
Rath, TM [1 ]
Manmatha, R [1 ]
机构
[1] Univ Massachusetts, Ctr Intelligent Informat Retrieval, Amherst, MA 01003 USA
来源
FIRST INTERNATIONAL WORKSHOP ON DOCUMENT IMAGE ANALYSIS FOR LIBRARIES, PROCEEDINGS | 2004年
关键词
D O I
10.1109/DIAL.2004.1263256
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most offline handwriting recognition approaches proceed by segmenting words into smaller pieces (usually characters) which are recognized separately. The recognition result of a word is then the composition of the individually recognized parts. Inspired by results in cognitive psychology, researchers have begun to focus on holistic word recognition approaches: Here we present a holistic word recognition approach for single-author historical documents, which is motivated by the fact that for severely degraded documents a segmentation of words into characters will produce very poor results. The quality of the original documents does not allow its to recognize them with high accuracy-our goal here is to produce transcriptions that will allow successful retrieval of images, which has been shown to be feasible even in such noisy environments. We believe that this is the first, systematic approach to recognizing words in historical manuscripts with extensive experiments. Our experiments show recognition accuracy of 65%, which exceeds performance of other systems which operate on non-degraded input images (non historical documents).
引用
收藏
页码:278 / 287
页数:10
相关论文
共 19 条
[1]   A survey of methods and strategies in character segmentation [J].
Casey, RG ;
Lecolinet, E .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1996, 18 (07) :690-706
[2]  
Cattell J.M., 1886, Mind, V11, P377, DOI DOI 10.1093/MIND/OS-XI.42.220
[3]  
FALOUTSOS C, 1999, MODERN INFORMATION R
[4]   VITERBI ALGORITHM [J].
FORNEY, GD .
PROCEEDINGS OF THE IEEE, 1973, 61 (03) :268-278
[5]  
GAROFOLO JS, 2000, P RIAO 2000 CONT BAS, V1, P1
[6]  
Harding SM, 1997, LECT NOTES COMPUT SC, V1324, P345, DOI 10.1007/BFb0026737
[7]  
Ishidera E, 2003, PROC INT CONF DOC, P1173
[8]  
Kim U., 1999, Asian Journal of Social Psychology, V2, P1, DOI 10.1111/1467-839X.00023
[9]  
Kornai A., 1996, P 5 INT WORKSH FRONT
[10]   The role of holistic paradigms in handwritten word recognition [J].
Madhvanath, S ;
Govindaraju, V .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (02) :149-164