Text information extraction in images and video: a survey

Cited by: 483
Authors
Jung, K [1]
Kim, KI
Jain, AK
Affiliations
[1] Soongsil Univ, Sch Media, Coll Informat, Seoul 156743, South Korea
[2] Korea Adv Inst Sci & Technol, AI Lab, Dept Comp Sci, Seoul, South Korea
[3] Michigan State Univ, Comp Sci & Engn Dept, E Lansing, MI 48824 USA
Keywords
text information extraction; text detection; text localization; text tracking; text enhancement; OCR
DOI
10.1016/j.patcog.2003.10.012
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Text data present in images and video contain useful information for automatic annotation, indexing, and structuring of images. Extraction of this information involves detection, localization, tracking, extraction, enhancement, and recognition of the text from a given image. However, variations of text due to differences in size, style, orientation, and alignment, as well as low image contrast and complex background make the problem of automatic text extraction extremely challenging. While comprehensive surveys of related problems such as face detection, document analysis, and image & video indexing can be found, the problem of text information extraction is not well surveyed. A large number of techniques have been proposed to address this problem, and the purpose of this paper is to classify and review these algorithms, discuss benchmark data and performance evaluation, and to point out promising directions for future research. © 2004 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
Pages: 977-997
Page count: 21
Related papers
88 in total
[1]  
[Anonymous], 1995, CMUCS95186
[2]  
[Anonymous], P INT C IM PROC ICIP
[3]   A survey on the use of pattern recognition methods for abstraction, indexing and retrieval of images and video [J].
Antani, S ;
Kasturi, R ;
Jain, R .
PATTERN RECOGNITION, 2002, 35 (04) :945-965
[4]  
Antani S., 1999, CSE99016
[5]  
Antani S., 2000, P IAPR WORKSH DOC AN, P506
[6]  
ANTANI SK, 2001, THESIS PENNSYLVANIA
[7]  
CHADDHA N, 1994, CONF REC ASILOMAR C, P1356, DOI 10.1109/ACSSC.1994.471679
[8]  
Chen D, 2000, SURVEY TEXT DETECTIO
[9]  
Chen DT, 2001, PROC CVPR IEEE, P621
[10]  
Chen DT, 2002, INT C PATT RECOG, P227, DOI 10.1109/ICPR.2002.1047438