Automatic text location in images and video frames

被引:317
作者
Jain, AK [1 ]
Yu, B [1 ]
机构
[1] Michigan State Univ, Dept Comp Sci, E Lansing, MI 48824 USA
关键词
automatic text location; Web search; image database; video indexing; multivalued image decomposition; connected component analysis;
D O I
10.1016/S0031-3203(98)00067-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Textual data is very important in a number of applications such as image database indexing and document understanding. The goal of automatic text location without character recognition capabilities is to extract image regions that contain only text. These regions can then be either Fed to an optical character recognition module or highlighted for a user. Text location is a very difficult problem because the characters in text can vary in font, size, spacing, alignment, orientation, color and texture. Further, characters are often embedded in a complex background in the image. We propose a new text location algorithm that is suitable in a number of applications, including conversion of newspaper advertisements from paper documents to their electronic versions, World Wide Web search, color image indexing and Video indexing. In many of these applications, it is not necessary to extract all the text, so we emphasize on extracting important text with large size and high contrast. Our algorithm is very fast and has been shown to be successful in extracting important text in a large number of test images. (C) 1998 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:2055 / 2076
页数:22
相关论文
共 25 条
[1]  
[Anonymous], 1993, JPEG still image compression standard
[2]   A ROBUST ALGORITHM FOR TEXT STRING SEPARATION FROM MIXED TEXT GRAPHICS IMAGES [J].
FLETCHER, LA ;
KASTURI, R .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1988, 10 (06) :910-918
[3]  
GORDON AS, 1995, P INT JOINT C ART IN, P23
[4]  
GRAY M, INTERNET STAT GROWTH
[5]  
Jain A. K., 1992, Machine Vision and Applications, V5, P169, DOI 10.1007/BF02626996
[6]   Image retrieval using color and shape [J].
Jain, AK ;
Vailaya, A .
PATTERN RECOGNITION, 1996, 29 (08) :1233-1244
[7]   Document representation and its application to page decomposition [J].
Jain, AK ;
Yu, B .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (03) :294-308
[8]  
Jain K, 1988, Algorithms for clustering data
[9]  
Lee E R, 1994, P INT C IM PROC, P301
[10]   Automatic text recognition in digital videos [J].
Lienhart, R ;
Stuber, F .
IMAGE AND VIDEO PROCESSING IV, 1996, 2666 :180-188