Automatic text segmentation and text recognition for video indexing

Cited by: 89
Authors
Lienhart, R [1]
Effelsberg, W
Affiliations
[1] Intel Corp, Microprocessor Res Labs, Santa Clara, CA 95052 USA
[2] Univ Mannheim, D-68131 Mannheim, Germany
Keywords
video processing; character segmentation; text recognition; OCR; video indexing; video content analysis
DOI
10.1007/s005300050006
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Efficient indexing and retrieval of digital video is an important function of video databases. One powerful index for retrieval is the text appearing in the videos, which enables content-based browsing. We present our new methods for automatic segmentation of text in digital videos. The algorithms we propose make use of typical characteristics of text in videos in order to enable and enhance segmentation performance. The unique features of our approach are the tracking of characters and words over their complete duration of occurrence in a video and the integration of the multiple bitmaps of a character over time into a single bitmap. The output of the text segmentation step is then passed directly to a standard OCR software package in order to translate the segmented text into ASCII. A straightforward indexing and retrieval scheme is also introduced. It is used in the experiments to demonstrate that the proposed text segmentation algorithms, together with existing text recognition algorithms, are suitable for indexing and retrieval of relevant video sequences in and from a video database. Our experimental results are very encouraging and suggest that these algorithms can be used in video retrieval applications as well as to recognize higher level semantics in videos.
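
The abstract names the integration of a character's multiple bitmaps over time into a single bitmap but does not say how the bitmaps are combined. The following is a minimal sketch of one plausible rule, a pixel-wise majority vote over already aligned binary bitmaps. The function name `integrate_bitmaps` and the majority-vote rule are illustrative assumptions, not the authors' published method.

```python
import numpy as np


def integrate_bitmaps(bitmaps):
    """Merge aligned binary bitmaps of one tracked character into one bitmap.

    Assumption: a pixel is kept as text if it is set in more than half of
    the frames. The paper's actual combination rule may differ.
    """
    # Stack the per-frame bitmaps into shape (frames, height, width).
    stack = np.stack([np.asarray(b, dtype=bool) for b in bitmaps])
    # Count, for every pixel, in how many frames it is marked as text.
    votes = stack.sum(axis=0)
    # Strict majority: the pixel survives only if most frames agree.
    return votes * 2 > stack.shape[0]


# Three noisy observations of the same 2x2 character region.
frames = [
    [[1, 0], [1, 1]],
    [[1, 0], [0, 1]],
    [[1, 1], [0, 1]],
]
merged = integrate_bitmaps(frames)
```

Pixels that appear in only one of the three frames are treated as noise and dropped, while pixels present in most frames are kept, which is the kind of stabilisation a downstream OCR step could benefit from.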
Pages: 69-81
Page count: 13