Automatic text detection and tracking in digital video

被引:345
作者
Li, HP [1 ]
Doermann, D
Kia, O
机构
[1] Univ Maryland, Ctr Automat Res, Language & Media Proc Lab, College Pk, MD 20742 USA
[2] Natl Inst Stand & Technol, Gaithersburg, MD 20899 USA
关键词
digital libraries; neural network; text detection; text tracking; video indexing;
D O I
10.1109/83.817607
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text that appears in a scene or is graphically added to video can provide an important supplemental source of index information as well as clues for decoding the video's structure and for classification. In this work, we present algorithms for detecting and tracking text in digital video. Our system implements a scale-spate feature extractor that feeds an artificial neural processor to detect text blocks. Our text tracking scheme consists of two modules: a sum of squared difference (SSD) -based module to find the initial position and a contour-based module to refine the position. Experiments conducted with a variety of video sources show that our scheme can detect and track text robustly.
引用
收藏
页码:147 / 156
页数:10
相关论文
共 18 条
[1]  
CHELLAPPA R, 1992, NEURAL NETWORKS SIGN, P37
[2]  
DAVIS M, P ACM MULT 94, P478
[3]   Multiscale segmentation of unstructured document pages using soft decision integration [J].
Etemad, K ;
Doermann, D ;
Chellappa, R .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (01) :92-96
[4]   Efficient region tracking with parametric models of geometry and illumination [J].
Hager, GD ;
Belhumeur, PN .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (10) :1025-1039
[5]  
HEMANDO J, 1994, SIGNAL PROCESS, V36, P393
[6]  
Jain A. K., 1992, Machine Vision and Applications, V5, P169, DOI 10.1007/BF02626996
[7]  
Jain AK, 1998, INT C PATT RECOG, P1497, DOI 10.1109/ICPR.1998.711990
[8]  
Kim S.K., 1996, P INT C IM PROC, V2, P661, DOI DOI 10.1109/ICIP.1996.560964
[9]   Archiving, indexing, and retrieval of video in the compressed domain [J].
Kobla, V ;
Doermann, D ;
Lin, KI .
MULTIMEDIA STORAGE AND ARCHIVING SYSTEMS, 1996, 2916 :78-89
[10]  
LI H, 1998, LAMPTR028 CARTR900