Fast and robust text detection in images and video frames

被引:217
作者
Ye, QX
Huang, QM
Gao, W
Zhao, DB
机构
[1] Chinese Acad Sci, Comp Technol Inst, Beijing 100864, Peoples R China
[2] Chinese Acad Sci, Grad Sch, Beijing 100864, Peoples R China
[3] Harbin Inst Technol, Dept Comp Sci, Harbin 150006, Peoples R China
关键词
text detection; multiscale wavelet feature; feature combination; SVM classification;
D O I
10.1016/j.imavis.2005.01.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text in images and video frames carries important information for visual content understanding and retrieval. In this paper, by using multiscale wavelet features, we propose a novel coarse-to-fine algorithm that is able to locate text lines even under complex background. First. in the coarse detection, after the wavelet energy feature is calculated to locate all possible text pixels, a density-based region growing method is developed to connect these pixels into regions which are further separated into candidate text lines by structural information. Secondly, in the fine detection, with four kinds of texture features extracted to represent the texture pattern of a text line, a forward search algorithm is applied to select the most effective features. Finally, an SVM classifier is used to identify true text from the candidates based on the selected features. Experimental results show that this approach can fast and robustly detect text lines under various conditions. (c) 2005 Elsevier B.V. All rights reserved.
引用
收藏
页码:565 / 576
页数:12
相关论文
共 32 条
[1]  
[Anonymous], 1995, CMUCS95186
[2]  
Chen DT, 2001, PROC CVPR IEEE, P621
[3]  
Chen DT, 2002, INT C PATT RECOG, P227, DOI 10.1109/ICPR.2002.1047438
[4]   ORTHONORMAL BASES OF COMPACTLY SUPPORTED WAVELETS [J].
DAUBECHIES, I .
COMMUNICATIONS ON PURE AND APPLIED MATHEMATICS, 1988, 41 (07) :909-996
[5]  
FURHT B, 1995, VIDEO IMAGE PROCESSI, P226
[6]  
Heisele B, 2001, PROC CVPR IEEE, P18
[7]   An automatic performance evaluation protocol for video text detection algorithms [J].
Hua, XS ;
Liu, WY ;
Zhang, HJ .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2004, 14 (04) :498-507
[8]  
HUA XS, 2002, INT C IM PROC NEW YO, P22
[9]   Automatic text location in images and video frames [J].
Jain, AK ;
Yu, B .
PATTERN RECOGNITION, 1998, 31 (12) :2055-2076
[10]  
JAIN AK, 2001, IEEE T PAMI, V2, P4