Integrating multiple character proposals for robust scene text extraction

被引:24
作者
Lee, SeongHun [1 ]
Kim, Jin Hyung [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Taejon 305701, South Korea
关键词
Scene text extraction; Two-stage CRF models; Multiple image segmentations; Component; Character proposal; OBJECT DETECTION; COLOR; LOCALIZATION; RECOGNITION;
D O I
10.1016/j.imavis.2013.08.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text contained in scene images provides the semantic context of the images. For that reason, robust extraction of text regions is essential for successful scene text understanding. However, separating text pixels from scene images still remains as a challenging issue because of uncontrolled lighting conditions and complex backgrounds. In this paper, we propose a two-stage conditional random field (TCRF) approach to robustly extract text regions from the scene images. The proposed approach models the spatial and hierarchical structures of the scene text, and it finds text regions based on the scene text model. In the first stage, the system generates multiple character proposals for the given image by using multiple image segmentations and a local CRF model. In the second stage, the system selectively integrates the generated character proposals to determine proper character regions by using a holistic CRF model. Through the TCRF approach, we cast the scene text separation problem as a probabilistic labeling problem, which yields the optimal label configuration of pixels that maximizes the conditional probability of the given image. Experimental results indicate that our framework exhibits good performance in the case of the public databases. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:823 / 840
页数:18
相关论文
共 38 条
[1]  
[Anonymous], P 21 INT C MACH LEAR
[2]  
[Anonymous], 2005, P MULT INF RETR WORK
[3]  
[Anonymous], 2001, PROC 18 INT C MACH L
[4]  
Berkhin P, 2006, GROUPING MULTIDIMENSIONAL DATA: RECENT ADVANCES IN CLUSTERING, P25
[5]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[6]  
Davidson I., 2005, P 5 INT C DAT MIN, P138
[7]  
Egyul Kim, 2009, 2009 10th International Conference on Document Analysis and Recognition (ICDAR), P166, DOI 10.1109/ICDAR.2009.21
[8]  
Epshtein B, 2010, PROC CVPR IEEE, P2963, DOI 10.1109/CVPR.2010.5540041
[9]  
Freund Y., 1996, Machine Learning. Proceedings of the Thirteenth International Conference (ICML '96), P148
[10]   Advancing content-based image retrieval by exploiting image color and region features [J].
Gong, YH .
MULTIMEDIA SYSTEMS, 1999, 7 (06) :449-457