Detecting text in natural scene images with conditional clustering and convolution neural network

被引:3
作者
Zhu, Anna [1 ]
Wang, Guoyou [1 ]
Dong, Yangbo [1 ]
Iwana, Brian Kenji [2 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Automat, State Key Lab Multispectral Informat Proc Technol, Wuhan 430074, Peoples R China
[2] Kyushu Univ, Human Interface Lab, Informat Sci & Elect Engn, Nishi Ku, Fukuoka 8190395, Japan
关键词
scene text detection; conditional clustering; convolutional neural network; character's postprobability; EXTRACTION;
D O I
10.1117/1.JEI.24.5.053019
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We present a robust method of detecting text in natural scenes. The work consists of four parts. First, automatically partition the images into different layers based on conditional clustering. The clustering operates in two sequential ways. One has a constrained clustering center and conditional determined cluster numbers, which generate small-size subregions. The other has fixed cluster numbers, which generate full-size subregions. After the clustering, we obtain a bunch of connected components (CCs) in each subregion. In the second step, the convolutional neural network (CNN) is used to classify those CCs to character components or noncharacter ones. The output score of the CNN can be transferred to the postprobability of characters. Then we group the candidate characters into text strings based on the probability and location. Finally, we use a verification step. We choose a multichannel strategy to evaluate the performance on the public datasets: ICDAR2011 and ICDAR2013. The experimental results demonstrate that our algorithm achieves a superior performance compared with the state-of-the-art text detection algorithms. (C) 2015 SPIE and IS&T
引用
收藏
页数:10
相关论文
共 33 条
[1]  
[Anonymous], 18 IEEE INT C IM PRO
[2]  
[Anonymous], SIGNAL PROCESS
[3]   PhotoOCR: Reading Text in Uncontrolled Conditions [J].
Bissacco, Alessandro ;
Cummins, Mark ;
Netzer, Yuval ;
Neven, Hartmut .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :785-792
[5]  
Chen XR, 2004, PROC CVPR IEEE, P366
[6]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[7]  
de Campos TE, 2009, VISAPP 2009: PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 2, P273
[8]  
Fairchild M. D., 2013, Color Appearance Models
[9]  
He K., 2015, P IEEE INT C COMP VI
[10]  
Huang WL, 2014, LECT NOTES COMPUT SC, V8692, P497, DOI 10.1007/978-3-319-10593-2_33